如何使用 PIG 脚本获取两个纪元时间值之间的毫秒数
How to get the MilliSeconds between two Epoch time values using PIG script
Game_ID |开始时间 |结束时间
1 | 1235000140| 1235002457
2 | 1235000377| 1235003300
3 | 1235000414| 1235056128
1 | 1235000414| 1235056128
2 | 1235000377| 1235003300
这里我想获取两个 Epoch 时间字段 BeginTime 和 EndTime 之间的毫秒数。然后计算每场比赛的平均时间。
games = load 'games.txt' using PigStorage('|') as (gameid: int, begin_time: long, end_time:long);
dump games;
(1,1235000140,1235002457)
(2,1235000377,1235003300)
(3,1235000414,1235056128)
(1,1235000414,1235056128)
(2,1235000377,1235003300)
第一步:计算时差
difference = foreach games generate gameid, end_time - begin_time as time_lapse;
dump difference;
(1,2317)
(2,2923)
(3,55714)
(1,55714)
(2,2923)
步骤 2: 将数据分组 Game_ID
game_group = group difference by gameid;
dump game_group;
(1,{(1,55714),(1,2317)})
(2,{(2,2923),(2,2923)})
(3,{(3,55714)})
第 3 步: 然后求平均值
average = foreach game_group generate group, AVG(difference.time_lapse);
dump average;
(1,29015.5)
(2,2923.0)
(3,55714.0)
Game_ID |开始时间 |结束时间
1 | 1235000140| 1235002457
2 | 1235000377| 1235003300
3 | 1235000414| 1235056128
1 | 1235000414| 1235056128
2 | 1235000377| 1235003300
这里我想获取两个 Epoch 时间字段 BeginTime 和 EndTime 之间的毫秒数。然后计算每场比赛的平均时间。
games = load 'games.txt' using PigStorage('|') as (gameid: int, begin_time: long, end_time:long);
dump games;
(1,1235000140,1235002457)
(2,1235000377,1235003300)
(3,1235000414,1235056128)
(1,1235000414,1235056128)
(2,1235000377,1235003300)
第一步:计算时差
difference = foreach games generate gameid, end_time - begin_time as time_lapse;
dump difference;
(1,2317)
(2,2923)
(3,55714)
(1,55714)
(2,2923)
步骤 2: 将数据分组 Game_ID
game_group = group difference by gameid;
dump game_group;
(1,{(1,55714),(1,2317)})
(2,{(2,2923),(2,2923)})
(3,{(3,55714)})
第 3 步: 然后求平均值
average = foreach game_group generate group, AVG(difference.time_lapse);
dump average;
(1,29015.5)
(2,2923.0)
(3,55714.0)