I have a feed in the following format:
Hour Key ID Value 1 K1 001 3 1 K1 002 2 2 K1 005 4 1 K2 002 1 2 K2 003 5 2 K2 004 6
and I want to group the feed (Hour, Key) , then sum the Value , but save the ID as a tuple:
({1, K1}, {001, 002}, 5) ({2, K1}, {005}, 4) ({1, K2}, {002}, 1) ({2, K2}, {003, 004}, 11)
I know how to use FLATTEN to generate the sum of Value , but I donβt know how to output ID as a tuple. This is what I have so far:
A = LOAD 'data' AS (Hour:chararray, Key:chararray, ID:chararray, Value:int); B = GROUP A BY (Hour, Key); C = FOREACH B GENERATE FLATTEN(group) AS (Hour, Key), SUM(A.Value) AS Value ;
Can you explain how to do this? Appreciate it!
source share