Hi All,I have the following question:
Snippet of my sample.txt. First column is id, however each row can have 
variable number of columns. 
id1 100 200 300 400 500id2 10 20 30id1 800 900 600id3 10 20 30 40 50 60 70 80 
90 100id1 1 2 3 4 5 6 7 8 9id2 40 50 60 70 80 90id3 200
sample = LOAD 'sample.txt' [how should I specify schema here]sample_grpd = 
GROUP sample by $0;sample_result = FOREACH sample_grpd generate group, 
FLATTEN(TOBAG([what should go here]))
group by id so that the result is:
id1 100 200 300 400 500 800 900 600  1 2 3 4 5 6 7 8 9id2 10 20 30 40 50 60 70 
80 90id3 10 20 30 40 50 60 70 80 90 100 200

Any help with this, will be greatly appreciated!
                                          

Reply via email to