Is this like a tricky interview question? I don't see the pattern between those three numbers you listed and the sample of the table.
770011 770083 524 1e-120 89 12 770083 770011 494 1e-120 39 100 ahh, I guess these are related because id1=id2 an id2=id1... Here's a first pass at the problem. Project: P1 = foreach table generate id1 as id1, id2 as id2, *; P2 = foreach table generate id2 as id1, id1 as id2, *; J = join P1 by (id1, id2), P2 by (id1,id2); and now J contains pairs of rows from original table where id1 and id2 are reversed. is this what you want? On Wed, Jun 9, 2010 at 6:54 PM, Renato Marroquín Mogrovejo < [email protected]> wrote: > Hi everyone, today I came across with a particular query that I don't know > how to model in PIG. Part of my data looks like this: > > Id1 Id2 Sc Va P1 P2 > --------- --------- ----- --------- ----- ---- > 770011 990201 401 1e-125 100 65 > 990201 770011 440 1e-125 100 42 > 770011 770083 524 1e-120 89 12 > 770083 770011 494 1e-120 39 100 > 990201 770083 341 1e-125 73 41 > 770083 990201 421 1e-125 90 85 > . > . > . > > what I would like to retrieve is something like > this: 770011 990201 770083 > because they are records actually related. > Any kind of ideas are highly appreciated. Thanks in advanced. > > Renato M. >
