You can do a left outer join of A and B, then keep only the rows where B's field is null. See http://pig.apache.org/docs/r0.10.0/basic.html#join-outer
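
Something along these lines should work. It is only a rough sketch: the input paths ('a_users', 'b_users'), the field name user_id, and the chararray type are assumptions on my part, so adjust them to your data. Note that Pig needs a schema on both relations for an outer join.

```pig
-- Hypothetical input paths and field names; replace with your own.
A = LOAD 'a_users' AS (user_id:chararray);
B = LOAD 'b_users' AS (user_id:chararray);

-- The left outer join keeps every row of A; rows with no match in B
-- get null in B's fields.
joined = JOIN A BY user_id LEFT OUTER, B BY user_id;

-- Keep only the rows of A that had no match in B.
only_a = FILTER joined BY B::user_id IS NULL;

-- Project back down to a single user id column.
result = FOREACH only_a GENERATE A::user_id AS user_id;

DUMP result;
```

If A can contain the same user id more than once, you may want a DISTINCT on the result.
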
On Tue, Jan 22, 2013 at 4:46 AM, James Newhaven <[email protected]> wrote:

> Hi,
>
> I have two relations - A and B. Both just contain user ids.
>
> I want to get a list of users who are in A but not in B.
>
> I am running Pig 0.9.1 and think this might be possible with the DIFF
> function. I can see that DIFF requires one relation that contains the two
> bags.
>
> How can I create a relation that contains two bags so it can be supplied to
> the DIFF function?
>
> Any suggestions would be appreciated.
>
> Thanks,
> James
>

--
*Note that I'm no longer using my Yahoo! email address. Please email me at [email protected] going forward.*
