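If you need it for something other than a join, you could write a UDF that uses the distributed cache. The UDF can read the cached file on its first call and build the HashSet in memory from it. See the section "Loading the Distributed Cache" at http://ofps.oreilly.com/titles/9781449302641/writing_udfs.html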
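Roughly, the pattern is "load the cached file into a HashSet the first time you are called, then reuse it". Here is a minimal sketch of just that part in plain Java, so it runs without Pig on the classpath. The class name CachedSetLookup and the one-key-per-line file layout are my own assumptions for illustration, not anything from Pig or from the book.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.HashSet;
import java.util.Set;

/**
 * Hypothetical helper: lazily loads a local file into an in-memory HashSet.
 * Assumes the file holds one key per line (an assumption for this sketch).
 */
final class CachedSetLookup {
    // Local path of the cached file; on a task node this would be the
    // symlink name the distributed cache created in the working directory.
    private final Path localFile;

    // Stays null until the first lookup, so the file is read only once.
    private Set<String> lookup;

    CachedSetLookup(String localName) {
        this.localFile = Paths.get(localName);
    }

    /** Returns true if the key is present in the cached file. */
    boolean contains(String key) throws IOException {
        if (key == null) {
            return false;
        }
        if (lookup == null) {
            lookup = load(localFile); // first call: read the file into memory
        }
        return lookup.contains(key);
    }

    /** Reads every non-blank line of the file, trimmed, into a HashSet. */
    static Set<String> load(Path file) throws IOException {
        Set<String> set = new HashSet<>();
        try (BufferedReader in = Files.newBufferedReader(file, StandardCharsets.UTF_8)) {
            String line;
            while ((line = in.readLine()) != null) {
                String key = line.trim();
                if (!key.isEmpty()) {
                    set.add(key);
                }
            }
        }
        return set;
    }
}
```

In an actual Pig UDF, as far as I recall from that section, you would extend EvalFunc, override getCacheFiles() to return entries of the form "hdfs_path#symlink_name", and do the lazy load inside exec() by opening "./symlink_name". Please check the exact signatures against the Pig version you are running.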

On Wed, Nov 14, 2012 at 11:44 AM, Ruslan Al-Fakikh <[email protected]> wrote:

> Maybe this is what you are looking for:
> http://ofps.oreilly.com/titles/9781449302641/advanced_pig_latin.html
> see "Replicated join"
>
> On Tue, Nov 13, 2012 at 11:46 AM, yingnan.ma <[email protected]> wrote:
>
> > Hi,
> >
> > I used the distributed cache in the hadoop though the "setup" and "static"
> > store an hashset in the mem;
> >
> > and I try to use the distributed cache in the Pig, and I don't know how to
> > store an hashset in the mem, I just can cache the file in the mem.
> >
> > Any advise would be fine, Thank you so much!
> >
> > Best Regards
> >
> > Malone
> >
> > 2012-11-13
