Check https://cwiki.apache.org/confluence/display/PIG/FAQ#FAQ-Q%3AIloaddatafromadirectorywhichcontainsdifferentfile.HowdoIfindoutwherethedatacomesfrom%3F
Daniel On Sun, Jan 8, 2012 at 10:45 PM, Yulia Tolskaya <[email protected]> wrote: > Hello, > I am wondering if there is a way for me to load multiple files into pig, > while still keeping track of what record came from what file. To give some > background, I have about half a million files of one phrase per line, and I > need to note which document each phrase belongs to. > > Thanks for your help! > Yulia
