Check out the SPLIT function: https://pig.apache.org/docs/r0.14.0/basic.html#SPLIT
Split your input into two projections and store them into different files. On Mon, Dec 15, 2014 at 12:41 PM, Margus Roo <mar...@roo.ee> wrote: > Hi > > I have files contain timestamp. I'd like to parse row by row and put them > into file by timestamp. > in example > > original file: > 20140801,...,...,...,...,... > 20140802,...,...,...,...,... > 20140801,...,...,...,...,... > ... > > So I'd like to parse this rows to separate files 20140801 and 20140802 so > that file > 20140801.csv contains: > 20140801,...,...,...,...,... > 20140801,...,...,...,...,... > > and 20140802.csv contains > 20140802,...,...,...,...,... > > I tried to write my own custom StorageFunc but as much I understand I can > not do it there. > I read about MultyStorage maybe this is the right tool to try? Or Pig > totally wrong tool for that problem? > > -- > Margus (margusja) Roo > http://margus.roo.ee > skype: margusja > +372 51 480 > >