Check out the SPLIT operator:
https://pig.apache.org/docs/r0.14.0/basic.html#SPLIT

Split your input into one relation per date and store each relation in a
separate output location.
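
A minimal sketch of that idea, assuming the two dates from your example. The
input path, output paths, and relation names are hypothetical, and the whole
line is loaded as one field so the rows are written back unchanged:

```pig
-- Load each row as a single chararray; 'input' is a hypothetical path.
rows = LOAD 'input' USING TextLoader() AS (line:chararray);

-- Route each row by the date in its first 8 characters.
SPLIT rows INTO
    d20140801 IF SUBSTRING(line, 0, 8) == '20140801',
    d20140802 IF SUBSTRING(line, 0, 8) == '20140802';

-- Each STORE writes a directory of part files, not a single .csv file.
STORE d20140801 INTO 'output/20140801';
STORE d20140802 INTO 'output/20140802';
```

Note that SPLIT needs every date listed explicitly, so this only fits a small,
known set of dates. If the dates are not known in advance, the MultiStorage
store function from piggybank that you mention may be the better fit.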

On Mon, Dec 15, 2014 at 12:41 PM, Margus Roo <mar...@roo.ee> wrote:

> Hi
>
> I have files that contain timestamps. I'd like to parse them row by row and
> write each row to a file chosen by its timestamp.
> For example:
>
> original file:
> 20140801,...,...,...,...,...
> 20140802,...,...,...,...,...
> 20140801,...,...,...,...,...
> ...
>
> So I'd like to split these rows into separate files, 20140801 and 20140802,
> so that file
> 20140801.csv contains:
> 20140801,...,...,...,...,...
> 20140801,...,...,...,...,...
>
> and 20140802.csv contains:
> 20140802,...,...,...,...,...
>
> I tried to write my own custom StoreFunc, but as far as I understand I
> cannot do it there.
> I read about MultiStorage; maybe this is the right tool to try? Or is Pig
> totally the wrong tool for this problem?
>
> --
> Margus (margusja) Roo
> http://margus.roo.ee
> skype: margusja
> +372 51 480
>
>
