I said 64MB, but it can be 128MB, or 5KB. It doesn't matter the number. I just want to extract data and put into several files with specific size. Basically, I am doing a cat to a big txt file, and I want to split the content into multiple files with a fixed size.
On 7 June 2013 10:14, Johnny Zhang <[email protected]> wrote: > Pedro, you can try Piggybank MultiStorage, which split results into > different dir/files by specific index attribute. But not sure how it can > make sure the file size is 64MB. Why 64MB specifically? what's the > connection between your data and 64MB? > > Johnny > > > On Fri, Jun 7, 2013 at 12:56 AM, Pedro Sá da Costa <[email protected] > >wrote: > > > I am using the instruction: > > > > store A into 'result-australia-0' using PigStorage('\t'); > > > > to store the data in HDFS. But the problem is that, this creates 1 file > > with 500MB of size. Instead, want to save several 64MB files. How I do > > this? > > > > -- > > Best regards, > > > -- Best regards,
