Dave,

So I should create three files, A, B and C, with contents like below (without
headers and trailers):
file A:
abcdefghijklmnop
abcdefghijklmnop
abcdefghijklmnop

file B:
abcdefghijklmnop
abcdefghijklmnop

file C:
abcdefghijklmnop
abcdefghijklmnop
abcdefghijklmnop
abcdefghijklmnop

Is there any way to split the file by line number? Or how can I split it into 3
files?
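
For reference, here is a minimal sketch of the split described above, done in plain Python outside Pig. It is not a Pig solution. It assumes every section is one header line, then the data rows, then a trailer line holding the row count, as in the sample, and that no data row consists only of digits. The function names and the output file naming are hypothetical.

```python
import os


def split_sections(lines):
    """Group lines into {header: [data rows]}, dropping headers and trailers.

    Assumption: each section is one header line, N data rows, and one
    trailer line containing the integer N.
    """
    sections = {}
    it = iter(lines)
    for header in it:
        header = header.strip()
        if not header:
            continue  # skip blank lines between sections
        rows = []
        for line in it:
            line = line.strip()
            if line.isdigit():
                # Trailer reached: check it against the rows collected.
                if int(line) != len(rows):
                    raise ValueError(
                        "section %r: trailer says %s rows, found %d"
                        % (header, line, len(rows))
                    )
                break
            rows.append(line)
        sections[header] = rows
    return sections


def write_sections(sections, out_dir):
    """Write one file per section, named after its header (hypothetical naming)."""
    for name, rows in sections.items():
        with open(os.path.join(out_dir, name), "w") as f:
            for row in rows:
                f.write(row + "\n")


if __name__ == "__main__":
    import sys

    # Usage: python split_sections.py INPUT_FILE OUTPUT_DIR
    with open(sys.argv[1]) as src:
        write_sections(split_sections(src), sys.argv[2])
```

The resulting files A, B and C contain only data rows, so they could then be loaded as Dave suggests.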

Thanks
Jumsheed



> On Jan 6, 2015, at 9:54 AM, David Warshaw <d...@cobrain.com> wrote:
> 
> Carrying headers and trailers through Pig (or really any ETL pipeline) as
> data rows will be awkward.
> De-concatenated (or pre-concatenated) files with the metadata already
> stripped out could be loaded using the PigStorage loader with the tag path
> option (-tagPath). This would allow you to differentiate the records by
> source in your script.
> 
> On Tue, Jan 6, 2015 at 9:29 AM, Jumsheed <jumsh...@gmail.com> wrote:
> 
>> Yes, I checked SPLIT and MultiStorage, but I didn't find any way to
>> group each section.
>> 
>> On Tue, Jan 6, 2015 at 8:55 AM, Shahab Yunus <shahab.yu...@gmail.com>
>> wrote:
>> 
>>> Have you looked at the SPLIT operator in Pig? Does that help?
>>> http://pig.apache.org/docs/r0.12.0/basic.html#SPLIT
>>> 
>>> Regards,
>>> Shahab
>>> 
>>> On Tue, Jan 6, 2015 at 8:51 AM, Jumsheed <jumsh...@gmail.com> wrote:
>>> 
>>>> Hi,
>>>> 
>>>> I have a file with data in the format below:
>>>> 
>>>> A
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 3
>>>> B
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 2
>>>> C
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 4
>>>> 
>>>> I need to create three files like:
>>>> 
>>>> file1:
>>>> A
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 3
>>>> 
>>>> file2:
>>>> B
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 2
>>>> 
>>>> file3:
>>>> C
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> abcdefghijklmnop
>>>> 4
>>>> 
>>>> Is there any way you can suggest?
>>>> 
>>>> Thanks
>>>> Jumsheed
>>>> 
>>> 
>> 
