PigStorage, the default function for the LOAD operator, uses a tab
character ('\t') as the default field delimiter.
http://hadoop.apache.org/pig/docs/r0.7.0/piglatin_ref2.html#PigStorage

So, if tmp.dat were tab-separated, as in:

1	2	{(3,4),(5,6)}
1	2	{(3,4),(5,6),(10,12)}
7	8	{(3,4)}

your load operator should just work.

Thanks,
Akira

On Thu, May 27, 2010 at 2:48 AM, prasenjit mukherjee <[email protected]> wrote:
> I am trying this load statement to load the following data:
> r = load 'tmp.dat' AS (f1:int, f2:int, B: bag { T: tuple (g1:int, g2:int) });
>
> 1,2,{(3,4),(5,6)}
> 1,2,{(3,4),(5,6),(10,12)}
> 7,8,{(3,4)}
>
> But it is not working. Any help?
>
> -Thanks,
> Prasen
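
For anyone landing on this thread with the same comma-separated file: one way to get the tab-separated layout described above is to rewrite only the top-level commas as tabs and leave the commas inside the bag and its tuples alone. The following is a minimal sketch of that idea in Python, not something from the Pig distribution; the function name commas_to_tabs is hypothetical, and it assumes the braces and parentheses in each line are balanced.

```python
def commas_to_tabs(line):
    """Replace commas outside {...} and (...) with tabs.

    Commas nested inside a bag or tuple are kept, since they are part of
    the bag/tuple syntax rather than field separators.
    Assumes brackets in the line are balanced (an assumption of this sketch).
    """
    out = []
    depth = 0  # current nesting level of { and (
    for ch in line:
        if ch in "{(":
            depth += 1
        elif ch in "})":
            depth -= 1
        if ch == "," and depth == 0:
            out.append("\t")  # top-level comma: field separator
        else:
            out.append(ch)
    return "".join(out)
```

Applied to each line of the original tmp.dat, this should produce a file in which f1, f2 and the bag are separated by tabs, which is the layout the default PigStorage delimiter expects.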
