data.txt can be:
({(1,2,3),(4,5,6)})
({(7,8,9),(10,11,12)}
Also bag can be nested, eg:
A = LOAD 'data.txt' AS (B: bag {t:tuple(BB: bag {tt:tuple(t1:int, t2:int,
t3:int)})});
data.txt:
{({(1,2,3),(4,5,6)}),({(7,8,9),(10,11,12)})}
Daniel
On 06/14/2011 12:46 PM, Saumitra Shahapure wrote:
Hello,
When we have LOAD clause with Bag as its member, what type of input file
structure is expected? Can default PigStorage() function handle that?
e.g. in A = LOAD 'data.txt' AS (B: bag {T: tuple(t1:int, t2:int,
t3:int)});
What structure of data.txt is expected? Is it possible to write StoreFunc
in
this case?
Also if we have multidimensional data (like 2D n*n matrix, n varies with
input), can we expect Bag which contains each row as nested Bag?