Also, pig passes on the string to HDFS, which supports globs, so this works:
raw = LOAD '/input_data/dir{1,2,3}'
On May 7, 2010, at 4:14 PM, Richard Ding wrote:
> This feature is supported since 0.6 (PIG-1071). But the correct form is
>
> raw = LOAD '/input_data/dir1,/input_data/dir2,/input_data/dir3' USING
> PigStorage ('\t') AS (.....);
>
> Thanks,
> -Richard
> -----Original Message-----
> From: Katukuri, Jay [mailto:[email protected]]
> Sent: Friday, May 07, 2010 3:53 PM
> To: [email protected]
> Subject: List of directories in Load
>
> Hi all,
> Is it possible to specify multiple HDFS directories in 'Load' function.
> Ex: raw = LOAD '/input_data/dir1', '/input_data/dir2',
> '/input_data/dir3' USING PigStorage ('\t') AS (.....);
>
> Thanks,
> Jay