I don't have my script handy but all I am doing is something like:

A = LOAD $in using PigStorage("\t") as (col:chararray, col2:chararray);
STORE A INTO '{Table}' USING using
com.vertica.pig.VerticaStorer(‘localhost’,'verticadb502′,’5935′, 'user');


When I run as pig -f script6.pig -p in="/examples/2/part-m-0000[0-4]" it
creates 2 rows

but if I run them individually 4 times giving the actual file names then it
doesn't have any duplicates
On Sat, Mar 24, 2012 at 1:36 PM, Bill Graham <[email protected]> wrote:

> Can you provide the script you're running? That will help people better
> understand what you're doing.
>
> On Saturday, March 24, 2012, Mohit Anchlia <[email protected]> wrote:
> > Could someone please help me understand or give some pointers to me,
> >
> > On Fri, Mar 23, 2012 at 4:57 PM, Mohit Anchlia <[email protected]
> >wrote:
> >
> >> I am running a script to load data in the database. When I use [0-4] I
> see
> >> 2 rows being created for every record that I process. But when I run
> them
> >> individually then it works. Could someone please help me understand or
> >> troubleshoot this behaviour?
> >>
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-0000[0-4]" --creates 2 rows
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-00000 --works
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-00001 --works
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-00002 --works
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-00003 --works
> >>
> >> pig -f script6.pig -p in="/examples/2/part-m-00004 --works
> >>
> >
>
> --
> *Note that I'm no longer using my Yahoo! email address. Please email me at
> [email protected] going forward.*
>

Reply via email to