Nope.  I ran exactly the listed commands.  And like I said, the ones that
show a different batch id were urls that I didn't inject.  So no idea how
they got in there.

On Tue, Jul 31, 2012 at 1:44 PM, <[email protected]> wrote:

> Hi,
>
> Most likely you run generate command a few times and did not run updatedb.
> So, each generate command assigned different batchId s to its own set of
> urls.
>
> Alex.
>
>
>
> -----Original Message-----
> From: Bai Shen <[email protected]>
> To: user <[email protected]>
> Sent: Tue, Jul 31, 2012 10:26 am
> Subject: Re: Different batch id
>
>
> Is there a specific place it's located?  I turned on debugging, but I'm not
> seeing a batch id.
>
> On Mon, Jul 30, 2012 at 1:14 PM, Lewis John Mcgibbney <
> [email protected]> wrote:
>
> > Can you stick on debug logging and see what the batch ID's actually are?
> >
> > On Mon, Jul 30, 2012 at 6:12 PM, Bai Shen <[email protected]>
> wrote:
> > > I set up Nutch 2.x with a new instance of HBase.  I ran the following
> > > commands.
> > >
> > > bin/nutch inject urls
> > > bin/nutch generate -topN 1000
> > > bin/nutch fetch -all
> > > bin/nutch parse -all
> > >
> > > When looking at the parse log, I'm seeing a bunch of "different batch
> id"
> > > messages.  These are all on urls that I did not inject into the
> database.
> > >
> > > Any ideas what's causing this?
> > >
> > > Thanks.
> >
> >
> >
> > --
> > Lewis
> >
>
>
>

Reply via email to