Hi,

Most likely you ran the generate command several times without running updatedb in between. Each generate run then assigned a different batchId to its own set of URLs.
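As a rough sketch of what I mean, one crawl round that finishes with updatedb before the next generate could look like the following. The generate, fetch and parse lines are the ones from your mail. The function name crawl_round and the NUTCH variable are only made up for illustration, and calling updatedb without arguments is an assumption on my part: some Nutch 2.x versions may expect a batch id or -all there, so check the usage output of bin/nutch on your install.

```shell
#!/bin/sh
# Sketch of one Nutch 2.x crawl round (illustrative, not an official script).
# NUTCH is a hypothetical override so the launcher path can be changed.
NUTCH="${NUTCH:-bin/nutch}"

# crawl_round is a made-up helper name. It runs a single
# generate/fetch/parse round and stops at the first failing step.
crawl_round() {
  "$NUTCH" generate -topN 1000 &&
  "$NUTCH" fetch -all &&
  "$NUTCH" parse -all &&
  # Assumption: updatedb takes no arguments here; some versions may
  # require a batch id or -all.
  "$NUTCH" updatedb
}
```

After a single bin/nutch inject urls, you would call crawl_round once per round, so that generate is never run twice without an updatedb in between.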
Alex.

-----Original Message-----
From: Bai Shen <[email protected]>
To: user <[email protected]>
Sent: Tue, Jul 31, 2012 10:26 am
Subject: Re: Different batch id

Is there a specific place it's located? I turned on debugging, but I'm not seeing a batch id.

On Mon, Jul 30, 2012 at 1:14 PM, Lewis John Mcgibbney <[email protected]> wrote:

> Can you stick on debug logging and see what the batch ID's actually are?
>
> On Mon, Jul 30, 2012 at 6:12 PM, Bai Shen <[email protected]> wrote:
> > I set up Nutch 2.x with a new instance of HBase. I ran the following
> > commands.
> >
> > bin/nutch inject urls
> > bin/nutch generate -topN 1000
> > bin/nutch fetch -all
> > bin/nutch parse -all
> >
> > When looking at the parse log, I'm seeing a bunch of "different batch id"
> > messages. These are all on urls that I did not inject into the database.
> >
> > Any ideas what's causing this?
> >
> > Thanks.
> >
>
> --
> Lewis

