Nope. I ran exactly the commands listed. And like I said, the ones that show a different batch id were URLs that I didn't inject, so I have no idea how they got in there.
On Tue, Jul 31, 2012 at 1:44 PM, <[email protected]> wrote:

> Hi,
>
> Most likely you run generate command a few times and did not run updatedb.
> So, each generate command assigned different batchIds to its own set of
> urls.
>
> Alex.
>
> -----Original Message-----
> From: Bai Shen <[email protected]>
> To: user <[email protected]>
> Sent: Tue, Jul 31, 2012 10:26 am
> Subject: Re: Different batch id
>
> Is there a specific place it's located? I turned on debugging, but I'm not
> seeing a batch id.
>
> On Mon, Jul 30, 2012 at 1:14 PM, Lewis John Mcgibbney <
> [email protected]> wrote:
>
> > Can you stick on debug logging and see what the batch ID's actually are?
> >
> > On Mon, Jul 30, 2012 at 6:12 PM, Bai Shen <[email protected]> wrote:
> > > I set up Nutch 2.x with a new instance of HBase. I ran the following
> > > commands.
> > >
> > > bin/nutch inject urls
> > > bin/nutch generate -topN 1000
> > > bin/nutch fetch -all
> > > bin/nutch parse -all
> > >
> > > When looking at the parse log, I'm seeing a bunch of "different batch id"
> > > messages. These are all on urls that I did not inject into the database.
> > >
> > > Any ideas what's causing this?
> > >
> > > Thanks.
> > >
> >
> > --
> > Lewis

