I've just started looking into the Nutch 2.* code, after a year of working
with Nutch 1.*.
I was very enthusiastic about the Gora integration.
Of course, it's going to take time until I'm as familiar with the 2.* code
as I am with the 1.* code.

Anyway, I'll be glad to get on board.
Regarding the shouldProcess() issue, it annoys me too. It seems odd that in a
simple test case I get so many "different batch id (null)" messages.

Regarding the second issue, it seems from my test that it does loop until it
reaches the depth.
The Fetcher just finishes with nothing new fetched, and the same happens with
the Parser, but there I get all those "different batch id (null)" messages.
Also, I didn't see where shouldStop is updated to true. But I've only made a
brief review, so maybe I missed something.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/why-is-nutch2-1-trying-to-parse-the-same-documnets-again-and-again-tp4043317p4043335.html
Sent from the Nutch - User mailing list archive at Nabble.com.
