I've just started looking into nutch2.* code, after a year working with nutch1.*. I was very enthusiastic with the gora integration. Of course, its going to take time untill I'm familiar with 2.* code as well as with 1.* code.
Anyway, i'll be glad to get on board. Regarding the shouldProccess() issue, it annoys me too. Seems odd that in a simple test case I get so many "different batch id (null)" messages. regrading the second issue, it seems from my test that it does loop until it reaches depth. Fetcher just finished with nothing new fetched, and the same with Parser but there I get all those "different batch id (null)" messages. Also, I didn't see where shouldStop is updated to true. But i've made just a brief review. maybe I missed something. -- View this message in context: http://lucene.472066.n3.nabble.com/why-is-nutch2-1-trying-to-parse-the-same-documnets-again-and-again-tp4043317p4043335.html Sent from the Nutch - User mailing list archive at Nabble.com.

