If you read up the thread, you'll see that the issue concerns pages with redirects.
I didn't have time to investigate this deeply, so I decided to treat it as a Nutch issue and modify the Nutch code. For the changes I made, see the attached diff file:
http://old.nabble.com/file/p26543086/fetcher.diff

J. Smith wrote:
>
> Yes, please. I'll be very grateful.
> But I'm also curious why this is happening... Maybe someone can explain?
>

--
View this message in context: http://old.nabble.com/Nutch-indexes-less-pages%2C-then-it-fetches-tp26078798p26543086.html
Sent from the Nutch - User mailing list archive at Nabble.com.