Hi,
After some research I am able to find out why the problem of crawling with
jabong page was there.

Actually, when we use nutch we have to configure it first. Initially, there
are some default configurations set in nutch-deault.xml present in conf
directory. You have to set the file content limit to -1. Initially there was
some length parameter specified So it was not actually parsing the whole
page. Only that much length was parsed. That's why we miss some of the links
to next pages.



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Nutch-not-crawling-jabong-tp3857630p4010062.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to