Olive g wrote:
Hi gurus,

I posted questions on how to do incremental crawls on 0.8 a few days ago and thank you all for your help. However, when I tried to workaround (see http://www.mail-archive.com/nutch-user%40lucene.apache.org/msg04111.html), inverlinks crashed when there were more than 5 input parts.


You should understand very clearly that what you are doing is NOT supported and very non-standard. It might (or might not) have worked as a one time workaround to get you out of trouble.

Nutch DOES support incremental crawling and indexing, and the way it does is described in the tutorial (http://wiki.apache.org/nutch/NutchTutorial). Please follow the tutorial where it says about "Step-by-Step or Whole-web Crawling" - you will save yourself (and us) a lot of grief.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to