Olive g wrote:
Hi gurus,
I posted questions on how to do incremental crawls on 0.8 a few days
ago and thank you all for your help. However, when I tried to
workaround (see
http://www.mail-archive.com/nutch-user%40lucene.apache.org/msg04111.html),
inverlinks crashed when there were more than 5 input parts.
You should understand very clearly that what you are doing is NOT
supported and very non-standard. It might (or might not) have worked as
a one time workaround to get you out of trouble.
Nutch DOES support incremental crawling and indexing, and the way it
does is described in the tutorial
(http://wiki.apache.org/nutch/NutchTutorial). Please follow the tutorial
where it says about "Step-by-Step or Whole-web Crawling" - you will save
yourself (and us) a lot of grief.
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com
-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general