Nicolás Lichtmaier wrote:

I'd like to limit nutch to fetch, refetch and index just the injected URLs. Will setting db.max.outlinks.per.page to 0 enable me to do that? If not... how could achive what I'm looking to?
You need to run "updatedb" with "-noAdditions" switch.

That doesn't work. And in the code, in org.apache.nutch.crawl.CrawlDb's main method there's absolutely no handling of any parameter.
How could I achive this?

Perhaps you should start from reporting which version you are using ... The version in trunk/ certainly supports this argument. The version in 0.8.1 does not support it, but it's easy to add.

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to