Nicolás Lichtmaier wrote:
I'd like to limit Nutch to fetching, refetching and indexing just the
injected URLs. Will setting db.max.outlinks.per.page to 0 enable me
to do that? If not, how could I achieve what I'm looking for?
You need to run "updatedb" with the "-noAdditions" switch.
That doesn't work. And in the code, in
org.apache.nutch.crawl.CrawlDb's main method there's absolutely no
handling of any parameter.
How could I achieve this?
Perhaps you should start by reporting which version you are using ...
The version in trunk/ certainly supports this argument. The version in
0.8.1 does not support it, but it's easy to add.
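
As a rough illustration of what "easy to add" might look like, here is a minimal sketch of recognizing such a switch in a main method. This is not the actual Nutch code: the class name UpdateDbArgs and its fields are hypothetical, and the update job itself would still need to consult the flag and skip URLs that are not already in the crawl db.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, NOT part of Nutch: shows one way a main method
// could recognize a "-noAdditions" switch among its other arguments.
public class UpdateDbArgs {
    public final boolean additionsAllowed;
    public final List<String> positional;

    private UpdateDbArgs(boolean additionsAllowed, List<String> positional) {
        this.additionsAllowed = additionsAllowed;
        this.positional = positional;
    }

    public static UpdateDbArgs parse(String[] args) {
        // Assumed default: newly discovered URLs may be added to the db.
        boolean additionsAllowed = true;
        List<String> positional = new ArrayList<>();
        for (String arg : args) {
            if ("-noAdditions".equals(arg)) {
                additionsAllowed = false;
            } else {
                // Everything else (e.g. crawl db and segment paths) is kept in order.
                positional.add(arg);
            }
        }
        return new UpdateDbArgs(additionsAllowed, positional);
    }

    public static void main(String[] args) {
        UpdateDbArgs parsed = parse(args);
        System.out.println("additionsAllowed=" + parsed.additionsAllowed
                + " positional=" + parsed.positional);
    }
}
```

With the flag parsed this way, the remaining arguments keep their order, so the switch can appear anywhere on the command line.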
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com