Hi All,

I am not using the Nutch indexer but indexing using my own utility method
after every page is fetched and I need to bypass any additional steps that
Nutch executes in a crawl .Along those line I have identified the following
steps to implement.


   1. Disable LinkDB creation by commenting out LinkDB.invert() method.
   2. Not store the fetch_content in a segment which is used to create an
   index by setting the property fetcher.store.content to false.


I am clear about #1 from discussion I have had with Sebastian earlier.

About #2 I need to know if having fetcher.store.content set to false would
be a good idea ?


Thanks.

Reply via email to