Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by SebastienLeCallonnec:
http://wiki.apache.org/nutch/FAQ

The comment on the change is:
Corrected layout mistake.

------------------------------------------------------------------------------
    * Now you could either generate new segments. Maybe you whould use -adddays 
to allow bin/nutch generate to put all the urls in the new fetchlist again. Add 
more then 7 days if you did not make a updatedb.
    * Or send the process a unix STOP signal. You should be able to index the 
part of the segment for crawling which is allready fetched. Then later send a 
CONT signal to the process. Do not turn off your computer between! :)
  
- '''How can I force fetcher to use custom nutch-config?
+ '''How can I force fetcher to use custom nutch-config?'''
    * Create a new sub-directory under $NUTCH_HOME/conf, like conf/myconfig
    * Copy these files from $NUTCH_HOME/conf to the new directory: 
common-terms.utf8, mime-types.*, nutch-conf.xsl, nutch-default.xml, 
regex-normalize.xml, regex-urlfilter.txt
    * Modify the nutch-default.xml to suite your needs

Reply via email to