Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "Tutorial on incremental crawling" page has been changed by Gabriele 
Kahlout.
http://wiki.apache.org/nutch/Tutorial%20on%20incremental%20crawling?action=diff&rev1=6&rev2=7

--------------------------------------------------

  # 1. $ mv whole-web-crawling-incremental 
$NUTCH_HOME/whole-web-crawling-incremental
  # 2. $ cd $NUTCH_HOME
  # 3. $ chmod +x whole-web-crawling-incremental
- # 4. $ ./whole-web-crawling-incremental
+ # 4. $ ./whole-web-crawling-incremental seeds 5 2
  
- # Usage: ./whole-web-crawling-incremental [it_seedsDir-path 
urls-to-fetch-per-iteration depth]
+ # Usage: ./whole-web-crawling-incremental it_seedsDir-path 
urls-to-fetch-per-iteration depth
  # Start
  
  rm -r crawl # fresh crawl

Reply via email to