Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by RichardBraman:
http://wiki.apache.org/nutch/NutchTutorial

------------------------------------------------------------------------------
  
  == Step-by-Step or Whole-web Crawling ==
  
- Whole-web crawling is designed to handle very large crawls which may take 
weeks to complete, running on multiple machines.  This also permits more 
control over the crawl process, and incremental crawling.
+ Whole-web crawling is designed to handle very large crawls which may take 
weeks to complete, running on multiple machines.  This also permits more 
control over the crawl process, and incremental crawling.  It is important to 
note that whole web crawling does not necessarily mean crawling the entire 
world wide web.  We can limit a whole web crawl to just a list of the URLs we 
want to crawl.  This is done by using a filter just like we the one we used 
when we did the crawl command (above).
  
  === Step-by-Step: Concepts ===
  

Reply via email to