Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by MatthewHolt:
http://wiki.apache.org/nutch/IntranetRecrawl

------------------------------------------------------------------------------
  
  Here are a couple of scripts for recrawling your Intranet.
  
- = Version 0.7.2 = 
+ == Version 0.7.2 ==
  Place in the main nutch directory and run.
  
- == Example Usage ==
+ === Example Usage ===
  ./recrawl crawl 10 31
  
- == Script ==
+ === Script ===
  
  #!/bin/bash
  
@@ -70, +70 @@

  # Merge indexes
  ls -d $segments_dir/* | xargs bin/nutch merge $index_dir
  
- = Version 0.8.0 = 
+ == Version 0.8.0 ==
  Place in the bin sub-directory within Nutch and run.
  
- == Example Usage ==
+ === Example Usage ===
  ./usr/local/nutch/bin/recrawl /usr/local/tomcat/webapps/ROOT /usr/local/nutch/crawl 10 30
  
- == Code ==
+ === Code ===
  #!/bin/bash
  
  # Nutch recrawl script.
