Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by MatthewHolt:
http://wiki.apache.org/nutch/IntranetRecrawl

------------------------------------------------------------------------------
  
  Here are a couple of scripts for recrawling your Intranet.
  
- = Version 0.7.2 = 
+ == Version 0.7.2 ==
  Place in the main nutch directory and run.
  
- == Example Usage ==
+ === Example Usage ===
  ./recrawl crawl 10 31
  
- == Script ==
+ === Script ===
  
  #!/bin/bash
  
@@ -70, +70 @@

  # Merge indexes
  ls -d $segments_dir/* | xargs bin/nutch merge $index_dir
  
- = Version 0.8.0 = 
+ == Version 0.8.0 ==
  Place in the bin sub-directory within Nutch and run.
  
- == Example Usage ==
+ === Example Usage ===
  ./usr/local/nutch/bin/recrawl /usr/local/tomcat/webapps/ROOT /usr/local/nutch/crawl 10 30
  
- == Code ==
+ === Code ===
  #!/bin/bash
  
  # Nutch recrawl script.

_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs
