Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The following page has been changed by MatthewHolt:
http://wiki.apache.org/nutch/IntranetRecrawl

------------------------------------------------------------------------------
  Here are a couple of scripts for recrawling your Intranet.

- = Version 0.7.2 =
+ == Version 0.7.2 ==

  Place in the main nutch directory and run.

- == Example Usage ==
+ === Example Usage ===

  ./recrawl crawl 10 31

- == Script ==
+ === Script ===

  #!/bin/bash

@@ -70, +70 @@

  # Merge indexes
  ls -d $segments_dir/* | xargs bin/nutch merge $index_dir

- = Version 0.8.0 =
+ == Version 0.8.0 ==

  Place in the bin sub-directory within Nutch and run.

- == Example Usage ==
+ === Example Usage ===

  ./usr/local/nutch/bin/recrawl /usr/local/tomcat/webapps/ROOT /usr/local/nutch/crawl 10 30

- == Code ==
+ === Code ===

  #!/bin/bash

  # Nutch recrawl script.

_______________________________________________
Nutch-cvs mailing list
Nutch-cvs@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nutch-cvs