[Nutch Wiki] Trivial Update of Crawl by susam
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification.

The Crawl page has been changed by susam.
The comment on this change is: Fixed wiki markup for codes.
http://wiki.apache.org/nutch/Crawl?action=diffrev1=8rev2=9

--
  === NUTCH_HOME ===
  If you are not executing the script as 'bin/runbot' from Nutch directory, you should either set the environment variable 'NUTCH_HOME' or edit the following in the script:-

+ {{{
- {{{if [ -z $NUTCH_HOME ]
+ if [ -z $NUTCH_HOME ]
  then
- NUTCH_HOME=.}}}
+ NUTCH_HOME=.
+ }}}

  Set 'NUTCH_HOME' to the path of the Nutch directory (if you are not setting it as an environment variable, since if environment variable is set, the above assignment is ignored).

  === CATALINA_HOME ===
  'CATALINA_HOME' points to the Tomcat installation directory. You must either set this as an environment variable or set it by editing the following lines in the script:-

+ {{{
- {{{if [ -z $CATALINA_HOME ]
+ if [ -z $CATALINA_HOME ]
  then
- CATALINA_HOME=/opt/apache-tomcat-6.0.10}}}
+ CATALINA_HOME=/opt/apache-tomcat-6.0.10
+ }}}

  Similar to the previous section, if this variable is set in the environment, then the above assignment is ignored.
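The fallback behaviour described for 'NUTCH_HOME' and 'CATALINA_HOME' (a value from the environment wins, otherwise the script's default is used) can be sketched in isolation as below. This is only an illustration, not the text of the script on the wiki page: the helper name resolve_home is hypothetical, and the variables are quoted so that paths containing spaces do not break the test.

```shell
#!/bin/sh
# Minimal sketch of the "use the environment value, else a default" pattern.
# resolve_home is a hypothetical helper, not part of the wiki script.
resolve_home() {
    # $1: value taken from the environment (may be empty)
    # $2: default to fall back on
    if [ -z "$1" ]
    then
        printf '%s\n' "$2"
    else
        printf '%s\n' "$1"
    fi
}

# ${VAR:-} expands to an empty string when VAR is unset, so this also
# works under 'set -u'.
NUTCH_HOME=$(resolve_home "${NUTCH_HOME:-}" ".")
CATALINA_HOME=$(resolve_home "${CATALINA_HOME:-}" "/opt/apache-tomcat-6.0.10")

echo "NUTCH_HOME=$NUTCH_HOME"
echo "CATALINA_HOME=$CATALINA_HOME"
```

Running it as `NUTCH_HOME=/srv/nutch sh sketch.sh` (both the path and the file name are made up for the example) should report the value from the environment, while running it with the variable unset should report the default.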
[Nutch Wiki] Trivial Update of Crawl by susam
Dear Wiki user,

You have subscribed to a wiki page or wiki category on Nutch Wiki for change notification.

The Crawl page has been changed by susam.
The comment on this change is: Fixed typo.
http://wiki.apache.org/nutch/Crawl?action=diffrev1=9rev2=10

--
  Similar to the previous section, if this variable is set in the environment, then the above assignment is ignored.

  == Can it re-crawl? ==
- The author has used this script to re-crawl a couple of times. However, no real world testing has been done for re-crawling. Therefore, you may try to use the script of re-crawl. If it works out fine or it doesn't work properly for re-crawl, please let us know.
+ The author has used this script to re-crawl a couple of times. However, no real world testing has been done for re-crawling. Therefore, you may try to use the script for re-crawl. If it works fine or it doesn't work properly for re-crawl, please let us know.

  == Script ==
  {{{