Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.

The "NutchTutorial" page has been changed by JoeLencioni:
http://wiki.apache.org/nutch/NutchTutorial?action=diff&rev1=36&rev2=37

Comment:
Removing pre 1.3 stuff

  Try the following command:

- {{{ bin/nutch (version < 1.3) }}}
-
- or
-
- {{{runtime/local/bin/nutch (version >= 1.3) }}}
+ {{{runtime/local/bin/nutch}}}

  This will display the documentation for the Nutch command script.

@@ -184, +180 @@

  Now we're ready to search!

- == Command Line Searching (version < 1.3) ==
+ == Command Line Searching ==
- Simplest way to verify the integrity of your crawl is to launch NutchBean from command line:
+ '''''This section needs to be updated for Nutch 1.3. [[NutchTutorialPre1.3|Pre 1.3 tutorial can be found here.]]'''''

- {{{ bin/nutch org.apache.nutch.searcher.NutchBean apache }}}
+ == Installing in Tomcat ==
+ '''''This section needs to be updated for Nutch 1.3. [[NutchTutorialPre1.3|Pre 1.3 tutorial can be found here.]]'''''

- where ''apache'' is the search term (note that NutchBean will only search pages in the {{{crawl}}} directory, so if you named the crawl directory something else, NutchBean will not find any results). After you have verified that the above command returns results you can proceed to setting up the web interface.
-
- == Installing in Tomcat (version < 1.3) ==
- To search you need to put the nutch war file into your servlet container. (If instead of downloading a Nutch release you checked the sources out of SVN, then you'll first need to build the war file, with the command {{{ant war}}}.)
-
- Assuming you've unpacked Tomcat as ~/local/tomcat, then the Nutch war file may be installed with the commands:
-
- {{{
- mkdir ~/local/tomcat/webapps/nutch
- cp nutch*.war ~/local/tomcat/webapps/nutch/
- jar xvf ~/local/tomcat/webapps/nutch/nutch-1.1.war
- rm nutch-1.1.war;
- }}}
- The webapp finds its indexes in ./crawl, relative to where you start Tomcat, so use a command like (platform dependent):
-
- {{{ ~/local/tomcat/bin/catalina.sh start }}}
-
- If you want to put your search index at a different location. Edit the webapps/nutch/WEB-INF/classes/nutch-site.xml and add the following
-
- {{{
- <property>
- <name>searcher.dir</name>
- <value>/somewhere/crawl<value> <!-- There must be a crawl/index directory to run off !-->
- </property>
- }}}
- If your index is changed you need to restart Tomcat with a command like (platform dependent):
-
- {{{
- /etc/init.d/tomcat restart
- }}}
- Also it is recommended to make a copy of the index for Tomcat, so that you can crawl and update your index independently.
-
- Then visit: http://localhost:8080/nutch/
-

