Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by evertjwa: http://wiki.apache.org/nutch/GettingNutchRunningWithMacOsx ------------------------------------------------------------------------------ = Running Nutch with Mac OSX = - Nutch runs almost out of the box on OSX. + == Downloading and setting up Tomcat == @@ -39, +39 @@ Click 'Deploy' Check http://localhost:8080/nutch-0.7.1/en/search.html. You should see the Nutch Search Form. + == Crawling == - Using Terminal, set your JAVA_HOME, and cd to the nutch directory. From here you can follow the manual. + Note that the nutch command line tool (in our case nutch-0.7.1/bin/nutch) is not installed under the Tomcat web-application ($CATALINA_HOME/webapps/nutch-0.7.1/WEB-INF/...). You can either leave it there or move it manually to your tomcat/webapps/nutch/WEB-INF/classes. In the first case you will have to do some classpath configuring or maintain two nutch-site.xml files (one for indexing and one for searching). - A nice feature of the Mac Terminal (and all the other Mac applications) is that it is scriptable with AppleScript. The applescript below can be used as an example to automate tasks. + Using Terminal, cd to the directory where your bin/nutch is located. From here you can follow the instructions from the [http://lucene.apache.org/nutch/tutorial.html/ tutorial]. + + Just like any other mac application the Terminal is scriptable which is a nice feature. The applescript below will start a crawl just by doubleclicking it's icon. {{{ tell application "Terminal"
