Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=26&rev2=27 === Configure Nutch === - * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow all configuration steps, ensure that you '''DO NOT''' undertake any crawling. The directory structure for Nutch-1.3 enables us to edit nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template in our /conf directory, these properties will then be automatically built into our /runtime build folder. + * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow all configuration steps, ensure that you '''DO NOT''' undertake any crawling. The directory structure for Nutch trunk enables us to edit nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template in our /conf directory, these properties will then be automatically built into our /runtime build folder. * ensure that you change the property "plugin.folders" to "./src/plugin" on $NUTCH_HOME/conf/nutch-site.xml - * Once we have ensured that Nutch 1.3 is correctly configured we can progress to building within Eclipse. + * Once we have ensured that Nutch trunk is correctly configured we can progress to building within Eclipse. === Build Nutch === - * We can now progress to building Nutch by simply dragging the build.xml file into the Ant perspective and double clicking on the build file. If you configured the project correctly, Eclipse will build Nutch for you into "tmp_build" and you should see something similar to the following: + * We can now progress to building Nutch by simply dragging the build.xml file into the Ant perspective and double clicking on the build file. If you configured the project correctly, Eclipse will build Nutch for you into "bin" and you should see something similar to the following: {{{ - [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy - [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy/bin - [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/lib - [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/lib/native - [copy] Copying 20 files to /home/lewis/apache-nutch-1.3/runtime/local/conf - [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/bin - [copy] Copying 42 files to /home/lewis/apache-nutch-1.3/runtime/local/lib - [copy] Copying 107 files to /home/lewis/apache-nutch-1.3/runtime/local/plugins BUILD SUCCESSFUL Total time: 33 seconds }}} @@ -115, +107 @@ {{{ <property> <name>plugin.folders</name> - <value>/home/....../nutch-1.3/src/plugin</value> + <value>/home/....../trunk/src/plugin</value> }}} === No plugins loaded during unit tests in Eclipse === @@ -127, +119 @@ === debugging Hadoop classes === Sometimes (fairly often) it makes sense to also have the Hadoop classes available during debugging. This should really second nature as Nutch heavily relies upon the underlying Hadoop infrastructure. Therefore you can check out (svn) the Hadoop sources into your Eclipse IDE and combine to debug this way. You can: - * Checkout the Hadoop version that should be used within Nutch 1.3 + * Checkout the Hadoop version that should be used within Nutch trunk * configure a Hadoop project similar to the Nutch project within your Eclipse IDE * add the Hadoop project as a dependent project of Nutch project * you can now also set break points within Hadoop classes like inputformat implementations etc.

