Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney: http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=16&rev2=17 = RunNutchInEclipse = - This page acts as a new resource for working with Nutch from within the Eclipse IDE. It is intended to provide a comprehensive beginning resource for the configuration, building, crawling and debugging of Nutch 1.3 in the above context. + This page acts as a resource for working with Nutch from within the Eclipse IDE. It is intended to provide a comprehensive beginning resource for the configuration, building, crawling and debugging of Nutch 1.3 in the above context. == Tested with == * Nutch release 1.3 @@ -14, +14 @@ GNOME 2.32.1 == Before you start == - Setting up Nutch to run into Eclipse can be tricky, and most of the time you are much faster if you edit Nutch in Eclipse but run the scripts from the command line (my 2 cents). However, it's very useful to be able to debug Nutch in Eclipse. But again you might be quickier by looking at the logs (logs/hadoop.log)... + Setting up Nutch to run into Eclipse can be tricky, and most of the time you are much faster if you edit Nutch in Eclipse but run the scripts from the command line. However, it's very useful to be able to debug Nutch in Eclipse and is also extremely useful when applying and testing patches as it enables you to see them working in a larger context. This being said, you will still benefit greatly by looking at the hadoop.log output. <<TableOfContents(3)>> == Steps == === Install Nutch === - * Grab a fresh release of Nutch 0.8 or make a fresh checkout of Nutch 0.8 from svn - * Do not build Nutch now. Make sure you have no .project and .classpath files in the Nutch directory + * Grab an official release of Nutch 1.3 from [[http://www.apache.org/dyn/closer.cgi/nutch/|here]] or check out the latest branch development [[https://svn.apache.org/repos/asf/nutch/branches/|here]]. + * Do not build Nutch now. Make sure you have no .project and .classpath files in the Nutch directory and that Nutch has not built the /runtime directory '''N.B.''' This is absolutely essential. === Create a new java project in Eclipse === - * File > New > Project > Java project > click Next + * File > New > Project > Java project + * Name the project - * select "Create project from existing source" and use the location where you downloaded Nutch + * Select "Create project from existing source" and use the location where you downloaded Nutch - * click on Next, and wait while Eclipse is scanning the folders + * Click on Next, and wait while Eclipse is scanning the folders - * add the folder "conf" to the classpath (scroll down the list and right-click on "conf". This step is necessary) + * Add the folder "conf" to the classpath, this can be done by right-clicking on the project, select "properties" then "Java Build Path" tab (left menu) and then the "Libraries" tab. Click "Add Class Folder..." button, and select "conf" from the list) + * Go to "Order and Export" tab, find the entry for added "conf" folder (it will most likely be at the bottom of the list) and move it to the top (by checking it and clicking the "Top" button). This is required so Eclipse will take config (nutch-default.xml, etc.) resources from our "conf" folder and not from somewhere else. - * Eclipse should have guessed all the java files that must be added on your classpath. If it's not the case, add "src/java", "src/test" and all plugin "src/java" and "src/test" folders to your source folders. Also add all jars in "lib" and in the plugin lib folders to your libraries + * Eclipse should have guessed all the Java files that must be added to your classpath. If that's not the case, add "src/java", "src/test" and all plugin "src/java" and "src/test" folders to your source folders. - * set output dir to "tmp_build", create it if necessary + * Click the "Source" tab and set the default output folder to "$NUTCH_HOME/bin/tmp_build". (You may need to create the tmp_build folder.) + * Click the "Finish" button * DO NOT add "build" to classpath - * '''or you can use [[attachment:.classpath]] file''' ==== If you're using the trunk ==== As of revision 511012 there were a few plugins on the trunk and a couple other files that did not build, and are actually excluded from the ant projects. You may want to remove the following projects from the build structure:

