Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=16&rev2=17

  = RunNutchInEclipse =
- This page acts as a new resource for working with Nutch from within the 
Eclipse IDE. It is intended to provide a comprehensive beginning resource for 
the configuration, building, crawling and debugging of Nutch 1.3 in the above 
context.
+ This page acts as a resource for working with Nutch from within the Eclipse 
IDE. It is intended to provide a comprehensive beginning resource for the 
configuration, building, crawling and debugging of Nutch 1.3 in the above 
context.
  
  == Tested with ==
   * Nutch release 1.3
@@ -14, +14 @@

       GNOME 2.32.1
  
  == Before you start ==
- Setting up Nutch to run into Eclipse can be tricky, and most of the time you 
are much faster if you edit Nutch in Eclipse but run the scripts from the 
command line (my 2 cents). However, it's very useful to be able to debug Nutch 
in Eclipse. But again you might be quickier by looking at the logs 
(logs/hadoop.log)...
+ Setting up Nutch to run into Eclipse can be tricky, and most of the time you 
are much faster if you edit Nutch in Eclipse but run the scripts from the 
command line. However, it's very useful to be able to debug Nutch in Eclipse 
and is also extremely useful when applying and testing patches as it enables 
you to see them working in a larger context. This being said, you will still 
benefit greatly by looking at the hadoop.log output.
  
  <<TableOfContents(3)>>
  
  == Steps ==
  === Install Nutch ===
-  * Grab a fresh release of Nutch 0.8 or make a fresh checkout of Nutch 0.8 
from svn
-  * Do not build Nutch now. Make sure you have no .project and .classpath 
files in the Nutch directory
+  * Grab an official release of Nutch 1.3  from 
[[http://www.apache.org/dyn/closer.cgi/nutch/|here]] or check out the latest 
branch development [[https://svn.apache.org/repos/asf/nutch/branches/|here]].
+  * Do not build Nutch now. Make sure you have no .project and .classpath 
files in the Nutch directory and that Nutch has not built the /runtime 
directory '''N.B.''' This is absolutely essential.
  
  === Create a new java project in Eclipse ===
-  * File > New > Project > Java project > click Next
+  * File > New > Project > Java project
+  * Name the project 
-  * select "Create project from existing source" and use the location where 
you downloaded Nutch
+  * Select "Create project from existing source" and use the location where 
you downloaded Nutch
-  * click on Next, and wait while Eclipse is scanning the folders
+  * Click on Next, and wait while Eclipse is scanning the folders
-  * add the folder "conf" to the classpath (scroll down the list and 
right-click on "conf". This step is necessary)
+  * Add the folder "conf" to the classpath, this can be done by right-clicking 
on the project, select "properties" then "Java Build Path" tab (left menu) and 
then the "Libraries" tab. Click "Add Class Folder..." button, and select "conf" 
from the list)
+  * Go to "Order and Export" tab, find the entry for added "conf" folder (it 
will most likely be at the bottom of the list) and move it to the top (by 
checking it and clicking the "Top" button). This is required so Eclipse will 
take config (nutch-default.xml, etc.) resources from our "conf" folder and not 
from somewhere else.
-  * Eclipse should have guessed all the java files that must be added on your 
classpath. If it's not the case, add "src/java", "src/test" and all plugin 
"src/java" and "src/test" folders to your source folders. Also add all jars in 
"lib" and in the plugin lib folders to your libraries
+  * Eclipse should have guessed all the Java files that must be added to your 
classpath. If that's not the case, add "src/java", "src/test" and all plugin 
"src/java" and "src/test" folders to your source folders. 
-  * set output dir to "tmp_build", create it if necessary
+  * Click the "Source" tab and set the default output folder to 
"$NUTCH_HOME/bin/tmp_build". (You may need to create the tmp_build folder.)
+  * Click the "Finish" button
   * DO NOT add "build" to classpath
-  * '''or you can use [[attachment:.classpath]] file'''
  
  ==== If you're using the trunk ====
  As of revision 511012 there were a few plugins on the trunk and a couple 
other files that did not build, and are actually excluded from the ant 
projects.  You may want to remove the following projects from the build 
structure:

Reply via email to