Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "RunNutchInEclipse" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/RunNutchInEclipse?action=diff&rev1=26&rev2=27

  
  
  === Configure Nutch ===
-  * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow 
all configuration steps, ensure that you '''DO NOT''' undertake any crawling. 
The directory structure for Nutch-1.3 enables us to edit 
nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template in 
our /conf directory, these properties will then be automatically built into our 
/runtime build folder.
+  * see the [[http://wiki.apache.org/nutch/NutchTutorial|Tutorial]] and follow 
all configuration steps, ensure that you '''DO NOT''' undertake any crawling. 
The directory structure for Nutch trunk enables us to edit 
nutch-site.xml.template, nutch-default.xml and regex-urlfilter.txt.template in 
our /conf directory, these properties will then be automatically built into our 
/runtime build folder.
   * ensure that you change the property "plugin.folders" to "./src/plugin" on 
$NUTCH_HOME/conf/nutch-site.xml
-  * Once we have ensured that Nutch 1.3 is correctly configured we can 
progress to building within Eclipse.
+  * Once we have ensured that Nutch trunk is correctly configured we can 
progress to building within Eclipse.
  
  === Build Nutch ===
-  * We can now progress to building Nutch by simply dragging the build.xml 
file into the Ant perspective and double clicking on the build file. If you 
configured the project correctly, Eclipse will build Nutch for you into 
"tmp_build" and you should see something similar to the following:
+  * We can now progress to building Nutch by simply dragging the build.xml 
file into the Ant perspective and double clicking on the build file. If you 
configured the project correctly, Eclipse will build Nutch for you into "bin" 
and you should see something similar to the following:
  {{{
-  [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/deploy/bin
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/lib
-      [copy] Copying 1 file to 
/home/lewis/apache-nutch-1.3/runtime/local/lib/native
-      [copy] Copying 20 files to 
/home/lewis/apache-nutch-1.3/runtime/local/conf
-      [copy] Copying 1 file to /home/lewis/apache-nutch-1.3/runtime/local/bin
-      [copy] Copying 42 files to /home/lewis/apache-nutch-1.3/runtime/local/lib
-      [copy] Copying 107 files to 
/home/lewis/apache-nutch-1.3/runtime/local/plugins
  BUILD SUCCESSFUL
  Total time: 33 seconds
  }}}
@@ -115, +107 @@

  {{{
  <property>
    <name>plugin.folders</name>
-   <value>/home/....../nutch-1.3/src/plugin</value>
+   <value>/home/....../trunk/src/plugin</value>
  }}}
  
  === No plugins loaded during unit tests in Eclipse ===
@@ -127, +119 @@

  
  === debugging Hadoop classes ===
  Sometimes (fairly often) it makes sense to also have the Hadoop classes 
available during debugging. This should really second nature as Nutch heavily 
relies upon the underlying Hadoop infrastructure. Therefore you can check out 
(svn) the Hadoop sources into your Eclipse IDE and combine to debug this way. 
You can:
-   * Checkout the Hadoop version that should be used within Nutch 1.3
+   * Checkout the Hadoop version that should be used within Nutch trunk
    * configure a Hadoop project similar to the Nutch project within your 
Eclipse IDE
    * add the Hadoop project as a dependent project of Nutch project
    * you can now also set break points within Hadoop classes like inputformat 
implementations etc.

Reply via email to