Hi all
I have installed nutch trunk from svn. It's installed (under Linux)
in /opt/nutch-svn/trunk with a soft link to this directory as
/opt/nutch, and $NUTCH_HOME is set to /opt/nutch I have issued the
ant command in this directory (build.xml file exists).
When I issued the command bin/nutch inject crawl/crawldb urls I got
the No such file or directory leading me to discover the bin directory
is located under the src directory. Amending the command to be
src/bin/nutch inject crawl/crawldb urls gives me the error message:
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/nutch/crawl/InjectorJob
Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.InjectorJob
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
Could not find the main class: org.apache.nutch.crawl.InjectorJob.
Program will exit.
I assume this is a problem with the class path but don't get this
problem with the binary version if downloaded.
If I provide a link in $NUTCH_HOME to src/bin as bin then issue the
command bin/nutch inject crawl/crawldb urls I get the following error
message:
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/hadoop/util/PlatformName
Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.util.PlatformName
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
Could not find the main class: org.apache.hadoop.util.PlatformName.
Program will exit.
Exception in thread "main" java.lang.NoClassDefFoundError:
org/apache/nutch/crawl/InjectorJob
Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.InjectorJob
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:248)
Could not find the main class: org.apache.nutch.crawl.InjectorJob.
Program will exit.
In both cases I'm unsure what to set the class path as. Can any help?
Alternatively is this not a classpath issue but something else
instead?
many thanks
Paul