Hi,

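There are a number of major issues with your attempts to get Nutch working,
the most visible being that the bin/nutch script you downloaded comes from
branch-1.2 while the source you built is 2.1.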

Please check out our wiki for tutorials on Nutch.

Only Nutch distributions obtained from official Apache resources are
supported, e.g. releases from the Apache mirrors and development
versions from our SVN area. All of these locations are linked from the
Nutch home page.
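
As a rough sketch of what to check first: the launcher script should come
out of the same tree you built rather than from another branch. The helper
below assumes the build leaves its launcher at runtime/local/bin/nutch under
the source directory; that layout is my assumption for your 2.1 tree, so
verify it against what ant actually produced. The function name
find_nutch_script is hypothetical and only for illustration.

```shell
#!/bin/sh
# Hypothetical helper: print the launcher script that belongs to a built
# Nutch source tree, or fail if none is found.
# ASSUMPTION: the build places the launcher at runtime/local/bin/nutch;
# check this against your own checkout.
find_nutch_script() {
    # Strip one trailing slash so NUTCH_HOME-style paths join cleanly.
    nutch_home="${1%/}"
    candidate="$nutch_home/runtime/local/bin/nutch"

    if [ -x "$candidate" ]; then
        echo "$candidate"
        return 0
    fi

    echo "no launcher under $nutch_home/runtime/local; rebuild with ant" >&2
    return 1
}
```

For example, find_nutch_script /root/Downloads/apache-nutch-2.1/ would print
the script path if the build produced one, and return non-zero otherwise.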

hth

Lewis

On Thu, Oct 18, 2012 at 3:05 PM, sumarlidason <[email protected]> wrote:
> Good Morning,
>
> I am working on building Nutch from source on CentOS to be used in
> conjunction with Solr and Hadoop.
>
> So far I have...
> downloaded the source ( http://www.gtlib.gatech.edu/pub/apache/nutch/2.1/ ),
> built it with ant, successfully,
> created a bin folder,
> downloaded the nutch script (
> https://svn.apache.org/repos/asf/nutch/branches/branch-1.2/bin/nutch ),
> set three environment variables:
> JAVA_HOME=/usr/java/jdk1.6.0_26/
> NUTCH_HOME=/root/Downloads/apache-nutch-2.1/
> NUTCH_JAVA_HOME=/usr/java/jdk1.6.0_26/
>
> When attempting to run, I get the following error:
>
> [root@hdpjt01 apache-nutch-2.1]# bin/nutch crawl urls -dir crawl -depth 3
> -topN 5
> Exception in thread "main" java.lang.NoClassDefFoundError:
> org/apache/hadoop/util/PlatformName
> Caused by: java.lang.ClassNotFoundException:
> org.apache.hadoop.util.PlatformName
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
> Could not find the main class: org.apache.hadoop.util.PlatformName.  Program
> will exit.
> Exception in thread "main" java.lang.NoClassDefFoundError:
> org/apache/nutch/crawl/Crawl
> Caused by: java.lang.ClassNotFoundException: org.apache.nutch.crawl.Crawl
>         at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>         at java.security.AccessController.doPrivileged(Native Method)
>         at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>         at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>         at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
> Could not find the main class: org.apache.nutch.crawl.Crawl.  Program will
> exit.
>
> As I drafted this post, I noticed a possible problem: the script I'm running
> is from the 1.2 branch. Where can I get a script for 2.1? When I attempt to
> run the jar directly I get other errors. I read somewhere about merging jars
> so it could be run standalone?
>
> Please, if someone can assist...
>
>
>
> --
> View this message in context: 
> http://lucene.472066.n3.nabble.com/building-from-src-tp4014501.html
> Sent from the Nutch - User mailing list archive at Nabble.com.



-- 
Lewis
