Richard Braman wrote:
I am trying to get Nutch up and running on Fedora Core 5. After
building trunk from svn with Eclipse, setting environment variables,
and configuring my nutch-site, I ran the inject command. Here is the
error I get when running inject. It looks like an incorrect library
configuration, but it's strange because I built it from Eclipse.
[EMAIL PROTECTED] ~]# /root/workspace/nutch/bin/nutch inject crawl/crawldb
/root/workspace/nutch/urls/
Injector: starting
Injector: crawlDb: alaskacruises/crawldb
Injector: urlDir: /root/workspace/nutch/urls
Injector: Converting injected urls to crawl db entries.
Injector: java.lang.RuntimeException: java.lang.ClassCastException:
org.apache.xerces.dom.DocumentImpl cannot be cast to gnu.xml.dom.DomDocument
at org.apache.hadoop.conf.Configuration.write (Configuration.java:515)
at org.apache.hadoop.mapred.JobClient.submitJob (JobClient.java:288)
at org.apache.hadoop.mapred.JobClient.runJob (JobClient.java:333)
at org.apache.nutch.crawl.Injector.inject (Injector.java:146)
at org.apache.nutch.crawl.Injector.run (Injector.java:173)
at org.apache.nutch.util.ToolBase.doMain (ToolBase.java:145)
at org.apache.nutch.crawl.Injector.main (Injector.java:162)
Caused by: java.lang.ClassCastException:
org.apache.xerces.dom.DocumentImpl cannot be cast to gnu.xml.dom.DomDocument
at gnu.xml.transform.TransformerImpl.transform (libgcj.so.7)
at org.apache.hadoop.conf.Configuration.write (Configuration.java:516)
...6 more
Look at the stack trace: the gnu.xml.* classes and the libgcj.so.7
frame show that you are running under GCJ. Nutch requires the Sun JDK.
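
If it is unclear which runtime the "java" on your PATH actually is, a
small check like the one below can help. This is only a sketch: the
class name JvmCheck and the helper looksLikeGcj are made up for
illustration, and the match on "gcj"/"gnu" is a heuristic that assumes
a GCJ-based runtime identifies itself that way in the standard
java.vm.name and java.vendor system properties.

```java
import java.util.Locale;

public class JvmCheck {
    // Heuristic (assumption): a GCJ-based runtime mentions "gcj" or "gnu"
    // in its VM name or vendor string. Not an official detection method.
    static boolean looksLikeGcj(String vmName, String vendor) {
        String s = (String.valueOf(vmName) + " " + String.valueOf(vendor))
                .toLowerCase(Locale.ENGLISH);
        return s.indexOf("gcj") >= 0 || s.indexOf("gnu") >= 0;
    }

    public static void main(String[] args) {
        // Standard system properties that every Java runtime is expected to define.
        String version = System.getProperty("java.version");
        String vmName = System.getProperty("java.vm.name");
        String vendor = System.getProperty("java.vendor");

        System.out.println("java.version = " + version);
        System.out.println("java.vm.name = " + vmName);
        System.out.println("java.vendor  = " + vendor);

        if (looksLikeGcj(vmName, vendor)) {
            System.out.println("This looks like GCJ; point JAVA_HOME at a Sun JDK instead.");
        } else {
            System.out.println("This does not look like GCJ.");
        }
    }
}
```

Compile and run it with the same "java" that bin/nutch picks up, so the
output reflects the runtime Nutch is really using.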
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com