[
https://issues.apache.org/jira/browse/MAHOUT-691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13031365#comment-13031365
]
Jake Mannix commented on MAHOUT-691:
------------------------------------
Well, what I found was the job jar indeed has all of the right classes unpacked
in it, but my attempt to run seq2sparse on a real hadoop cluster still fails:
11/05/10 20:35:52 INFO mapred.JobClient: Task Id :
attempt_201104300433_143470_m_000000_2, Status : FAILED
Error: java.lang.ClassNotFoundException: org.apache.lucene.analysis.Analyzer
at java.net.URLClassLoader$1.run(URLClassLoader.java:200)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:252)
at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
at java.lang.ClassLoader.defineClass1(Native Method)
at java.lang.ClassLoader.defineClass(ClassLoader.java:621)
at
java.security.SecureClassLoader.defineClass(SecureClassLoader.java:124)
at java.net.URLClassLoader.defineClass(URLClassLoader.java:260)
at java.net.URLClassLoader.access$000(URLClassLoader.java:56)
at java.net.URLClassLoader$1.run(URLClassLoader.java:195)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:252)
at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
at
org.apache.mahout.vectorizer.document.SequenceFileTokenizerMapper.setup(SequenceFileTokenizerMapper.java:58)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:583)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
at org.apache.hadoop.mapred.Child.main(Child.java:170)
> "job" file dependent jars in lib/ aren't found by Hadoop?
> ---------------------------------------------------------
>
> Key: MAHOUT-691
> URL: https://issues.apache.org/jira/browse/MAHOUT-691
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.4
> Reporter: Sean Owen
> Assignee: Sean Owen
> Labels: classpath, dependency, hadoop, jar
> Fix For: 0.5
>
> Attachments: MAHOUT-691.patch
>
>
> (marking for 0.5 until we decide it's fixed or not actually a problem)
> As discussed on the mailing list, there's some apparent issue with packaging
> the "job" jar files with dependent jars within the lib/ directory. Changing
> the config to repackage everything into one jar can fix this. Patch attached
> for discussion.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira