The TokenStream class is part of Lucene. The Lucene jar is a separate file that is not normally part of a Mahout job. There is something missing in the class path for the Mahout job.
(Please just post this kind of error on the user@mahout mailing list.) On Fri, Aug 12, 2011 at 7:41 PM, patrick.J (JIRA) <[email protected]> wrote: > > [ > https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel > ] > > patrick.J updated MAHOUT-784: > ----------------------------- > > Description: > somebody may help, > I run"Twenty Newsgroups Classification Example"(from > https://cwiki.apache.org/confluence/display/MAHOUT/Twenty+Newsgroups) step by > step.And the Environment variables(HADOOP_HOME,MAHOUT_HOME,HADOOP_CONF_DIR) > all set in ~/.bashrc. > But when I going the step train the classifier on Hadoop, Exception was > cast like: > $> $MAHOUT_HOME/bin/mahout trainclassifier -i > 20news-input/bayes-train-input -o newsmodel -type bayes -ng 3 -source > hdfs: > 11/08/13 09:37:54 INFO mapred.JobClient: map 0% reduce 0% > 11/08/13 09:38:04 INFO mapred.JobClient: Task Id : > attempt_201108130934_0001_m_000000_0, Status : FAILED > java.lang.RuntimeException: Error in configuring object > at > org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93) > at > org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64) > at > org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) > at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:354) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) > at org.apache.hadoop.mapred.Child.main(Child.java:170) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at > org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88) > ... 5 more > Caused by: java.lang.NoClassDefFoundError: > org/apache/lucene/analysis/TokenStream > at java.lang.Class.forName0(Native Method) > at java.lang.Class.forName(Class.java:247) > at > org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:762) > at > org.apache.hadoop.conf.Configuration.getClass(Configuration.java:807) > at > org.apache.hadoop.conf.Configuration.getClass(Configuration.java:833) > at org.apache.hadoop.mapred.JobConf.getMapperClass(JobConf.java:772) > at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34) > ... 10 more > Caused by: java.lang.ClassNotFoundException: > org.apache.lucene.analysis.TokenStream > at java.net.URLClassLoader$1.run(URLClassLoader.java:202) > at java.security.AccessController.doPrivileged(Native Method) > at java.net.URLClassLoader.findClass(URLClassLoader.java:190) > at java.lang.ClassLoader.loadClass(ClassLoader.java:306) > at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) > at java.lang.ClassLoader.loadClass(ClassLoader.java:247) > ... 17 more > What happened? I just walked through the tutorials....... > > > > was: > somebody may help, > I run"Twenty Newsgroups Classification Example"(from > https://cwiki.apache.org/confluence/display/MAHOUT/Twenty+Newsgroups)step by > step.And the Environment variables(HADOOP_HOME,MAHOUT_HOME,HADOOP_CONF_DIR) > all set in ~/.bashrc. > But when I going the step train the classifier on Hadoop, Exception was > cast like: > $> $MAHOUT_HOME/bin/mahout trainclassifier -i > 20news-input/bayes-train-input -o newsmodel -type bayes -ng 3 -source > hdfs: > 11/08/13 09:37:54 INFO mapred.JobClient: map 0% reduce 0% > 11/08/13 09:38:04 INFO mapred.JobClient: Task Id : > attempt_201108130934_0001_m_000000_0, Status : FAILED > java.lang.RuntimeException: Error in configuring object > at > org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93) > at > org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64) > at > org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) > at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:354) > at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) > at org.apache.hadoop.mapred.Child.main(Child.java:170) > Caused by: java.lang.reflect.InvocationTargetException > at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) > at > sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) > at > sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) > at java.lang.reflect.Method.invoke(Method.java:597) > at > org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88) > ... 5 more > Caused by: java.lang.NoClassDefFoundError: > org/apache/lucene/analysis/TokenStream > at java.lang.Class.forName0(Native Method) > at java.lang.Class.forName(Class.java:247) > at > org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:762) > at > org.apache.hadoop.conf.Configuration.getClass(Configuration.java:807) > at > org.apache.hadoop.conf.Configuration.getClass(Configuration.java:833) > at org.apache.hadoop.mapred.JobConf.getMapperClass(JobConf.java:772) > at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34) > ... 10 more > Caused by: java.lang.ClassNotFoundException: > org.apache.lucene.analysis.TokenStream > at java.net.URLClassLoader$1.run(URLClassLoader.java:202) > at java.security.AccessController.doPrivileged(Native Method) > at java.net.URLClassLoader.findClass(URLClassLoader.java:190) > at java.lang.ClassLoader.loadClass(ClassLoader.java:306) > at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) > at java.lang.ClassLoader.loadClass(ClassLoader.java:247) > ... 17 more > What happened? I just walked through the tutorials....... > > > > >> Exception at 20 Newsgroups examples >> ----------------------------------- >> >> Key: MAHOUT-784 >> URL: https://issues.apache.org/jira/browse/MAHOUT-784 >> Project: Mahout >> Issue Type: Bug >> Components: Classification >> Environment: Ubuntu 11.04 / Hadoop 0.20.2 / Maven 3.0.3 / >> mahout-distribution-0.5 >> Reporter: patrick.J >> >> somebody may help, >> I run"Twenty Newsgroups Classification Example"(from >> https://cwiki.apache.org/confluence/display/MAHOUT/Twenty+Newsgroups) step >> by step.And the Environment >> variables(HADOOP_HOME,MAHOUT_HOME,HADOOP_CONF_DIR) all set in ~/.bashrc. >> But when I going the step train the classifier on Hadoop, Exception was >> cast like: >> $> $MAHOUT_HOME/bin/mahout trainclassifier -i >> 20news-input/bayes-train-input -o newsmodel -type bayes -ng 3 >> -source hdfs: >> 11/08/13 09:37:54 INFO mapred.JobClient: map 0% reduce 0% >> 11/08/13 09:38:04 INFO mapred.JobClient: Task Id : >> attempt_201108130934_0001_m_000000_0, Status : FAILED >> java.lang.RuntimeException: Error in configuring object >> at >> org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93) >> at >> org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64) >> at >> org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) >> at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:354) >> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) >> at org.apache.hadoop.mapred.Child.main(Child.java:170) >> Caused by: java.lang.reflect.InvocationTargetException >> at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) >> at >> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) >> at >> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) >> at java.lang.reflect.Method.invoke(Method.java:597) >> at >> org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88) >> ... 5 more >> Caused by: java.lang.NoClassDefFoundError: >> org/apache/lucene/analysis/TokenStream >> at java.lang.Class.forName0(Native Method) >> at java.lang.Class.forName(Class.java:247) >> at >> org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:762) >> at >> org.apache.hadoop.conf.Configuration.getClass(Configuration.java:807) >> at >> org.apache.hadoop.conf.Configuration.getClass(Configuration.java:833) >> at org.apache.hadoop.mapred.JobConf.getMapperClass(JobConf.java:772) >> at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34) >> ... 10 more >> Caused by: java.lang.ClassNotFoundException: >> org.apache.lucene.analysis.TokenStream >> at java.net.URLClassLoader$1.run(URLClassLoader.java:202) >> at java.security.AccessController.doPrivileged(Native Method) >> at java.net.URLClassLoader.findClass(URLClassLoader.java:190) >> at java.lang.ClassLoader.loadClass(ClassLoader.java:306) >> at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301) >> at java.lang.ClassLoader.loadClass(ClassLoader.java:247) >> ... 17 more >> What happened? I just walked through the tutorials....... >> > > -- > This message is automatically generated by JIRA. > For more information on JIRA, see: http://www.atlassian.com/software/jira > > > -- Lance Norskog [email protected]
