I am not terribly familiar with this script personally, but one useful step would be to print out the CLASSPATH it is running with, to spot what might be the problem. Heck, it would probably be useful if the script always printed what it was doing on the command line.
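Something like the following might help with that check. It is only a rough sketch: the helper names `print_classpath` and `find_class_in_classpath` are made up for illustration, it assumes `unzip` is installed, and it assumes the script honors the CLASSPATH environment variable (which the export workaround in the quoted message suggests, but I have not verified).

```shell
#!/bin/sh
# Hypothetical helpers, not part of bin/mahout.

# Print each classpath entry on its own line so a missing jar is easy to spot.
# Uses the first argument if given, otherwise the CLASSPATH environment variable.
print_classpath() {
    printf '%s\n' "${1:-$CLASSPATH}" | tr ':' '\n' | sed '/^$/d'
}

# Print every jar on the classpath that contains the given class.
# $1: class name in dotted form, e.g. com.google.common.base.Charsets
# $2: optional classpath string (defaults to CLASSPATH)
# Only plain *.jar entries are inspected; directories and wildcards are skipped.
find_class_in_classpath() {
    class_file=$(printf '%s' "$1" | tr '.' '/').class
    print_classpath "$2" | while IFS= read -r entry; do
        case "$entry" in
            *.jar)
                if [ -f "$entry" ] && unzip -l "$entry" 2>/dev/null | grep -qF "$class_file"; then
                    printf '%s\n' "$entry"
                fi
                ;;
        esac
    done
}
```

After sourcing this, `print_classpath` lists the current entries, and `find_class_in_classpath com.google.common.base.Charsets` lists the jars that contain the class. If the second command prints nothing, none of the jar entries on that classpath provide it.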
Maybe others have a clearer guess at the issue.

On Tue, Jun 7, 2011 at 3:44 PM, Alex Ott <[email protected]> wrote:
> Hello all
>
> I'm not sure is this actual error or just misconfiguration, so I'm
> posting it here....
>
> I'm running wikipedia clusterization examples and found following
> problem. if I run following command:
>
> bin/mahout seqwiki -all -i articles.xml -o wikipedia-seqfiles -e
>
> then after first part of job I get following error:
>
> 11/06/07 16:37:39 WARN mapred.JobClient: Use GenericOptionsParser for parsing the arguments. Applications should implement Tool for the same.
> 11/06/07 16:37:39 INFO input.FileInputFormat: Total input paths to process : 1
> 11/06/07 16:37:39 INFO mapred.JobClient: Running job: job_201106071631_0006
> 11/06/07 16:37:40 INFO mapred.JobClient:  map 0% reduce 0%
> 11/06/07 16:38:29 INFO mapred.JobClient: Task Id : attempt_201106071631_0006_m_000000_0, Status : FAILED
> Error: java.lang.ClassNotFoundException: com.google.common.base.Charsets
>     at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
>     at java.security.AccessController.doPrivileged(Native Method)
>     at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
>     at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
>     at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
>     at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
>     at org.apache.mahout.classifier.bayes.XmlInputFormat$XmlRecordReader.<init>(XmlInputFormat.java:75)
>     at org.apache.mahout.classifier.bayes.XmlInputFormat.createRecordReader(XmlInputFormat.java:51)
>     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:588)
>     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
>     at org.apache.hadoop.mapred.Child.main(Child.java:170)
>
> Although I can find com.google.common.base.Charsets class in the
> mahout-examples-0.5-job.jar file included into distribution.
> And this error is fixed if I explicitly add this jar file into
> CLASSPATH with command:
>
> export CLASSPATH="`pwd`/mahout-examples-0.5-job.jar"
>
> Is it problem with Hadoop configuration? I'm using pseudo-distributed
> setup, where only data paths are specified additionally
>
> --
> With best wishes, Alex Ott
> http://alexott.net/
> Tiwtter: alexott_en (English), alexott (Russian)
> Skype: alex.ott
