[
https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13646450#comment-13646450
]
Tarun Gulyani commented on MAHOUT-1034:
---------------------------------------
Hi,
I am able to run 20 News Group Data using Naive Bayes algorithm in mahout by
using MAHOUT_LOCAL=TRUE. But when i run same example of 20 news group using
mahout on hadoop cluster, it gives exception in some job after successfully run
some job.
Exception is :
13/05/01 21:04:38 INFO mapred.JobClient: Running job: job_201305011922_0018
13/05/01 21:04:39 INFO mapred.JobClient: map 0% reduce 0%
13/05/01 21:04:55 INFO mapred.JobClient: Task Id :
attempt_201305011922_0018_m_000000_0, Status : FAILED
java.lang.IllegalArgumentException
at
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
at
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/05/01 21:05:07 INFO mapred.JobClient: Task Id :
attempt_201305011922_0018_m_000000_1, Status : FAILED
java.lang.IllegalArgumentException
at
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
at
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/05/01 21:05:07 WARN mapred.JobClient: Error reading task
outputnoimarvinrhels2
13/05/01 21:05:07 WARN mapred.JobClient: Error reading task
outputnoimarvinrhels2
13/05/01 21:05:13 INFO mapred.JobClient: Task Id :
attempt_201305011922_0018_m_000000_2, Status : FAILED
java.lang.IllegalArgumentException
at
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
at
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:415)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
at org.apache.hadoop.mapred.Child.main(Child.java:249)
13/05/01 21:05:13 WARN mapred.JobClient: Error reading task
outputnoimarvinrhels2
13/05/01 21:05:13 WARN mapred.JobClient: Error reading task
outputnoimarvinrhels2
13/05/01 21:05:25 INFO mapred.JobClient: Job complete: job_201305011922_0018
13/05/01 21:05:25 INFO mapred.JobClient: Counters: 7
13/05/01 21:05:25 INFO mapred.JobClient: Job Counters
13/05/01 21:05:25 INFO mapred.JobClient: SLOTS_MILLIS_MAPS=31281
13/05/01 21:05:25 INFO mapred.JobClient: Total time spent by all reduces
waiting after reserving slots (ms)=0
13/05/01 21:05:25 INFO mapred.JobClient: Total time spent by all maps
waiting after reserving slots (ms)=0
13/05/01 21:05:25 INFO mapred.JobClient: Launched map tasks=4
13/05/01 21:05:25 INFO mapred.JobClient: Data-local map tasks=4
13/05/01 21:05:25 INFO mapred.JobClient: SLOTS_MILLIS_REDUCES=0
13/05/01 21:05:25 INFO mapred.JobClient: Failed map tasks=1
13/05/01 21:05:25 INFO driver.MahoutDriver: Program took 84708 ms (Minutes:
1.4118)
+ echo 'Self testing on training set'
Self testing on training set
Please help me regarding this.
> ERROR in Navie Bayes Training(update: seqdirectory does not give output)
> ------------------------------------------------------------------------
>
> Key: MAHOUT-1034
> URL: https://issues.apache.org/jira/browse/MAHOUT-1034
> Project: Mahout
> Issue Type: Bug
> Components: Classification
> Affects Versions: 0.7
> Environment: Ubuntu 11.04
> Reporter: Leting Wu
> Assignee: Robin Anil
>
> When run either examples/classify-20newsgrouops.sh or ash-email-examples.sh,
> trainnb always fails:
> {noformat}
> INFO mapred.JobClient: Task Id : attempt_201206281546_0003_m_000000_0, Status
> : FAILED
> java.lang.IllegalArgumentException
> at
> com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
> at
> org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
> at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
> at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
> at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
> at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:396)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1177)
> at org.apache.hadoop.mapred.Child.main(Child.java:264)
> {noformat}
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira