[ 
https://issues.apache.org/jira/browse/MAHOUT-1034?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13646450#comment-13646450
 ] 

Tarun Gulyani commented on MAHOUT-1034:
---------------------------------------

Hi,

I am able to run 20 News Group Data using Naive Bayes algorithm in mahout by 
using MAHOUT_LOCAL=TRUE. But when i run same example of 20 news group using 
mahout on hadoop cluster, it gives exception in some job after successfully run 
some job.

Exception is :

13/05/01 21:04:38 INFO mapred.JobClient: Running job: job_201305011922_0018
13/05/01 21:04:39 INFO mapred.JobClient:  map 0% reduce 0%
13/05/01 21:04:55 INFO mapred.JobClient: Task Id : 
attempt_201305011922_0018_m_000000_0, Status : FAILED
java.lang.IllegalArgumentException
        at 
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
        at 
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
        at org.apache.hadoop.mapred.Child.main(Child.java:249)

13/05/01 21:05:07 INFO mapred.JobClient: Task Id : 
attempt_201305011922_0018_m_000000_1, Status : FAILED
java.lang.IllegalArgumentException
        at 
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
        at 
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
        at org.apache.hadoop.mapred.Child.main(Child.java:249)

13/05/01 21:05:07 WARN mapred.JobClient: Error reading task 
outputnoimarvinrhels2
13/05/01 21:05:07 WARN mapred.JobClient: Error reading task 
outputnoimarvinrhels2
13/05/01 21:05:13 INFO mapred.JobClient: Task Id : 
attempt_201305011922_0018_m_000000_2, Status : FAILED
java.lang.IllegalArgumentException
        at 
com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
        at 
org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.Child$4.run(Child.java:255)
        at java.security.AccessController.doPrivileged(Native Method)
        at javax.security.auth.Subject.doAs(Subject.java:415)
        at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1121)
        at org.apache.hadoop.mapred.Child.main(Child.java:249)

13/05/01 21:05:13 WARN mapred.JobClient: Error reading task 
outputnoimarvinrhels2
13/05/01 21:05:13 WARN mapred.JobClient: Error reading task 
outputnoimarvinrhels2
13/05/01 21:05:25 INFO mapred.JobClient: Job complete: job_201305011922_0018
13/05/01 21:05:25 INFO mapred.JobClient: Counters: 7
13/05/01 21:05:25 INFO mapred.JobClient:   Job Counters
13/05/01 21:05:25 INFO mapred.JobClient:     SLOTS_MILLIS_MAPS=31281
13/05/01 21:05:25 INFO mapred.JobClient:     Total time spent by all reduces 
waiting after reserving slots (ms)=0
13/05/01 21:05:25 INFO mapred.JobClient:     Total time spent by all maps 
waiting after reserving slots (ms)=0
13/05/01 21:05:25 INFO mapred.JobClient:     Launched map tasks=4
13/05/01 21:05:25 INFO mapred.JobClient:     Data-local map tasks=4
13/05/01 21:05:25 INFO mapred.JobClient:     SLOTS_MILLIS_REDUCES=0
13/05/01 21:05:25 INFO mapred.JobClient:     Failed map tasks=1
13/05/01 21:05:25 INFO driver.MahoutDriver: Program took 84708 ms (Minutes: 
1.4118)
+ echo 'Self testing on training set'
Self testing on training set
   



Please help me regarding this.

                
> ERROR in Navie Bayes Training(update: seqdirectory does not give output)
> ------------------------------------------------------------------------
>
>                 Key: MAHOUT-1034
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1034
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.7
>         Environment: Ubuntu 11.04
>            Reporter: Leting Wu
>            Assignee: Robin Anil
>
> When run either examples/classify-20newsgrouops.sh or ash-email-examples.sh, 
> trainnb always fails:
> {noformat}
> INFO mapred.JobClient: Task Id : attempt_201206281546_0003_m_000000_0, Status 
> : FAILED
> java.lang.IllegalArgumentException
>       at 
> com.google.common.base.Preconditions.checkArgument(Preconditions.java:72)
>       at 
> org.apache.mahout.classifier.naivebayes.training.WeightsMapper.setup(WeightsMapper.java:42)
>       at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:142)
>       at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
>       at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
>       at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
>       at java.security.AccessController.doPrivileged(Native Method)
>       at javax.security.auth.Subject.doAs(Subject.java:396)
>       at 
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1177)
>       at org.apache.hadoop.mapred.Child.main(Child.java:264)
> {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to