[ https://issues.apache.org/jira/browse/MAHOUT-476?focusedWorklogId=989503&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-989503 ]
ASF GitHub Bot logged work on MAHOUT-476:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 29/Oct/25 19:19
Start Date: 29/Oct/25 19:19
Worklog Time Spent: 10m
Work Description: andrewmusselman merged PR #590:
URL: https://github.com/apache/mahout/pull/590
Issue Time Tracking
-------------------
Worklog Id: (was: 989503)
Time Spent: 20m (was: 10m)
> bug when running
> org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorDriver on hadoop
> -------------------------------------------------------------------------------------------
>
> Key: MAHOUT-476
> URL: https://issues.apache.org/jira/browse/MAHOUT-476
> Project: Mahout
> Issue Type: Bug
> Affects Versions: 0.3
> Environment: hadoop 0.20.2
> mahout-0.3
> ubuntu
> Reporter: leon lee
> Priority: Major
> Fix For: 0.3
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> When I follow the wiki instructions at
> https://cwiki.apache.org/MAHOUT/wikipedia-bayes-example.html
> (by the way, the Bayes examples document on the wiki needs to be updated for 0.3)
> to run step 5,
> "Create the countries based Split of wikipedia dataset",
> I use the following command:
> $HADOOP_HOME/bin/hadoop jar
> $MAHOUT_HOME/examples/target/mahout-examples-0.3.job
> org.apache.mahout.classifier.bayes.WikipediaDatasetCreatorDriver -i
> $MAHOUT_HOME/examples/work/wikipedia/chunks -o
> $MAHOUT_HOME/examples/work/wikipediainput -c
> $MAHOUT_HOME/examples/src/test/resources/country.txt
> and the job fails on Hadoop.
> The Hadoop log reports:
> Error:
> org.apache.lucene.wikipedia.analysis.WikipediaTokenizer.addAttribute(Ljava/lang/Class;)Lorg/apache/lucene/util/Attribute
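
The truncated line above is a JVM method descriptor: a method named addAttribute on WikipediaTokenizer that takes a java.lang.Class and returns a type whose name begins org/apache/lucene/util/Attribute. Errors of this shape often accompany a NoSuchMethodError, which would suggest that the Hadoop task loaded a different Lucene version than the one Mahout was built against. The log excerpt does not confirm this, so treat it as an assumption. A minimal Java sketch for checking it is below; the class ClasspathProbe and its helper methods are hypothetical names (not part of Mahout or Lucene) and use only standard reflection calls.

```java
import java.security.CodeSource;
import java.security.ProtectionDomain;

// Hypothetical diagnostic helper, not part of Mahout or Lucene.
public class ClasspathProbe {

    /** Returns the jar or directory a class was loaded from, or a placeholder if unknown. */
    public static String loadedFrom(Class<?> cls) {
        ProtectionDomain domain = cls.getProtectionDomain();
        CodeSource source = (domain == null) ? null : domain.getCodeSource();
        if (source == null || source.getLocation() == null) {
            // JDK classes loaded by the bootstrap loader have no code source.
            return "(bootstrap/unknown)";
        }
        return source.getLocation().toString();
    }

    /** Returns true if the class exposes a public method (declared or inherited) with this name and parameter types. */
    public static boolean hasMethod(Class<?> cls, String name, Class<?>... paramTypes) {
        try {
            cls.getMethod(name, paramTypes);
            return true;
        } catch (NoSuchMethodException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // Default to the class named in the error; another class name can be passed as the first argument.
        String className = args.length > 0
                ? args[0]
                : "org.apache.lucene.wikipedia.analysis.WikipediaTokenizer";
        try {
            // Load without running static initializers; only the class metadata is needed.
            Class<?> cls = Class.forName(className, false, ClasspathProbe.class.getClassLoader());
            System.out.println(className + " loaded from: " + loadedFrom(cls));
            System.out.println("has public addAttribute(Class): "
                    + hasMethod(cls, "addAttribute", Class.class));
        } catch (ClassNotFoundException | LinkageError e) {
            System.out.println(className + " could not be loaded: " + e);
        }
    }
}
```

Running this with the same classpath the failing task uses would show which jar supplies WikipediaTokenizer and whether the method from the error is visible there. If the reported location is not the Lucene jar bundled in the Mahout job file, a conflicting Lucene jar on the cluster is a plausible cause.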
--
This message was sent by Atlassian Jira
(v8.20.10#820010)