[
https://issues.apache.org/jira/browse/MAHOUT-1527?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14001162#comment-14001162
]
Andrew Palumbo commented on MAHOUT-1527:
----------------------------------------
Thanks! I have an other modification of the script that does binary
classification of United States and United Kingdom. I've noticed a lot of NB
binary classification questions lately.. Should I merge that into the script?
Also does this need documentation? It might be redundant with the 20 newsgoups
docs.
> Fix wikipedia classifier example
> --------------------------------
>
> Key: MAHOUT-1527
> URL: https://issues.apache.org/jira/browse/MAHOUT-1527
> Project: Mahout
> Issue Type: Task
> Components: Classification, Documentation, Examples
> Affects Versions: 0.7, 0.8, 0.9
> Reporter: Sebastian Schelter
> Fix For: 1.0
>
> Attachments: MAHOUT-1527.patch
>
>
> The examples package has a classification showcase for prediciting the labels
> of wikipedia pages. Unfortunately, the example is totally broken:
> It relies on the old NB implementation which has been removed, suggests to
> use the whole wikipedia as input, which will not work well on a single
> machine and the documentation uses commands that have long been removed from
> bin/mahout.
> The example needs to be updated to use the current naive bayes implementation
> and documentation on the website needs to be written.
--
This message was sent by Atlassian JIRA
(v6.2#6252)