Trying it out, found one test failure:
Failed tests:
ClusterClassificationDriverTest.testVectorClassificationWithOutlierRemovalMR:102->assertVectorsWithOutlierRemoval:188->checkClustersWithOutlierRemoval:238->Assert.assertTrue:41->Assert.fail:88
not expecting cluster:0:{0:1.0,1:1.0}
Here's the stack trace when I run that test:
java.lang.AssertionError: not expecting cluster:0:{0:1.0,1:1.0}
at __randomizedtesting.SeedInfo.seed([9DD682CDC661ECA:5DEF7B1855381EF]:0)
at org.junit.Assert.fail(Assert.java:88)
at org.junit.Assert.assertTrue(Assert.java:41)
at
org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.checkClustersWithOutlierRemoval(ClusterClassificationDriverTest.java:238)
at
org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.assertVectorsWithOutlierRemoval(ClusterClassificationDriverTest.java:188)
at
org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.testVectorClassificationWithOutlierRemovalMR(ClusterClassificationDriverTest.java:102)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
On Sat, Jan 25, 2014 at 1:31 AM, Suneel Marthi (JIRA) <[email protected]>wrote:
>
> [
> https://issues.apache.org/jira/browse/MAHOUT-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>
> Suneel Marthi updated MAHOUT-1410:
> ----------------------------------
>
> Affects Version/s: (was: 0.9)
>
> > clusteredPoints do not contain a vector id
> > ------------------------------------------
> >
> > Key: MAHOUT-1410
> > URL: https://issues.apache.org/jira/browse/MAHOUT-1410
> > Project: Mahout
> > Issue Type: Bug
> > Components: Clustering
> > Affects Versions: 0.8
> > Environment: using 0.9 release candidate
> > Reporter: Pat Ferrel
> > Assignee: Suneel Marthi
> > Fix For: 0.9
> >
> > Attachments: MAHOUT-1410.patch
> >
> >
> > When clustering non-named vectors there are no vector ids in
> clusteredPoints so the other values there, cluster id, vector values,
> distance-squared, pdf, cannot be tied to any known vector.
>
>
>
> --
> This message was sent by Atlassian JIRA
> (v6.1.5#6160)
>