The vectors are having keys added during the MR version, which the
reference in the test doesn't expect.  See attached screenshots of
variables during debugging.


On Sat, Jan 25, 2014 at 9:29 AM, Andrew Musselman <
andrew.mussel...@gmail.com> wrote:

> Still trying to understand what these tests are doing, but that is only
> blowing up when that is called
> during testVectorClassificationWithOutlierRemoval*MR*.  Runs fine
> during testVectorClassificationWithOutlierRemoval.
>
>
> On Sat, Jan 25, 2014 at 9:14 AM, Andrew Musselman <
> andrew.mussel...@gmail.com> wrote:
>
>> Trying it out, found one test failure:
>>
>> Failed tests:
>>
>> ClusterClassificationDriverTest.testVectorClassificationWithOutlierRemovalMR:102->assertVectorsWithOutlierRemoval:188->checkClustersWithOutlierRemoval:238->Assert.assertTrue:41->Assert.fail:88
>> not expecting cluster:0:{0:1.0,1:1.0}
>>
>> Here's the stack trace when I run that test:
>>
>> java.lang.AssertionError: not expecting cluster:0:{0:1.0,1:1.0}
>> at __randomizedtesting.SeedInfo.seed([9DD682CDC661ECA:5DEF7B1855381EF]:0)
>>  at org.junit.Assert.fail(Assert.java:88)
>> at org.junit.Assert.assertTrue(Assert.java:41)
>> at
>> org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.checkClustersWithOutlierRemoval(ClusterClassificationDriverTest.java:238)
>>  at
>> org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.assertVectorsWithOutlierRemoval(ClusterClassificationDriverTest.java:188)
>> at
>> org.apache.mahout.clustering.classify.ClusterClassificationDriverTest.testVectorClassificationWithOutlierRemovalMR(ClusterClassificationDriverTest.java:102)
>>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>> at
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
>>  at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>>
>>
>> On Sat, Jan 25, 2014 at 1:31 AM, Suneel Marthi (JIRA) <j...@apache.org>wrote:
>>
>>>
>>>      [
>>> https://issues.apache.org/jira/browse/MAHOUT-1410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel]
>>>
>>> Suneel Marthi updated MAHOUT-1410:
>>> ----------------------------------
>>>
>>>     Affects Version/s:     (was: 0.9)
>>>
>>> > clusteredPoints do not contain a vector id
>>> > ------------------------------------------
>>> >
>>> >                 Key: MAHOUT-1410
>>> >                 URL: https://issues.apache.org/jira/browse/MAHOUT-1410
>>> >             Project: Mahout
>>> >          Issue Type: Bug
>>> >          Components: Clustering
>>> >    Affects Versions: 0.8
>>> >         Environment: using 0.9 release candidate
>>> >            Reporter: Pat Ferrel
>>> >            Assignee: Suneel Marthi
>>> >             Fix For: 0.9
>>> >
>>> >         Attachments: MAHOUT-1410.patch
>>> >
>>> >
>>> > When clustering non-named vectors there are no vector ids in
>>> clusteredPoints so the other values there, cluster id, vector values,
>>> distance-squared, pdf, cannot be tied to any known vector.
>>>
>>>
>>>
>>> --
>>> This message was sent by Atlassian JIRA
>>> (v6.1.5#6160)
>>>
>>
>>
>

Reply via email to