[ 
https://issues.apache.org/jira/browse/MAHOUT-552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935407#action_12935407
 ] 

Pere Ferrera Bertran commented on MAHOUT-552:
---------------------------------------------

Thanks for your observations, Jeff. Then I guess the problem I am reporting is 
specific to certain clustering algorithms. Concretely, I am using Mean Shift 
Clustering, and there is no way I can preserve vector names in -cl mode. I am 
using the latest code (0.5 snapshot).

In MeanShiftCanopyClusterMapper there seems to be some sort of equivalence 
between input vectors and canopies. As far as I can see, the vector that is 
output to clusteredPoints is canopy.getCenter(). Is this right?
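
To make the symptom concrete, here is a minimal, self-contained sketch of how 
a name can get lost when a vector is copied into a plain sparse vector. The 
classes below (Vec, SparseVec, NamedVec) and the two copy methods are 
hypothetical stand-ins written only for illustration. They are not Mahout's 
classes and do not reproduce the actual code in AbstractCluster or 
MeanShiftCanopyClusterMapper.

```java
import java.util.HashMap;
import java.util.Map;

public class NameLossSketch {

    /** Hypothetical minimal vector interface (not Mahout's Vector). */
    public interface Vec {
        int size();
        double get(int index);
    }

    /** Stand-in for a sparse vector: stores only the non-zero entries. */
    public static class SparseVec implements Vec {
        private final int size;
        private final Map<Integer, Double> values = new HashMap<>();

        public SparseVec(int size) {
            this.size = size;
        }

        public void set(int index, double value) {
            if (value == 0.0) {
                values.remove(index);
            } else {
                values.put(index, value);
            }
        }

        @Override
        public int size() {
            return size;
        }

        @Override
        public double get(int index) {
            return values.getOrDefault(index, 0.0);
        }
    }

    /** Stand-in for a named vector: a name wrapped around a delegate. */
    public static class NamedVec implements Vec {
        private final String name;
        private final Vec delegate;

        public NamedVec(String name, Vec delegate) {
            this.name = name;
            this.delegate = delegate;
        }

        public String getName() {
            return name;
        }

        @Override
        public int size() {
            return delegate.size();
        }

        @Override
        public double get(int index) {
            return delegate.get(index);
        }
    }

    /** Copies only the values into a plain sparse vector, so any name is dropped. */
    public static Vec copyAsSparse(Vec source) {
        SparseVec copy = new SparseVec(source.size());
        for (int i = 0; i < source.size(); i++) {
            copy.set(i, source.get(i));
        }
        return copy;
    }

    /** Copies the values and re-wraps the copy with the name if the source had one. */
    public static Vec copyPreservingName(Vec source) {
        Vec copy = copyAsSparse(source);
        if (source instanceof NamedVec) {
            return new NamedVec(((NamedVec) source).getName(), copy);
        }
        return copy;
    }

    /** Returns the vector's name, or null if it is not a NamedVec. */
    public static String nameOf(Vec v) {
        return (v instanceof NamedVec) ? ((NamedVec) v).getName() : null;
    }

    public static void main(String[] args) {
        SparseVec data = new SparseVec(3);
        data.set(1, 2.5);
        Vec named = new NamedVec("doc-1", data);

        System.out.println(nameOf(copyAsSparse(named)));
        System.out.println(nameOf(copyPreservingName(named)));
    }
}
```

In this sketch the values survive both copies, but only the second one keeps 
the name. Whether a similar re-wrapping is appropriate for Mean Shift, where 
the output vector may be a computed center rather than an input point, is 
exactly what I am asking about.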

> AbstractCluster eliminates NamedVectors by replacing them with 
> RandomAccessSparseVector always
> ----------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-552
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-552
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.5
>            Reporter: Pere Ferrera Bertran
>             Fix For: 0.5
>
>         Attachments: MAHOUT-552.patch
>
>
> When clustering using NamedVectors as input - after running seq2sparse with 
> patch https://issues.apache.org/jira/browse/MAHOUT-401 - names are lost 
> because AbstractCluster replaces vectors coming in the constructor with 
> RandomAccessSparseVector.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
