[ https://issues.apache.org/jira/browse/MAHOUT-552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12935407#action_12935407 ]
Pere Ferrera Bertran commented on MAHOUT-552:
---------------------------------------------

Thanks for your observations, Jeff.

Then I guess the problem I am reporting is specific to certain clustering algorithms. Concretely, I am using Mean Shift Clustering, and there is no way I can preserve vector names in -cl mode. I am using the latest code (0.5 snapshot).

In MeanShiftCanopyClusterMapper there is some sort of equivalence between input vectors and canopies. I can see that the vector written to clusteredPoints is canopy.getCenter(). Is this right?

> AbstractCluster eliminates NamedVectors by replacing them with
> RandomAccessSparseVector always
> ----------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-552
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-552
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.5
>            Reporter: Pere Ferrera Bertran
>             Fix For: 0.5
>
>         Attachments: MAHOUT-552.patch
>
>
> When clustering using NamedVectors as input - after running seq2sparse with
> patch https://issues.apache.org/jira/browse/MAHOUT-401 - names are lost
> because AbstractCluster replaces vectors coming in the constructor with
> RandomAccessSparseVector.
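For readers following the thread, the failure mode described in the issue can be sketched in a few lines of self-contained Java. This is only an illustration under stated assumptions: Vec, PlainVec, NamedVec and NameLossSketch are hypothetical stand-ins written for this example, not Mahout's Vector, RandomAccessSparseVector, NamedVector or AbstractCluster, and the "keeping" variant is one possible approach, not necessarily what the attached MAHOUT-552.patch does.

```java
// Hypothetical stand-ins for illustration only; these are NOT the Mahout classes.
interface Vec {
    double get(int i);
    int size();
}

// A plain vector that carries values but no name.
final class PlainVec implements Vec {
    private final double[] values;

    PlainVec(double[] values) {
        this.values = values.clone();
    }

    // Copy constructor: copies element values only, so any name on 'other' is dropped.
    PlainVec(Vec other) {
        values = new double[other.size()];
        for (int i = 0; i < values.length; i++) {
            values[i] = other.get(i);
        }
    }

    public double get(int i) { return values[i]; }
    public int size() { return values.length; }
}

// A wrapper that attaches a name to a delegate vector.
final class NamedVec implements Vec {
    private final Vec delegate;
    private final String name;

    NamedVec(Vec delegate, String name) {
        this.delegate = delegate;
        this.name = name;
    }

    String getName() { return name; }
    public double get(int i) { return delegate.get(i); }
    public int size() { return delegate.size(); }
}

final class NameLossSketch {
    // Always re-wraps the input in a plain vector: the name is lost.
    static Vec copyLosingName(Vec in) {
        return new PlainVec(in);
    }

    // Copies the values, then re-attaches the name if the input had one.
    static Vec copyKeepingName(Vec in) {
        Vec copy = new PlainVec(in);
        return (in instanceof NamedVec)
                ? new NamedVec(copy, ((NamedVec) in).getName())
                : copy;
    }

    // Returns the name of a vector, or null if it is not a named vector.
    static String nameOf(Vec v) {
        return (v instanceof NamedVec) ? ((NamedVec) v).getName() : null;
    }

    public static void main(String[] args) {
        Vec doc = new NamedVec(new PlainVec(new double[] {1.0, 0.0, 2.0}), "doc-42");
        System.out.println(nameOf(copyLosingName(doc)));  // prints "null"
        System.out.println(nameOf(copyKeepingName(doc))); // prints "doc-42"
    }
}
```

The point of the sketch is only that a value-only copy cannot carry the name forward; whether the name should be re-attached at construction time or tracked elsewhere is a design question for the actual clustering code.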