[ 
https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13115998#comment-13115998
 ] 

Frank Scholten commented on MAHOUT-778:
---------------------------------------

+1 for the done marker

I modified ClustersFilter from the examples module (used by the Display* classes).

You can pass a Configuration object and a cluster output Path to the 
constructor and it will accept the last iteration path if the done marker is 
present. Otherwise it accepts 'clusters-0'.
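
The selection rule described above can be sketched roughly as follows. This is not the code from the attached patch: it uses java.nio.file on a local directory as a stand-in for Hadoop's FileSystem and PathFilter, and the class name LastIterationFinder, the marker file name "_done", and its location in the cluster output directory are all assumptions made for illustration.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

final class LastIterationFinder {

    // Hypothetical marker name and location; the real patch may differ.
    static final String DONE_MARKER = "_done";
    static final String PREFIX = "clusters-";

    /**
     * Returns the highest-numbered clusters-N directory when the done
     * marker exists in the output directory, otherwise clusters-0.
     */
    static Path select(Path clusterOutput) throws IOException {
        Path best = clusterOutput.resolve(PREFIX + "0");
        if (!Files.exists(clusterOutput.resolve(DONE_MARKER))) {
            return best; // clustering not finished: fall back to clusters-0
        }
        int bestIteration = 0;
        try (DirectoryStream<Path> dirs =
                 Files.newDirectoryStream(clusterOutput, PREFIX + "*")) {
            for (Path dir : dirs) {
                String suffix =
                    dir.getFileName().toString().substring(PREFIX.length());
                // Compare numerically so that clusters-10 beats clusters-2.
                if (suffix.matches("\\d{1,9}")
                        && Integer.parseInt(suffix) > bestIteration) {
                    bestIteration = Integer.parseInt(suffix);
                    best = dir;
                }
            }
        }
        return best;
    }
}
```

Calling LastIterationFinder.select(outputDir) before the marker is written yields outputDir/clusters-0; once the marker exists it yields the directory with the largest iteration number.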

Let me know what you think. 

There is still the problem that you need to pass the exact last iteration 
path to ClusterDumper. If the last iteration path were renamed to 
'clusters-n-done', you could use a glob on the Mahout command line, 
--output=clusters-*-done, but this would break existing code that relies 
on the current folder names.
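
To illustrate what such a glob would and would not match, here is a small sketch. It uses java.nio's glob matcher rather than Hadoop's own glob handling, and the class and method names are hypothetical. It also shows the compatibility concern: a plain 'clusters-n' name no longer matches.

```java
import java.nio.file.FileSystems;
import java.nio.file.PathMatcher;
import java.nio.file.Paths;

final class DoneGlob {

    // Same pattern as the proposed --output=clusters-*-done glob.
    private static final PathMatcher DONE =
        FileSystems.getDefault().getPathMatcher("glob:clusters-*-done");

    /** True if the directory name follows the proposed 'clusters-n-done' form. */
    static boolean isFinalIteration(String dirName) {
        return DONE.matches(Paths.get(dirName));
    }
}
```

With this matcher, "clusters-7-done" is accepted while "clusters-7" is rejected, so anything that still looks for the unrenamed folder would have to be updated.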
                
> Mark folder name of final clustering iteration with pattern such as 
> 'cluster-n-last'
> ------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-778
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-778
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>    Affects Versions: 0.5
>            Reporter: Frank Scholten
>            Assignee: Robin Anil
>            Priority: Minor
>             Fix For: 0.6
>
>         Attachments: MAHOUT-778-ClustersFilter.patch, MAHOUT-778.patch
>
>
> It would be useful if KMeans and FuzzyKMeans marked the last cluster 
> iteration folder with a pattern such as 'cluster-n-last'.
> At the moment it is difficult to configure other programs to process 
> clustering results since the number of actual iterations is not known up 
> front.
> A PathFilter similar to ClustersFilter could be created which filters folders 
> on the pattern 'cluster-*-last' in order to determine the folder.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
