[ 
https://issues.apache.org/jira/browse/MAHOUT-825?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13120668#comment-13120668
 ] 

Paritosh Ranjan edited comment on MAHOUT-825 at 10/5/11 5:31 AM:
-----------------------------------------------------------------

I have attached a new patch (canopy-clusterFilter-t1), which checks whether it 
needs to apply dist < t1 or not. Which means, whether it needs to cluster all 
points, or, it should cluster only points within t1 ( based on user specified 
clusterFilter variable ).

I have also added test cases, demonstrating clustering of both cases, (based on 
clusterFilter variable)
1) all points being clustered
2) only points with dist < t1 being clustered
                
      was (Author: paritoshranjan):
    I have attached a new patch, which checks whether it needs to apply dist < 
t1 or not. Which means, whether it needs to cluster all points, or, it should 
cluster only points within t1 ( based on user specified clusterFilter variable 
).

I have also added test cases, demonstrating clustering of both cases, (based on 
clusterFilter variable)
1) all points being clustered
2) only points with dist < t1 being clustered
                  
> Canopies grouping records outside t1
> ------------------------------------
>
>                 Key: MAHOUT-825
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-825
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.6
>         Environment: windows, linux
>            Reporter: Paritosh Ranjan
>              Labels: features, newbie, patch
>             Fix For: 0.6
>
>         Attachments: canopy-clusterFilter-t1, canopy-outside-t1-points-patch-1
>
>
> While finding closest canopy, there is no check to ensure that it returns 
> canopies which are within distance t1 from the point. This results in 
> incorrect result i.e. Points outside t1 are grouped in canopies.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to