[ 
https://issues.apache.org/jira/browse/MAHOUT-645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13013990#comment-13013990
 ] 

Sean Owen commented on MAHOUT-645:
----------------------------------

OK, would it be better to consider this an open issue for later, to cover 
updating the K-means code? Or would you rather get this in now as a prelude to 
that?

> Elkan distance optimization for VectorBenchmarks class
> ------------------------------------------------------
>
>                 Key: MAHOUT-645
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-645
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Clustering
>    Affects Versions: 0.4
>         Environment: Ubuntu Linux at Intel Core2 Duo P7450 @ 2.13GHz
>            Reporter: Gustavo Salazar Torres
>            Priority: Minor
>              Labels: centroid, clustering, elkan
>             Fix For: 0.4
>
>         Attachments: patches.zip
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> Implementation of the first lemma of Elkan's optimization:
> Given three points x, b, c (where b and c are centroids):
>     if d(b,c) >= 2d(x,b) then d(x,c) >= d(x,b)
> in which case we wouldn't need to calculate d(x,c). This is used to find the 
> closest centroid for every point x.
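
The lemma quoted above follows from the triangle inequality: 
d(b,c) <= d(b,x) + d(x,c), so d(x,c) >= d(b,c) - d(x,b) >= d(x,b) whenever 
d(b,c) >= 2d(x,b). The following is a minimal sketch of how that test can prune 
distance computations in a nearest-centroid search. It is not the code in 
patches.zip and does not use Mahout's API; the class name ElkanLemmaSketch, its 
method names, plain double[] vectors, and Euclidean distance are all 
assumptions made for illustration.

```java
// Hypothetical illustration only: not taken from the attached patch or from Mahout.
class ElkanLemmaSketch {

    /** Euclidean distance (assumed here; the lemma holds for any true metric). */
    static double distance(double[] a, double[] b) {
        double sum = 0.0;
        for (int i = 0; i < a.length; i++) {
            double diff = a[i] - b[i];
            sum += diff * diff;
        }
        return Math.sqrt(sum);
    }

    /** Precomputes d(b, c) for every pair of centroids; done once per iteration. */
    static double[][] centroidDistances(double[][] centroids) {
        int k = centroids.length;
        double[][] cc = new double[k][k];
        for (int i = 0; i < k; i++) {
            for (int j = i + 1; j < k; j++) {
                cc[i][j] = distance(centroids[i], centroids[j]);
                cc[j][i] = cc[i][j];
            }
        }
        return cc;
    }

    /**
     * Finds the closest centroid to x, skipping d(x, c) whenever
     * d(best, c) >= 2 * d(x, best).
     * Returns {index of closest centroid, number of point-to-centroid distances computed}.
     */
    static int[] nearestCentroid(double[] x, double[][] centroids, double[][] cc) {
        int best = 0;
        double bestDist = distance(x, centroids[0]);
        int computed = 1;
        for (int c = 1; c < centroids.length; c++) {
            // Lemma: c cannot be closer than the current best, so skip it.
            if (cc[best][c] >= 2.0 * bestDist) {
                continue;
            }
            double d = distance(x, centroids[c]);
            computed++;
            if (d < bestDist) {
                bestDist = d;
                best = c;
            }
        }
        return new int[] {best, computed};
    }
}
```

With centroids (0,0), (10,0) and (0,10), a point at (1,1) is assigned to the 
first centroid after a single distance computation, because both other 
centroids are at distance 10 >= 2 * sqrt(2) from it. The pairwise centroid 
table costs O(k^2) distances per iteration, so the saving only pays off when 
there are many more points than centroids.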

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
