[ 
https://issues.apache.org/jira/browse/SOLR-10786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16346164#comment-16346164
 ] 

Joel Bernstein commented on SOLR-10786:
---------------------------------------

I did some benchmarking and DBSCAN is very fast with 2 dimensional vectors. So, 
lat/long DBSCAN clustering is looking like a very promising use case. I'll also 
add haversinMeter and haversinKilometer distance to the distance() function, to 
support the creation of distance matrices. This will open the door to other 
machine learning algorithms on lat/long data.

> Add DBSCAN clustering Streaming Evaluator
> -----------------------------------------
>
>                 Key: SOLR-10786
>                 URL: https://issues.apache.org/jira/browse/SOLR-10786
>             Project: Solr
>          Issue Type: New Feature
>      Security Level: Public(Default Security Level. Issues are Public) 
>            Reporter: Joel Bernstein
>            Assignee: Joel Bernstein
>            Priority: Major
>             Fix For: master (8.0), 7.3
>
>         Attachments: SOLR-10786.patch, SOLR-10786.patch, SOLR-10786.patch
>
>
> The DBSCAN clustering Stream Evaluator will cluster numeric vectors using the 
> DBSCAN clustering algorithm.
> Clustering implementation will be provided by Apache Commons Math.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

Reply via email to