[
https://issues.apache.org/jira/browse/SPARK-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14512843#comment-14512843
]
Debasish Das commented on SPARK-5992:
-------------------------------------
Did someone compared algebird LSH with spark minhash link above ? Unless
algebird is slow (which I found for TopK monoid) we should use it the same way
HLL is being used in Spark streaming ? Is it ok to add algebird to mllib ?
> Locality Sensitive Hashing (LSH) for MLlib
> ------------------------------------------
>
> Key: SPARK-5992
> URL: https://issues.apache.org/jira/browse/SPARK-5992
> Project: Spark
> Issue Type: New Feature
> Components: MLlib
> Affects Versions: 1.4.0
> Reporter: Joseph K. Bradley
>
> Locality Sensitive Hashing (LSH) would be very useful for ML. It would be
> great to discuss some possible algorithms here, choose an API, and make a PR
> for an initial algorithm.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]