Joseph K. Bradley created SPARK-18409:
-----------------------------------------
Summary: LSH approxNearestNeighbors should use approxQuantile
instead of sort
Key: SPARK-18409
URL: https://issues.apache.org/jira/browse/SPARK-18409
Project: Spark
Issue Type: Improvement
Components: ML
Reporter: Joseph K. Bradley
LSHModel.approxNearestNeighbors sorts the full dataset on the hashDistance in
order to find a threshold. It should use approxQuantile instead.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]