Marco Gaido created SPARK-22119:
-----------------------------------
Summary: Add cosine distance to KMeans
Key: SPARK-22119
URL: https://issues.apache.org/jira/browse/SPARK-22119
Project: Spark
Issue Type: New Feature
Components: ML, MLlib
Affects Versions: 2.2.0
Reporter: Marco Gaido
Priority: Minor
Currently, KMeans assumes the only possible distance measure to be used is the
Euclidean.
In some use cases, eg. text mining, other distance measures like the cosine
distance are widely used. Thus, for such use cases, it would be good to support
multiple distance measures.
This ticket is to support the cosine distance measure on KMeans. Later, other
algorithms can be extended to support several distance measures and other
distance measures can be added.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]