Marco Gaido created SPARK-22119: ----------------------------------- Summary: Add cosine distance to KMeans Key: SPARK-22119 URL: https://issues.apache.org/jira/browse/SPARK-22119 Project: Spark Issue Type: New Feature Components: ML, MLlib Affects Versions: 2.2.0 Reporter: Marco Gaido Priority: Minor
Currently, KMeans assumes the only possible distance measure to be used is the Euclidean. In some use cases, eg. text mining, other distance measures like the cosine distance are widely used. Thus, for such use cases, it would be good to support multiple distance measures. This ticket is to support the cosine distance measure on KMeans. Later, other algorithms can be extended to support several distance measures and other distance measures can be added. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org