Reza Zadeh created SPARK-4823:
---------------------------------
Summary: rowSimilarities
Key: SPARK-4823
URL: https://issues.apache.org/jira/browse/SPARK-4823
Project: Spark
Issue Type: Improvement
Components: MLlib
Reporter: Reza Zadeh
RowMatrix has a columnSimilarities method to find cosine similarities between
columns.
A rowSimilarities method would be useful to find similarities between rows.
This is JIRA is to investigate which algorithms are suitable for such a method,
better than brute-forcing it. Note that when there are many rows (> 10^6), it
is unlikely that brute-force will be feasible, since the output will be of
order 10^12.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]