Reza Zadeh created SPARK-4823:
---------------------------------

             Summary: rowSimilarities
                 Key: SPARK-4823
                 URL: https://issues.apache.org/jira/browse/SPARK-4823
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
            Reporter: Reza Zadeh


RowMatrix has a columnSimilarities method to find cosine similarities between 
columns.

A rowSimilarities method would be useful to find similarities between rows.

This is JIRA is to investigate which algorithms are suitable for such a method, 
better than brute-forcing it. Note that when there are many rows (> 10^6), it 
is unlikely that brute-force will be feasible, since the output will be of 
order 10^12.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to