Hayri Volkan Agun created SPARK-20268:
-----------------------------------------
Summary: Arbitrary RDD element (Fast return) instead of using first
Key: SPARK-20268
URL: https://issues.apache.org/jira/browse/SPARK-20268
Project: Spark
Issue Type: Improvement
Components: ML, Spark Core
Affects Versions: 2.1.0, 2.0.1, 2.0.0
Reporter: Hayri Volkan Agun
Priority: Minor
Most of the ML and MLLIB algorithms somehow need the column size of the rdd
vector (RDD[Vector]). So instead of getting the first element by rdd.first(), a
fast return can be made to calculate the length of the vector of a arbitrary
rdd element. It can also be be named any().
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]