Joseph K. Bradley created SPARK-8419:
----------------------------------------

             Summary: Statistics.colStats could avoid an extra count()
                 Key: SPARK-8419
                 URL: https://issues.apache.org/jira/browse/SPARK-8419
             Project: Spark
          Issue Type: Improvement
          Components: MLlib
            Reporter: Joseph K. Bradley
            Priority: Trivial


Statistics.colStats goes through RowMatrix to compute the stats.  But 
RowMatrix.computeColumnSummaryStatistics does an extra count() which could be 
avoided.  Not going through RowMatrix would skip this extra pass over the data.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to