[ https://issues.apache.org/jira/browse/SPARK-6810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790700#comment-14790700 ]
Xiangrui Meng commented on SPARK-6810:
--------------------------------------
The ML part (GLM) is about the same: all computation is on the Scala side. I
would wait until 1.6 to benchmark GLM, because we are going to implement the
same algorithm as R in 1.6.
> Performance benchmarks for SparkR
> ---------------------------------
>
> Key: SPARK-6810
> URL: https://issues.apache.org/jira/browse/SPARK-6810
> Project: Spark
> Issue Type: New Feature
> Components: SparkR
> Reporter: Shivaram Venkataraman
> Priority: Critical
>
> We should port some performance benchmarks from spark-perf to SparkR to
> track performance regressions and improvements.
> https://github.com/databricks/spark-perf/tree/master/pyspark-tests has a list
> of PySpark performance benchmarks.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)