[
https://issues.apache.org/jira/browse/SPARK-6810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14790654#comment-14790654
]
Shivaram Venkataraman commented on SPARK-6810:
----------------------------------------------
So the DataFrame API doesn't need much of performance benchmarks as we mostly
wrap all our calls to Java / Scala - However we are adding new ML API
components and [~mengxr] will be able to provide more guidance for this.
cc [~sunrui]
> Performance benchmarks for SparkR
> ---------------------------------
>
> Key: SPARK-6810
> URL: https://issues.apache.org/jira/browse/SPARK-6810
> Project: Spark
> Issue Type: New Feature
> Components: SparkR
> Reporter: Shivaram Venkataraman
> Priority: Critical
>
> We should port some performance benchmarks from spark-perf to SparkR for
> tracking performance regressions / improvements.
> https://github.com/databricks/spark-perf/tree/master/pyspark-tests has a list
> of PySpark performance benchmarks
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]