This is an automated email from the ASF dual-hosted git repository.

cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.


    from 197732e  [SPARK-29125][INFRA] Add Hadoop 2.7 combination to GitHub Action
     add 05988b2  [SPARK-27463][PYTHON] Support Dataframe Cogroup via Pandas UDFs

No new revisions were added by this update.

Summary of changes:
 .../org/apache/spark/api/python/PythonRunner.scala |   2 +
 python/pyspark/rdd.py                              |   1 +
 python/pyspark/serializers.py                      |  26 ++
 python/pyspark/sql/cogroup.py                      |  98 ++++++++
 python/pyspark/sql/functions.py                    |   5 +-
 python/pyspark/sql/group.py                        |  12 +-
 .../sql/tests/test_pandas_udf_cogrouped_map.py     | 280 +++++++++++++++++++++
 python/pyspark/sql/udf.py                          |  19 ++
 python/pyspark/worker.py                           |  98 +++++++-
 .../spark/sql/catalyst/analysis/Analyzer.scala     |   6 +
 .../plans/logical/pythonLogicalOperators.scala     |  19 +-
 .../spark/sql/RelationalGroupedDataset.scala       |  46 +++-
 .../spark/sql/execution/SparkStrategies.scala      |   3 +
 .../sql/execution/python/ArrowPythonRunner.scala   |  76 +-----
 ...honRunner.scala => BaseArrowPythonRunner.scala} |  92 +------
 .../sql/execution/python/BasePandasGroupExec.scala | 137 ++++++++++
 .../python/CogroupedArrowPythonRunner.scala        | 113 +++++++++
 .../python/FlatMapCoGroupsInPandasExec.scala       |  97 +++++++
 .../python/FlatMapGroupsInPandasExec.scala         |  98 +-------
 19 files changed, 965 insertions(+), 263 deletions(-)
 create mode 100644 python/pyspark/sql/cogroup.py
 create mode 100644 python/pyspark/sql/tests/test_pandas_udf_cogrouped_map.py
 copy sql/core/src/main/scala/org/apache/spark/sql/execution/python/{ArrowPythonRunner.scala => BaseArrowPythonRunner.scala} (51%)
 create mode 100644 sql/core/src/main/scala/org/apache/spark/sql/execution/python/BasePandasGroupExec.scala
 create mode 100644 sql/core/src/main/scala/org/apache/spark/sql/execution/python/CogroupedArrowPythonRunner.scala
 create mode 100644 sql/core/src/main/scala/org/apache/spark/sql/execution/python/FlatMapCoGroupsInPandasExec.scala


---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org
