GitHub user ueshin opened a pull request:
https://github.com/apache/spark/pull/19505
[SPARK-20396][SQL][PySpark][FOLLOW-UP] groupby().apply() with pandas udf
## What changes were proposed in this pull request?
This is a follow-up of #18732.
This pr introduces `@pandas_grouped_udf` decorator for grouped vectorized
UDF instead of reusing `@pandas_udf` decorator.
## How was this patch tested?
Exisiting tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ueshin/apache-spark issues/SPARK-20396/fup1
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/19505.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #19505
----
commit 4d2bd959e1eeabb4f72cfbb52a374ce721030507
Author: Takuya UESHIN <[email protected]>
Date: 2017-10-16T06:45:55Z
Introduce `@pandas_grouped_udf` decorator for grouped vectorized UDF.
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]