GitHub user holdenk opened a pull request:
https://github.com/apache/spark/pull/20908
[SPARK-23672][PYTHON] Document support for nested return types in scalar
with arrow udfs
## What changes were proposed in this pull request?
Clarify docstring for Scalar functions
## How was this patch tested?
Adds a unit test showing use similar to wordcount, there's existing unit
test for array of floats as well.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/holdenk/spark
SPARK-23672-document-support-for-nested-return-types-in-scalar-with-arrow-udfs
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20908.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20908
----
commit da8dbafcee753ebb2f7f3a205d0b378c240a3469
Author: Holden Karau <holden@...>
Date: 2018-03-26T18:18:34Z
Demonstrate tokenize udf
commit 342d2228a5c68fd2c07bd8c1b518da6135ce1bf6
Author: Holden Karau <holden@...>
Date: 2018-03-26T18:20:51Z
Long lines are bad, kthnx
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]