GitHub user ueshin opened a pull request:
https://github.com/apache/spark/pull/21802
[SPARK-23928][SQL] Add shuffle collection function.
## What changes were proposed in this pull request?
This PR adds a new collection function: shuffle. It generates a random
permutation of the given array.
## How was this patch tested?
New tests are added to CollectionExpressionsSuite.scala and
DataFrameFunctionsSuite.scala.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ueshin/apache-spark issues/SPARK-23928/shuffle
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21802.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21802
----
commit a3dbd93c0acbb2a3f3fb50574ae1e126c66c4d2d
Author: pkuwm <ihuizhi.lu@...>
Date: 2018-07-17T23:18:03Z
Add shuffle collection function.
commit b4cbb5558088356fe6be1cda053c9f91fbe7c538
Author: Takuya UESHIN <ueshin@...>
Date: 2018-07-18T12:17:59Z
Refactor Shuffle function.
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]