GitHub user mn-mikke opened a pull request:
https://github.com/apache/spark/pull/20858
[SPARK-23736][SQL] Implementation of the concat_arrays function
concatenating multiple array columns into one.
## What changes were proposed in this pull request?
The PR adds a logic for easy concatenation of multiple array columns and
covers:
- Generalization of null save evaluation for multiple children expressions
- An expression for array concatenation
- A Python wrapper
## How was this patch tested?
New tests added into:
- CollectionExpressionsSuite
- DataFrameFunctionsSuite
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/AbsaOSS/spark
feature/array-api-concat_arrays-to-master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/20858.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #20858
----
commit 282e7249c2ed35add40f54087423ca62732b6046
Author: mn-mikke <mrkaha12346github>
Date: 2018-03-13T20:35:45Z
[SPARK-23736][SQL] Implementation of the concat_arrays function
concatenating multiple array columns into one.
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]