GitHub user cloud-fan opened a pull request:
https://github.com/apache/spark/pull/19740
[SPARK-22514][SQL] move ColumnVector.Array and ColumnarBatch.Row to
individual files
## What changes were proposed in this pull request?
Logically the `Array` doesn't belong to `ColumnVector`, and `Row` doesn't
belong to `ColumnarBatch`. e.g. `ColumnVector` needs to return `Array` for
`getArray`, and `Row` for `getStruct`. `Array` and `Row` can return each other
with the `getArray`/`getStruct` methods.
This is also a step to make `ColumnVector` public, it's cleaner to have
`Array` and `Row` as top-level classes.
## How was this patch tested?
existing tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/cloud-fan/spark vector
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/19740.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #19740
----
commit ba28709955cd8ec1433c3f66df761e12a2a65d77
Author: Wenchen Fan <[email protected]>
Date: 2017-11-14T00:58:05Z
move ColumnVector.Array and ColumnarBatch.Row to individual files
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]