GitHub user liancheng opened a pull request:
https://github.com/apache/spark/pull/11431
[SPARK-13553][SPARK-13554][SQL] Migrates basic inspection and typed
relational operations from DataFrame to Dataset
## What changes were proposed in this pull request?
This PR migrates basic inspection and typed relational operations from
DataFrame to Dataset. This is the first step toward unifying the DataFrame and
Dataset APIs.
## TODO
- [ ] Migrate `explode` operations.
## How was this patch tested?
The corresponding test cases are migrated from `DataFrameSuite` to
`DatasetSuite`. The newly added test cases all share the "df-to-ds" name
prefix, so they can be run on their own under SBT with:
```
sql/test-only *.DatasetSuite -- -z "df-to-ds"
```
This prefix will be removed once all the DataFrame operations have been migrated.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/liancheng/spark df-to-ds-typed-relational
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/11431.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #11431
----
commit 3f59569e8a0ab97beed2c6bb03c430b51fd94c3c
Author: Cheng Lian <[email protected]>
Date: 2016-02-28T08:50:32Z
Migrates basic DataFrame inspection methods to Dataset
commit f38c016faea7e881202762a3bdd0db07a17cce8f
Author: Cheng Lian <[email protected]>
Date: 2016-02-28T18:52:43Z
Migrates basic inspection and typed relational DF operations to DS
----