GitHub user hvanhovell opened a pull request:
https://github.com/apache/spark/pull/16466
[SPARK-19070] Clean-up dataset actions
## What changes were proposed in this pull request?
Dataset actions currently spin off a new `Dataframe` only to track query
execution. This PR simplifies this code path by using the
`Dataset.queryExecution` directly. This PR also merges the typed and untyped
action evaluation paths.
## How was this patch tested?
Existing tests.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/hvanhovell/spark SPARK-19070
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/16466.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #16466
----
commit dca1b56810cd3c3469f70cc653a985b78519f6c6
Author: Herman van Hovell <[email protected]>
Date: 2017-01-04T01:04:36Z
Clean-up dataset actions.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]