GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/4296
[SQL] Improve DataFrame API error reporting
1. Throw UnsupportedOperationException if a Column is not computable.
2. Perform eager analysis on DataFrame so we can catch errors when they
happen (not when an action is run).
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rxin/spark col-computability
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/4296.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #4296
----
commit ee2b1e4de01c1e876949428c719d12d0b4e40135
Author: Reynold Xin <[email protected]>
Date: 2015-01-31T06:04:18Z
[SQL][API] ComputableColumn vs IncomputableColumn
This patch changes Column from a concrete implementation to a trait, and
provides two
concrete implementations: IncomputableColumn and ComputableColumn.
commit 265fde7a15e2a8492c2ca96a863754b9c1bda678
Author: Reynold Xin <[email protected]>
Date: 2015-01-31T06:27:42Z
Added eager analysis for error reporting.
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]