GitHub user peerside opened a pull request:
https://github.com/apache/spark/pull/4847
Refactored Dataframe join comment to use correct parameter ordering
The API signatire for join requires the JoinType to be the third parameter.
The code examples provided for join show JoinType being provided as the 2nd
parater resuling in errors (i.e. "df1.join(df2, "outer", $"df1Key" ===
$"df2Key") ). The correct sample code is df1.join(df2, $"df1Key" === $"df2Key",
"outer")
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/peerside/spark master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/4847.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #4847
----
commit e35334043148f79807e94d266f6ec649872a9dd7
Author: Paul Power <[email protected]>
Date: 2015-03-02T06:41:35Z
Updated comments use correct sample code for Dataframe joins
The API signatire for join requires the JoinType to be the third parameter.
The code examples provided for join show JoinType being provided as the 2nd
parater resuling in errors (i.e. "df1.join(df2, "outer", $"df1Key" ===
$"df2Key") ). The correct example should be df1.join(df2, $"df1Key" ===
$"df2Key", "outer")
commit ebc1efa773932ebe1769070b3c99d2f08873953c
Author: Paul Power <[email protected]>
Date: 2015-03-02T06:42:33Z
Merge pull request #1 from peerside/peerside-patch-1
Updated comments use correct sample code for Dataframe joins
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]