GitHub user ilganeli opened a pull request:
https://github.com/apache/spark/pull/4895
[SPARK-3533] Add saveAsTextFileByKey() method to RDDs
This patch adds a saveAsTextFileByKey() method that saves an RDD as
multiple text files, split by key. I've included a test suite that
should verify its functionality.
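For reviewers who want a concrete picture of the intended behavior, here is a minimal plain-Python sketch of "one text file per key". It is not the code in this patch and does not use Spark. The function name save_as_text_file_by_key, its parameters, and the choice to name each file after its key are assumptions made for illustration only, and keys are assumed to be safe to use as file names.

```python
import os
import tempfile
from collections import defaultdict


def save_as_text_file_by_key(records, out_dir):
    """Write (key, value) pairs under out_dir, one text file per distinct key.

    Hypothetical helper for illustration; assumes every key is a valid
    file name on the local filesystem.
    """
    # Group values by key, preserving the order in which they were seen.
    grouped = defaultdict(list)
    for key, value in records:
        grouped[str(key)].append(str(value))

    os.makedirs(out_dir, exist_ok=True)

    # Write one file per key, one value per line.
    paths = {}
    for key, values in grouped.items():
        path = os.path.join(out_dir, key)
        with open(path, "w", encoding="utf-8") as f:
            for v in values:
                f.write(v + "\n")
        paths[key] = path
    return paths


if __name__ == "__main__":
    pairs = [("a", "1"), ("b", "2"), ("a", "3")]
    with tempfile.TemporaryDirectory() as d:
        written = save_as_text_file_by_key(pairs, d)
        for key in sorted(written):
            with open(written[key], encoding="utf-8") as f:
                print(key, f.read().split())
```

In a distributed setting the grouping would happen per partition and the writes would go through the Hadoop output layer, so the sketch only conveys the output layout, not how the patch achieves it.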
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/ilganeli/spark SPARK-3533
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/4895.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #4895
----
commit b56b950f6b4e88e590706f48ec32b01d00bf29d7
Author: Ilya Ganelin <[email protected]>
Date: 2014-12-15T18:50:01Z

    Initial stub

commit 757ab3ded6892a840e925c4c59c5061be091023d
Author: Ilya Ganelin <[email protected]>
Date: 2014-12-28T16:35:33Z

    Updating tests

commit cd4f732d9b7c8271c757db14bcde85166caa302b
Author: Ilya Ganelin <[email protected]>
Date: 2014-12-28T16:35:35Z

    Merge remote-tracking branch 'upstream/master' into SPARK-3533

commit 37132457688935c2791c9c9701a7b5af4ff91dac
Author: Ilya Ganelin <[email protected]>
Date: 2015-01-05T21:17:31Z

    Test still failing during reflection processing in hadoop utils

commit 5e615a2b734abbe74bac6a8ef83335c0c392ab41
Author: Ilya Ganelin <[email protected]>
Date: 2015-01-06T16:09:35Z

    Added init function to try to resolve reflection error

commit 3defe515ff7e337945f7687010e56246498c7ada
Author: Ilya Ganelin <[email protected]>
Date: 2015-01-15T22:59:59Z

    Attempting fix

commit 8bc757e5a3e9c8393c2cea9f21b431bb624aec41
Author: Ilya Ganelin <[email protected]>
Date: 2015-03-04T17:31:16Z

    Merge remote-tracking branch 'upstream/master' into SPARK-3533

commit a8199f6dd4a7518816d9be32963647d27aa282b2
Author: Ilya Ganelin <[email protected]>
Date: 2015-03-04T21:24:39Z

    Updated to fix init bug

commit 09f6756b869feef1ad4123ff01ddd33629a67237
Author: Ilya Ganelin <[email protected]>
Date: 2015-03-04T21:24:40Z

    Merge remote-tracking branch 'upstream/master' into SPARK-3533

commit aa1e6dcb8f9711a08850648fbc7bbe3caf760723
Author: Ilya Ganelin <[email protected]>
Date: 2015-03-04T22:03:07Z

    Restored lost UDF Reg
----