GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/7755
[SPARK-7157][SQL] add sampleBy to DataFrame
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/mengxr/spark SPARK-7157
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/7755.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #7755
----
commit 832f7cc34dec264d14e02ca0bff4373924e62372
Author: Xiangrui Meng <[email protected]>
Date: 2015-06-11T23:28:49Z
add sampleBy to DataFrame
commit 4a14834f74f3edd45403473c251b9b4e09ad034a
Author: Xiangrui Meng <[email protected]>
Date: 2015-06-11T23:46:08Z
move sampleBy to stat
commit 991f26f4ca51d8e7a214c0da51cabde3ced9169d
Author: Xiangrui Meng <[email protected]>
Date: 2015-06-12T01:49:29Z
fix seed
commit 103beb3782a54d85bdc89853ea98ee5e3eecba63
Author: Xiangrui Meng <[email protected]>
Date: 2015-06-24T18:14:01Z
add Java-friendly sampleBy
----
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]