GitHub user yongtang opened a pull request:
https://github.com/apache/spark/pull/11981
[SPARK-14163][CORE] SumEvaluator and countApprox cannot reliably handle
RDDs of size 1.
## What changes were proposed in this pull request?
This change fixes SPARK-14163, where `SumEvaluator` could not handle
`counter.count <= 1` because `degreesOfFreedom` requires `counter.count > 1`.
With this fix, the `counter.count <= 1` case is handled separately.
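
The guard described above can be illustrated with a minimal sketch. It is written in Python rather than Spark's Scala and is not the code from this pull request. The names `degrees_of_freedom`, `sum_interval`, and `conf_factor_for` are hypothetical, and returning an unbounded interval for `count <= 1` is an assumed, conservative choice; the actual patch may handle the case differently.

```python
import math


def degrees_of_freedom(count: int) -> int:
    """Degrees of freedom for a sample of `count` values; only defined for count > 1."""
    if count <= 1:
        raise ValueError("degrees of freedom requires count > 1")
    return count - 1


def sum_interval(sum_estimate, sum_stdev, count, conf_factor_for):
    """Return (low, high) bounds around sum_estimate.

    conf_factor_for is a hypothetical callable that maps degrees of freedom
    to a confidence multiplier (for example, a Student t quantile lookup).
    """
    if count <= 1:
        # Assumption: with at most one sample there is no variance estimate,
        # so report an unbounded interval instead of computing count - 1.
        return (-math.inf, math.inf)
    factor = conf_factor_for(degrees_of_freedom(count))
    return (sum_estimate - factor * sum_stdev, sum_estimate + factor * sum_stdev)
```

Because the `count <= 1` branch returns before `degrees_of_freedom` is called, the multiplier lookup never sees a non-positive value, which is the failure mode the pull request describes.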
## How was this patch tested?
Tested manually to confirm that no exception is thrown when computing
`degreesOfFreedom`.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/yongtang/spark SPARK-14163
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/11981.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #11981
----
commit f739dcd0bbb84bf5429ae75997b7d0c54c95ac22
Author: Yong Tang <[email protected]>
Date: 2016-03-26T23:19:54Z
[SPARK-14163][CORE] SumEvaluator and countApprox cannot reliably handle
RDDs of size 1.
This fix fixes issues in SPARK-14163 where SumEvaluator could not handle
counter.count of `<=` 1 as degreesOfFreedom requires counter.count > 1.
----