GitHub user guowei2 opened a pull request:
https://github.com/apache/spark/pull/2029
[SPARK-2873] [SQL] using ExternalAppendOnlyMap to resolve OOM when
aggregating
This is a new PR cloned from PR 1822.
It fixes a number of problems.
It reuses the CompactBuffer from Spark Core to save memory and pointer
dereferences, as in PR 1993.
Hive UDAFs do not support external aggregation, because the Hive
AggregationBuffer would need to be serializable and the Hive
GenericUDAFEvaluator implements no method for merging two evaluators.
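
To illustrate why external aggregation needs both a serializable buffer and a
merge operation, here is a minimal sketch of a spill-and-merge aggregation map.
It is not the code in this PR and not Spark's ExternalAppendOnlyMap: the class
name SpillableAggMap, the max_keys threshold (a stand-in for a real memory
limit), and the use of Python with pickle are all assumptions made for
illustration only.

```python
import os
import pickle
import tempfile


class SpillableAggMap:
    """Toy external aggregation map (hypothetical, for illustration only)."""

    def __init__(self, create_combiner, merge_value, merge_combiners, max_keys=2):
        self.create_combiner = create_combiner    # value -> new buffer
        self.merge_value = merge_value            # (buffer, value) -> buffer
        self.merge_combiners = merge_combiners    # (buffer, buffer) -> buffer
        self.max_keys = max_keys                  # stand-in for a memory limit
        self.current = {}                         # in-memory key -> buffer
        self.spills = []                          # paths of spilled maps on disk

    def insert(self, key, value):
        if key in self.current:
            self.current[key] = self.merge_value(self.current[key], value)
            return
        if len(self.current) >= self.max_keys:
            self._spill()
        self.current[key] = self.create_combiner(value)

    def _spill(self):
        # Buffers are written to disk here, so they must be serializable.
        fd, path = tempfile.mkstemp(suffix=".spill")
        with os.fdopen(fd, "wb") as f:
            pickle.dump(self.current, f)
        self.spills.append(path)
        self.current = {}

    def results(self):
        # The same key may appear in several spills, so two partial buffers
        # must be mergeable into one.
        merged = dict(self.current)
        for path in self.spills:
            with open(path, "rb") as f:
                spilled = pickle.load(f)
            os.remove(path)
            for key, buf in spilled.items():
                if key in merged:
                    merged[key] = self.merge_combiners(merged[key], buf)
                else:
                    merged[key] = buf
        self.spills = []
        return merged


def average_by_key(rows, max_keys=2):
    """Average per key, using a (sum, count) tuple as the aggregation buffer."""
    agg = SpillableAggMap(
        create_combiner=lambda v: (v, 1),
        merge_value=lambda buf, v: (buf[0] + v, buf[1] + 1),
        merge_combiners=lambda a, b: (a[0] + b[0], a[1] + b[1]),
        max_keys=max_keys,
    )
    for key, value in rows:
        agg.insert(key, value)
    return {key: s / n for key, (s, n) in agg.results().items()}
```

In this sketch an aggregate whose buffer cannot be pickled, or that has no
equivalent of merge_combiners, could not be spilled and recombined, which is
the limitation described above for Hive UDAFs. A real implementation would
also stream sorted spill files instead of loading each one fully into memory.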
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/guowei2/spark sql-memory-patch
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/2029.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2029
----
commit 5cbaaac088346077bcc27229073a5fd3de8eb3a3
Author: guowei2 <[email protected]>
Date: 2014-08-19T02:51:29Z
merge PR 1822
----