David Gingrich created SPARK-20232:
--------------------------------------
Summary: Better combineByKey documentation: clarify memory
allocation, better example
Key: SPARK-20232
URL: https://issues.apache.org/jira/browse/SPARK-20232
Project: Spark
Issue Type: Improvement
Components: Documentation
Affects Versions: 2.1.0
Environment: macOS Sierra 10.12.4
Spark 2.1.0 installed via Homebrew
Reporter: David Gingrich
Priority: Trivial
combineByKey docs has a few flaws:
- Doesn't include note about memory allocation (on aggregateBykey)
- Example doesn't show difference between mergeValue and mergeCombiners (both
are add)
I have a trivial patch, will attach momentarily.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]