David Gingrich created SPARK-20232:
--------------------------------------

Summary: Better combineByKey documentation: clarify memory allocation, better example
Key: SPARK-20232
URL: https://issues.apache.org/jira/browse/SPARK-20232
Project: Spark
Issue Type: Improvement
Components: Documentation
Affects Versions: 2.1.0
Environment: macOS Sierra 10.12.4
             Spark 2.1.0 installed via Homebrew
Reporter: David Gingrich
Priority: Trivial

The combineByKey docs have a few flaws:
- They don't include the note about memory allocation (the one that appears on aggregateByKey).
- The example doesn't show the difference between mergeValue and mergeCombiners (both are add).

I have a trivial patch and will attach it momentarily.
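
To illustrate the two points raised in this issue, here is a minimal sketch in plain Python that simulates the combineByKey semantics without Spark. It is not Spark's implementation and not the attached patch; the helper `combine_by_key`, the functions `to_list`, `append`, and `extend`, and the partition layout are all hypothetical names chosen for illustration. The example collects values into lists, so mergeValue (combiner plus one value) and mergeCombiners (combiner plus combiner) must differ, and both modify and return their first argument rather than allocating a new list.

```python
def combine_by_key(partitions, create_combiner, merge_value, merge_combiners):
    """Simulate combineByKey over a list of partitions (plain Python, not Spark)."""
    # Step 1: combine within each partition using create_combiner / merge_value.
    per_partition = []
    for part in partitions:
        combiners = {}
        for key, value in part:
            if key in combiners:
                combiners[key] = merge_value(combiners[key], value)
            else:
                combiners[key] = create_combiner(value)
        per_partition.append(combiners)

    # Step 2: merge the per-partition combiners using merge_combiners.
    merged = {}
    for combiners in per_partition:
        for key, combiner in combiners.items():
            if key in merged:
                merged[key] = merge_combiners(merged[key], combiner)
            else:
                merged[key] = combiner
    return merged


def to_list(value):
    # createCombiner: turn the first value seen for a key into a combiner.
    return [value]


def append(combiner, value):
    # mergeValue: add a single value to an existing combiner.
    # Modifies and returns the first argument instead of building a new list.
    combiner.append(value)
    return combiner


def extend(left, right):
    # mergeCombiners: merge two combiners built in different partitions.
    # Also modifies and returns the first argument.
    left.extend(right)
    return left


# Hypothetical data: two partitions, with key "a" present in both.
partitions = [[("a", 1), ("b", 2)], [("a", 3), ("a", 4)]]
result = combine_by_key(partitions, to_list, append, extend)
print(result)  # {'a': [1, 3, 4], 'b': [2]}
```

Swapping `append` and `extend` here would give wrong results or raise an error, which is the distinction an example that uses add for both functions cannot show.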