spark git commit: [SPARK-17549][SQL] Only collect table size stat in driver for cached relation.

2016-09-16 Thread yhuai
Repository: spark Updated Branches: refs/heads/master b9323fc93 -> 39e2bad6a [SPARK-17549][SQL] Only collect table size stat in driver for cached relation. The existing code caches all stats for all columns for each partition in the driver; for a large relation, this causes extreme memory

spark git commit: [SPARK-17549][SQL] Only collect table size stat in driver for cached relation.

2016-09-16 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-2.0 5ad4395e1 -> 3fce1255a [SPARK-17549][SQL] Only collect table size stat in driver for cached relation. The existing code caches all stats for all columns for each partition in the driver; for a large relation, this causes extreme memory

spark git commit: [SPARK-17561][DOCS] DataFrameWriter documentation formatting problems

2016-09-16 Thread rxin
Repository: spark Updated Branches: refs/heads/master dca771bec -> b9323fc93 [SPARK-17561][DOCS] DataFrameWriter documentation formatting problems ## What changes were proposed in this pull request? Fix ` / ` problems in SQL scaladoc. ## How was this patch tested? Scaladoc build and manual

spark git commit: [SPARK-17558] Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-16 Thread rxin
Repository: spark Updated Branches: refs/heads/branch-2.0 9c23f4408 -> 5ad4395e1 [SPARK-17558] Bump Hadoop 2.7 version from 2.7.2 to 2.7.3 ## What changes were proposed in this pull request? This patch bumps the Hadoop version in hadoop-2.7 profile from 2.7.2 to 2.7.3, which was recently

spark git commit: [SPARK-17558] Bump Hadoop 2.7 version from 2.7.2 to 2.7.3

2016-09-16 Thread rxin
Repository: spark Updated Branches: refs/heads/master a425a37a5 -> dca771bec [SPARK-17558] Bump Hadoop 2.7 version from 2.7.2 to 2.7.3 ## What changes were proposed in this pull request? This patch bumps the Hadoop version in hadoop-2.7 profile from 2.7.2 to 2.7.3, which was recently

spark git commit: [SPARK-17426][SQL] Refactor `TreeNode.toJSON` to avoid OOM when converting unknown fields to JSON

2016-09-16 Thread wenchen
Repository: spark Updated Branches: refs/heads/master fc1efb720 -> a425a37a5 [SPARK-17426][SQL] Refactor `TreeNode.toJSON` to avoid OOM when converting unknown fields to JSON ## What changes were proposed in this pull request? This PR is a follow up of SPARK-17356. Current implementation of

spark git commit: [SPARK-17534][TESTS] Increase timeouts for DirectKafkaStreamSuite tests

2016-09-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master b2e272624 -> fc1efb720 [SPARK-17534][TESTS] Increase timeouts for DirectKafkaStreamSuite tests **## What changes were proposed in this pull request?** There are two tests in this suite that are particularly flaky on the following

spark git commit: [SPARK-17543] Missing log4j config file for tests in common/network-…

2016-09-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master b72486f82 -> b2e272624 [SPARK-17543] Missing log4j config file for tests in common/network-… ## What changes were proposed in this pull request? The Maven module `common/network-shuffle` does not have a log4j configuration file for its