spark git commit: [SPARK-25159][SQL] json schema inference should only trigger one job

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 07737c87d -> 4a9c9d8f9 [SPARK-25159][SQL] json schema inference should only trigger one job ## What changes were proposed in this pull request? This fixes a perf regression caused by https://github.com/apache/spark/pull/21376 . We

spark git commit: [SPARK-23711][SPARK-25140][SQL] Catch correct exceptions when expr codegen fails

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/master a998e9d82 -> 07737c87d [SPARK-23711][SPARK-25140][SQL] Catch correct exceptions when expr codegen fails ## What changes were proposed in this pull request? This pr is to fix bugs when expr codegen fails; we need to catch

svn commit: r28889 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_21_20_02-a998e9d-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Wed Aug 22 03:15:59 2018 New Revision: 28889 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_21_20_02-a998e9d docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [MINOR] Added import to fix compilation

2018-08-21 Thread tdas
Repository: spark Updated Branches: refs/heads/master ad45299d0 -> a998e9d82 [MINOR] Added import to fix compilation ## What changes were proposed in this pull request? Two back to PRs implicitly conflicted by one PR removing an existing import that the other PR needed. This did not cause

spark git commit: [SPARK-25095][PYSPARK] Python support for BarrierTaskContext

2018-08-21 Thread meng
Repository: spark Updated Branches: refs/heads/master 42035a4fe -> ad45299d0 [SPARK-25095][PYSPARK] Python support for BarrierTaskContext ## What changes were proposed in this pull request? Add method `barrier()` and `getTaskInfos()` in python TaskContext, these two methods are only allowed

spark git commit: [SPARK-24441][SS] Expose total estimated size of states in HDFSBackedStateStoreProvider

2018-08-21 Thread tdas
Repository: spark Updated Branches: refs/heads/master ac0174e55 -> 42035a4fe [SPARK-24441][SS] Expose total estimated size of states in HDFSBackedStateStoreProvider ## What changes were proposed in this pull request? This patch exposes the estimation of size of cache (loadedMaps) in

spark git commit: [SPARK-25129][SQL] Make the mapping of com.databricks.spark.avro to built-in module configurable

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 6c5cb8585 -> ac0174e55 [SPARK-25129][SQL] Make the mapping of com.databricks.spark.avro to built-in module configurable ## What changes were proposed in this pull request? In https://issues.apache.org/jira/browse/SPARK-24924, the data

spark git commit: [SPARK-24763][SS] Remove redundant key data from value in streaming aggregation

2018-08-21 Thread tdas
Repository: spark Updated Branches: refs/heads/master 72ecfd095 -> 6c5cb8585 [SPARK-24763][SS] Remove redundant key data from value in streaming aggregation ## What changes were proposed in this pull request? This patch proposes a new flag option for stateful aggregation: remove redundant

spark git commit: [SPARK-25149][GRAPHX] Update Parallel Personalized Page Rank to test with large vertexIds

2018-08-21 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master 99d2e4e00 -> 72ecfd095 [SPARK-25149][GRAPHX] Update Parallel Personalized Page Rank to test with large vertexIds ## What changes were proposed in this pull request? runParallelPersonalizedPageRank in graphx checks that `sources` are <=

svn commit: r28887 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_21_12_02-99d2e4e-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Tue Aug 21 19:16:17 2018 New Revision: 28887 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_21_12_02-99d2e4e docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-24296][CORE] Replicate large blocks as a stream.

2018-08-21 Thread vanzin
Repository: spark Updated Branches: refs/heads/master 35f7f5ce8 -> 99d2e4e00 [SPARK-24296][CORE] Replicate large blocks as a stream. When replicating large cached RDD blocks, it can be helpful to replicate them as a stream, to avoid using large amounts of memory during the transfer. This

svn commit: r28885 - in /dev/spark/2.3.3-SNAPSHOT-2018_08_21_10_02-9cb9d72-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Tue Aug 21 17:15:41 2018 New Revision: 28885 Log: Apache Spark 2.3.3-SNAPSHOT-2018_08_21_10_02-9cb9d72 docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [DOCS][MINOR] Fix a few broken links and typos, and, nit, use HTTPS more consistently

2018-08-21 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master d80063278 -> 35f7f5ce8 [DOCS][MINOR] Fix a few broken links and typos, and, nit, use HTTPS more consistently ## What changes were proposed in this pull request? Fix a few broken links and typos, and, nit, use HTTPS more consistently esp.

spark git commit: [MINOR] Add .crc files to .gitignore

2018-08-21 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 5059255d9 -> d80063278 [MINOR] Add .crc files to .gitignore ## What changes were proposed in this pull request? Add .crc files to .gitignore so that we don't add .crc files in state checkpoint to git repo which could be added in test

spark git commit: [SPARK-25114][2.3][CORE][FOLLOWUP] Fix RecordBinaryComparatorSuite build failure

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 8bde46781 -> 9cb9d7201 [SPARK-25114][2.3][CORE][FOLLOWUP] Fix RecordBinaryComparatorSuite build failure ## What changes were proposed in this pull request? Fix RecordBinaryComparatorSuite build failure ## How was this patch tested?

spark git commit: [SPARK-25161][CORE] Fix several bugs in failure handling of barrier execution mode

2018-08-21 Thread meng
Repository: spark Updated Branches: refs/heads/master b8788b3e7 -> 5059255d9 [SPARK-25161][CORE] Fix several bugs in failure handling of barrier execution mode ## What changes were proposed in this pull request? Fix several bugs in failure handling of barrier execution mode: * Mark TaskSet

svn commit: r28882 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_21_08_02-b8788b3-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Tue Aug 21 15:16:49 2018 New Revision: 28882 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_21_08_02-b8788b3 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [BUILD] Close stale PRs

2018-08-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4fb96e510 -> b8788b3e7 [BUILD] Close stale PRs Closes #16411 Closes #21870 Closes #21794 Closes #21610 Closes #21961 Closes #21940 Closes #21870 Closes #22118 Closes #21624 Closes #19528 Closes #18424 Closes #22159 from srowen/Stale.

svn commit: r28878 - in /dev/spark/2.3.3-SNAPSHOT-2018_08_21_02_02-8bde467-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Tue Aug 21 09:15:43 2018 New Revision: 28878 Log: Apache Spark 2.3.3-SNAPSHOT-2018_08_21_02_02-8bde467 docs [This commit notification would consist of 1443 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

svn commit: r28876 - in /dev/spark/2.4.0-SNAPSHOT-2018_08_21_00_02-4fb96e5-docs: ./ _site/ _site/api/ _site/api/R/ _site/api/java/ _site/api/java/lib/ _site/api/java/org/ _site/api/java/org/apache/ _s

2018-08-21 Thread pwendell
Author: pwendell Date: Tue Aug 21 07:17:04 2018 New Revision: 28876 Log: Apache Spark 2.4.0-SNAPSHOT-2018_08_21_00_02-4fb96e5 docs [This commit notification would consist of 1476 parts, which exceeds the limit of 50 ones, so it was shortened to the summary.]

spark git commit: [SPARK-25114][CORE] Fix RecordBinaryComparator when subtraction between two words is divisible by Integer.MAX_VALUE.

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.3 9702bb637 -> 8bde46781 [SPARK-25114][CORE] Fix RecordBinaryComparator when subtraction between two words is divisible by Integer.MAX_VALUE. https://github.com/apache/spark/pull/22079#discussion_r209705612 It is possible for two

spark git commit: [SPARK-25114][CORE] Fix RecordBinaryComparator when subtraction between two words is divisible by Integer.MAX_VALUE.

2018-08-21 Thread lixiao
Repository: spark Updated Branches: refs/heads/master f984ec75e -> 4fb96e510 [SPARK-25114][CORE] Fix RecordBinaryComparator when subtraction between two words is divisible by Integer.MAX_VALUE. ## What changes were proposed in this pull request?