spark git commit: [TEST][SPARKR][CORE] Fix broken SparkSubmitSuite

2017-06-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/branch-2.2 48a843b56 -> dae1a9875 [TEST][SPARKR][CORE] Fix broken SparkSubmitSuite ## What changes were proposed in this pull request? Fix test file path. This is broken in #18264 and undetected since R-only changes don't build core and subseque

spark git commit: [TEST][SPARKR][CORE] Fix broken SparkSubmitSuite

2017-06-12 Thread felixcheung
Repository: spark Updated Branches: refs/heads/master 2639c3ed0 -> 278ba7a2c [TEST][SPARKR][CORE] Fix broken SparkSubmitSuite ## What changes were proposed in this pull request? Fix test file path. This is broken in #18264 and undetected since R-only changes don't build core and subsequent p

spark git commit: [SPARK-19910][SQL] `stack` should not reject NULL values due to type mismatch

2017-06-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master fc0e6944a -> 2639c3ed0 [SPARK-19910][SQL] `stack` should not reject NULL values due to type mismatch ## What changes were proposed in this pull request? Since `stack` function generates a table with nullable columns, it should allow mixed

spark git commit: Revert "[SPARK-21046][SQL] simplify the array offset and length in ColumnVector"

2017-06-12 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 74a432d3a -> fc0e6944a Revert "[SPARK-21046][SQL] simplify the array offset and length in ColumnVector" This reverts commit 22dd65f58e12cb3a883d106fcccdff25a2a00fe8. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http:

spark git commit: [SPARK-20979][SS] Add RateSource to generate values for tests and benchmark

2017-06-12 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master ff318c0d2 -> 74a432d3a [SPARK-20979][SS] Add RateSource to generate values for tests and benchmark ## What changes were proposed in this pull request? This PR adds RateSource for Structured Streaming so that the user can use it to generat

spark git commit: [SPARK-21050][ML] Word2vec persistence overflow bug fix

2017-06-12 Thread jkbradley
Repository: spark Updated Branches: refs/heads/branch-2.2 580ecfd7a -> 48a843b56 [SPARK-21050][ML] Word2vec persistence overflow bug fix ## What changes were proposed in this pull request? The method calculateNumberOfPartitions() uses Int, not Long (unlike the MLlib version), so it is very e

spark git commit: [SPARK-21050][ML] Word2vec persistence overflow bug fix

2017-06-12 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master b1436c749 -> ff318c0d2 [SPARK-21050][ML] Word2vec persistence overflow bug fix ## What changes were proposed in this pull request? The method calculateNumberOfPartitions() uses Int, not Long (unlike the MLlib version), so it is very easil

spark git commit: [SPARK-21059][SQL] LikeSimplification can NPE on null pattern

2017-06-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 a6b7875d3 -> 580ecfd7a [SPARK-21059][SQL] LikeSimplification can NPE on null pattern ## What changes were proposed in this pull request? This patch fixes a bug that can cause NullPointerException in LikeSimplification, when the pattern

spark git commit: [SPARK-21059][SQL] LikeSimplification can NPE on null pattern

2017-06-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master 32818d9b3 -> b1436c749 [SPARK-21059][SQL] LikeSimplification can NPE on null pattern ## What changes were proposed in this pull request? This patch fixes a bug that can cause NullPointerException in LikeSimplification, when the pattern for

spark git commit: [SPARK-20345][SQL] Fix STS error handling logic on HiveSQLException

2017-06-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/branch-2.2 92f7c8f52 -> a6b7875d3 [SPARK-20345][SQL] Fix STS error handling logic on HiveSQLException ## What changes were proposed in this pull request? [SPARK-5100](https://github.com/apache/spark/commit/343d3bfafd449a0371feb6a88f78e07302fa7143

spark git commit: [SPARK-20345][SQL] Fix STS error handling logic on HiveSQLException

2017-06-12 Thread lixiao
Repository: spark Updated Branches: refs/heads/master ca4e960ae -> 32818d9b3 [SPARK-20345][SQL] Fix STS error handling logic on HiveSQLException ## What changes were proposed in this pull request? [SPARK-5100](https://github.com/apache/spark/commit/343d3bfafd449a0371feb6a88f78e07302fa7143) a

spark git commit: [SPARK-17914][SQL] Fix parsing of timestamp strings with nanoseconds

2017-06-12 Thread ueshin
Repository: spark Updated Branches: refs/heads/master 22dd65f58 -> ca4e960ae [SPARK-17914][SQL] Fix parsing of timestamp strings with nanoseconds The PR contains a tiny change to fix the way Spark parses string literals into timestamps. Currently, some timestamps that contain nanoseconds are

spark git commit: [SPARK-17914][SQL] Fix parsing of timestamp strings with nanoseconds

2017-06-12 Thread ueshin
Repository: spark Updated Branches: refs/heads/branch-2.2 e6773944f -> 92f7c8f52 [SPARK-17914][SQL] Fix parsing of timestamp strings with nanoseconds The PR contains a tiny change to fix the way Spark parses string literals into timestamps. Currently, some timestamps that contain nanoseconds

spark git commit: [SPARK-21046][SQL] simplify the array offset and length in ColumnVector

2017-06-12 Thread wenchen
Repository: spark Updated Branches: refs/heads/master a92e095e7 -> 22dd65f58 [SPARK-21046][SQL] simplify the array offset and length in ColumnVector ## What changes were proposed in this pull request? Currently when a `ColumnVector` stores array type elements, we will use 2 arrays for length

spark git commit: [SPARK-21041][SQL] SparkSession.range should be consistent with SparkContext.range

2017-06-12 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.2 a4d78e4b3 -> e6773944f [SPARK-21041][SQL] SparkSession.range should be consistent with SparkContext.range ## What changes were proposed in this pull request? This PR fixes the inconsistency in `SparkSession.range`. **BEFORE** ```scal

spark git commit: [SPARK-21041][SQL] SparkSession.range should be consistent with SparkContext.range

2017-06-12 Thread wenchen
Repository: spark Updated Branches: refs/heads/master e6eb02df1 -> a92e095e7 [SPARK-21041][SQL] SparkSession.range should be consistent with SparkContext.range ## What changes were proposed in this pull request? This PR fixes the inconsistency in `SparkSession.range`. **BEFORE** ```scala sc

spark git commit: [DOCS] Fix error: ambiguous reference to overloaded definition

2017-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/master d14091809 -> e6eb02df1 [DOCS] Fix error: ambiguous reference to overloaded definition ## What changes were proposed in this pull request? `df.groupBy.count()` should be `df.groupBy().count()` , otherwise there is an error : ambiguous ref

spark git commit: [DOCS] Fix error: ambiguous reference to overloaded definition

2017-06-12 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.2 26003de55 -> a4d78e4b3 [DOCS] Fix error: ambiguous reference to overloaded definition ## What changes were proposed in this pull request? `df.groupBy.count()` should be `df.groupBy().count()` , otherwise there is an error : ambiguous