spark git commit: [MINOR][SQL] Replace DataFrameWriter.stream() with startStream() in comments.

2016-03-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 3e3c3d58d -> 9525c563d [MINOR][SQL] Replace DataFrameWriter.stream() with startStream() in comments. ## What changes were proposed in this pull request? According to #11627 , this PR replace `DataFrameWriter.stream()` with

spark git commit: [SPARK-13706][ML] Add Python Example for Train Validation Split

2016-03-09 Thread mlnick
Repository: spark Updated Branches: refs/heads/master 8bcad28a5 -> 3e3c3d58d [SPARK-13706][ML] Add Python Example for Train Validation Split ## What changes were proposed in this pull request? This pull request adds a python example for train validation split. ## How was this patch tested?

spark git commit: [SPARK-7420][STREAMING][TESTS] Enable test: o.a.s.streaming.JobGeneratorSuite "Do not clear received…

2016-03-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 8a3acb792 -> 8bcad28a5 [SPARK-7420][STREAMING][TESTS] Enable test: o.a.s.streaming.JobGeneratorSuite "Do not clear received… ## How was this patch tested? unit test Author: proflin Closes #11626 from

spark git commit: [SPARK-13794][SQL] Rename DataFrameWriter.stream() DataFrameWriter.startStream()

2016-03-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master aa0eba2c3 -> 8a3acb792 [SPARK-13794][SQL] Rename DataFrameWriter.stream() DataFrameWriter.startStream() ## What changes were proposed in this pull request? The new name makes it more obvious with the verb "start" that we are actually

spark git commit: [SPARK-13766][SQL] Consistent file extensions for files written by internal data sources

2016-03-09 Thread rxin
Repository: spark Updated Branches: refs/heads/master 790646125 -> aa0eba2c3 [SPARK-13766][SQL] Consistent file extensions for files written by internal data sources ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-13766 This PR makes the file

spark git commit: Revert "[SPARK-13760][SQL] Fix BigDecimal constructor for FloatType"

2016-03-09 Thread yhuai
Repository: spark Updated Branches: refs/heads/master 926e9c45a -> 790646125 Revert "[SPARK-13760][SQL] Fix BigDecimal constructor for FloatType" This reverts commit 926e9c45a21c5b71ef0832d63b8dae7d4f3d8826. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit:

spark git commit: [SPARK-13760][SQL] Fix BigDecimal constructor for FloatType

2016-03-09 Thread yhuai
Repository: spark Updated Branches: refs/heads/branch-1.6 bea91a9e9 -> 8a1bd5834 [SPARK-13760][SQL] Fix BigDecimal constructor for FloatType ## What changes were proposed in this pull request? A very minor change for using `BigDecimal.decimal(f: Float)` instead of `BigDecimal(f: float)`.

spark git commit: [SPARK-13760][SQL] Fix BigDecimal constructor for FloatType

2016-03-09 Thread yhuai
Repository: spark Updated Branches: refs/heads/master a4a0addcc -> 926e9c45a [SPARK-13760][SQL] Fix BigDecimal constructor for FloatType ## What changes were proposed in this pull request? A very minor change for using `BigDecimal.decimal(f: Float)` instead of `BigDecimal(f: float)`. The

spark git commit: [SPARK-13492][MESOS] Configurable Mesos framework webui URL.

2016-03-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 5f7dbdba6 -> a4a0addcc [SPARK-13492][MESOS] Configurable Mesos framework webui URL. ## What changes were proposed in this pull request? Previously the Mesos framework webui URL was being derived only from the Spark UI address leaving no

spark git commit: [MINOR] Fix typo in 'hypot' docstring

2016-03-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 238447db5 -> 5f7dbdba6 [MINOR] Fix typo in 'hypot' docstring Minor typo: docstring for pyspark.sql.functions: hypot has extra characters N/A Author: Tristan Reid Closes #11616 from tristanreid/master. Project:

spark git commit: [SPARK-13775] History page sorted by completed time desc by default.

2016-03-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 40e067675 -> 238447db5 [SPARK-13775] History page sorted by completed time desc by default. ## What changes were proposed in this pull request? Originally the page is sorted by AppID by default. After tests with users' feedback, we think

spark git commit: [SPARK-13778][CORE] Set the executor state for a worker when removing it

2016-03-09 Thread andrewor14
Repository: spark Updated Branches: refs/heads/master 37fcda3e6 -> 40e067675 [SPARK-13778][CORE] Set the executor state for a worker when removing it ## What changes were proposed in this pull request? When a worker is lost, the executors on this worker are also lost. But Master's

spark git commit: [SPARK-13747][SQL] Fix concurrent query with fork-join pool

2016-03-09 Thread zsxwing
Repository: spark Updated Branches: refs/heads/master dbf2a7cfa -> 37fcda3e6 [SPARK-13747][SQL] Fix concurrent query with fork-join pool ## What changes were proposed in this pull request? Fix this use case, which was already fixed in SPARK-10548 in 1.6 but was broken in master due to

[spark] Git Push Summary

2016-03-09 Thread marmbrus
Repository: spark Updated Tags: refs/tags/v1.6.1 [created] 15de51c23 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[spark] Git Push Summary

2016-03-09 Thread marmbrus
Repository: spark Updated Tags: refs/tags/v1.6.1 [deleted] 152252f15 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[spark] Git Push Summary

2016-03-09 Thread marmbrus
Repository: spark Updated Tags: refs/tags/v1.6.1-rc1 [deleted] 15de51c23 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

[spark] Git Push Summary

2016-03-09 Thread marmbrus
Repository: spark Updated Tags: refs/tags/v1.6.1 [created] 152252f15 - To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org

spark git commit: [SPARK-13781][SQL] Use ExpressionSets in ConstraintPropagationSuite

2016-03-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master e1772d3f1 -> dbf2a7cfa [SPARK-13781][SQL] Use ExpressionSets in ConstraintPropagationSuite ## What changes were proposed in this pull request? This PR is a small follow up on https://github.com/apache/spark/pull/11338

spark git commit: [SPARK-11861][ML] Add feature importances for decision trees

2016-03-09 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master c6aa356cd -> e1772d3f1 [SPARK-11861][ML] Add feature importances for decision trees This patch adds an API entry point for single decision tree feature importances. Author: sethah Closes #9912 from

spark git commit: [SPARK-13527][SQL] Prune Filters based on Constraints

2016-03-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 3dc9ae2e1 -> c6aa356cd [SPARK-13527][SQL] Prune Filters based on Constraints What changes were proposed in this pull request? Remove all the deterministic conditions in a [[Filter]] that are contained in the Child's Constraints.

spark git commit: [SPARK-13523] [SQL] Reuse exchanges in a query

2016-03-09 Thread davies
Repository: spark Updated Branches: refs/heads/master 0dd06485c -> 3dc9ae2e1 [SPARK-13523] [SQL] Reuse exchanges in a query ## What changes were proposed in this pull request? It’s possible to have common parts in a query, for example, self join, it will be good to avoid the duplicated

spark git commit: [SPARK-13615][ML] GeneralizedLinearRegression supports save/load

2016-03-09 Thread jkbradley
Repository: spark Updated Branches: refs/heads/master cad29a40b -> 0dd06485c [SPARK-13615][ML] GeneralizedLinearRegression supports save/load ## What changes were proposed in this pull request? ```GeneralizedLinearRegression``` supports ```save/load```. cc mengxr ## How was this patch tested?

spark git commit: [SPARK-13728][SQL] Fix ORC PPD test so that pushed filters can be checked.

2016-03-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 23369c3bd -> cad29a40b [SPARK-13728][SQL] Fix ORC PPD test so that pushed filters can be checked. ## What changes were proposed in this pull request? https://issues.apache.org/jira/browse/SPARK-13728

spark git commit: [SPARK-13763][SQL] Remove Project when its Child's Output is Nil

2016-03-09 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 256704c77 -> 23369c3bd [SPARK-13763][SQL] Remove Project when its Child's Output is Nil What changes were proposed in this pull request? As shown in another PR: https://github.com/apache/spark/pull/11596, we are using `SELECT 1` as

[5/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisUtils.scala -- diff --git

[7/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/docker-integration-tests/src/test/scala/org/apache/spark/util/DockerUtils.scala -- diff --git

[6/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/external/kinesis-asl/src/main/java/org/apache/spark/examples/streaming/JavaKinesisWordCountASL.java -- diff --git

[1/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7791d0c3a -> 256704c77 http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/pom.xml -- diff --git a/pom.xml b/pom.xml index 90f4672..a3af20c 100644 --- a/pom.xml +++

[3/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/kinesis-asl/src/main/scala/org/apache/spark/examples/streaming/KinesisWordCountASL.scala -- diff --git

[2/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/kinesis-asl/src/main/scala/org/apache/spark/streaming/kinesis/KinesisUtils.scala -- diff --git

[4/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/256704c7/extras/README.md -- diff --git a/extras/README.md b/extras/README.md deleted file mode 100644 index 1b4174b..000 --- a/extras/README.md +++ /dev/null @@ -1 +0,0 @@

[8/8] spark git commit: [SPARK-13595][BUILD] Move docker, extras modules into external

2016-03-09 Thread srowen
[SPARK-13595][BUILD] Move docker, extras modules into external ## What changes were proposed in this pull request? Move `docker` dirs out of top level into `external/`; move `extras/*` into `external/` ## How was this patch tested? This is tested with Jenkins tests. Author: Sean Owen

spark git commit: Revert "[SPARK-13668][SQL] Reorder filter/join predicates to short-circuit isNotNull checks"

2016-03-09 Thread davies
Repository: spark Updated Branches: refs/heads/master 9634e17d0 -> 7791d0c3a Revert "[SPARK-13668][SQL] Reorder filter/join predicates to short-circuit isNotNull checks" This reverts commit e430614eae53c8864b31a1dc64db83e27100d1d9. Project: http://git-wip-us.apache.org/repos/asf/spark/repo

spark git commit: [SPARK-13242] [SQL] codegen fallback in case-when if there many branches

2016-03-09 Thread davies
Repository: spark Updated Branches: refs/heads/master 53ba6d6e5 -> 9634e17d0 [SPARK-13242] [SQL] codegen fallback in case-when if there many branches ## What changes were proposed in this pull request? If there are many branches in a CaseWhen expression, the generated code could go above

svn commit: r1734288 - in /spark: documentation.md site/documentation.html

2016-03-09 Thread srowen
Author: srowen Date: Wed Mar 9 17:06:53 2016 New Revision: 1734288 URL: http://svn.apache.org/viewvc?rev=1734288=rev Log: Add Big Data Analytics book to list of Spark resources Modified: spark/documentation.md spark/site/documentation.html Modified: spark/documentation.md URL:

spark git commit: [SPARK-13698][SQL] Fix Analysis Exceptions when Using Backticks in Generate

2016-03-09 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 8e8633e0b -> 53ba6d6e5 [SPARK-13698][SQL] Fix Analysis Exceptions when Using Backticks in Generate ## What changes were proposed in this pull request? Analysis exception occurs while running the following query. ``` SELECT ints FROM

spark git commit: [SPARK-13769][CORE] Update Java Doc in Spark Submit

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master c3689bc24 -> 8e8633e0b [SPARK-13769][CORE] Update Java Doc in Spark Submit JIRA : https://issues.apache.org/jira/browse/SPARK-13769 The java doc here

[2/2] spark git commit: [SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code.

2016-03-09 Thread srowen
[SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code. ## What changes were proposed in this pull request? In order to make `docs/examples` (and other related code) more simple/readable/user-friendly, this PR replaces existing codes like the

[1/2] spark git commit: [SPARK-13702][CORE][SQL][MLLIB] Use diamond operator for generic instance creation in Java code.

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master cbff2803e -> c3689bc24 http://git-wip-us.apache.org/repos/asf/spark/blob/c3689bc2/sql/hive/src/test/java/org/apache/spark/sql/hive/test/Complex.java -- diff --git

spark git commit: [SPARK-13631][CORE] Thread-safe getLocationsWithLargestOutputs

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2c5af7d4d -> cbff2803e [SPARK-13631][CORE] Thread-safe getLocationsWithLargestOutputs ## What changes were proposed in this pull request? If a job is being scheduled in one thread which has a dependency on an RDD currently executing a

spark git commit: [SPARK-13640][SQL] Synchronize ScalaReflection.mirror method.

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master f3201aeeb -> 2c5af7d4d [SPARK-13640][SQL] Synchronize ScalaReflection.mirror method. ## What changes were proposed in this pull request? `ScalaReflection.mirror` method should be synchronized when scala version is `2.10` because

spark git commit: [SPARK-13692][CORE][SQL] Fix trivial Coverity/Checkstyle defects

2016-03-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 035d3acdf -> f3201aeeb [SPARK-13692][CORE][SQL] Fix trivial Coverity/Checkstyle defects ## What changes were proposed in this pull request? This issue fixes the following potential bugs and Java coding style detected by Coverity and