spark git commit: [SPARK-16863][ML] ProbabilisticClassifier.fit check threshoulds' length

2016-08-04 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1d781572e -> 0e2e5d7d0 [SPARK-16863][ML] ProbabilisticClassifier.fit check threshoulds' length ## What changes were proposed in this pull request? Add threshoulds' length checking for Classifiers which extends ProbabilisticClassifier ##

spark git commit: [SPARK-16312][STREAMING][KAFKA][DOC] Doc for Kafka 0.10 integration

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 5effc016c -> c9f2501af [SPARK-16312][STREAMING][KAFKA][DOC] Doc for Kafka 0.10 integration ## What changes were proposed in this pull request? Doc for the Kafka 0.10 integration ## How was this patch tested? Scala code examples were taken

spark git commit: [SPARK-16312][STREAMING][KAFKA][DOC] Doc for Kafka 0.10 integration

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 dae08fb5a -> b4a89c1c1 [SPARK-16312][STREAMING][KAFKA][DOC] Doc for Kafka 0.10 integration ## What changes were proposed in this pull request? Doc for the Kafka 0.10 integration ## How was this patch tested? Scala code examples were

spark git commit: [MINOR] Update AccumulatorV2 doc to not mention "+=".

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b4a89c1c1 -> 7fbac48f0 [MINOR] Update AccumulatorV2 doc to not mention "+=". ## What changes were proposed in this pull request? As reported by Bryan Cutler on the mailing list, AccumulatorV2 does not have a += method, yet the

spark git commit: [MINOR] Update AccumulatorV2 doc to not mention "+=".

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master c9f2501af -> e02606414 [MINOR] Update AccumulatorV2 doc to not mention "+=". ## What changes were proposed in this pull request? As reported by Bryan Cutler on the mailing list, AccumulatorV2 does not have a += method, yet the

spark git commit: [SPARK-16826][SQL] Switch to java.net.URI for parse_url()

2016-08-05 Thread srowen
URL. Thanks to srowen for the help! Author: Sylvain Zimmer <sylv...@sylvainzimmer.com> Closes #14488 from sylvinus/master. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2460f03f Tree: http://git-wip-us.apache.org/repos/asf/s

[3/5] spark git commit: [SPARK-16421][EXAMPLES][ML] Improve ML Example Outputs

2016-08-05 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/180fd3e0/data/mllib/sample_tree_data.csv -- diff --git a/data/mllib/sample_tree_data.csv b/data/mllib/sample_tree_data.csv deleted file mode 100644 index bc97e29..000 ---

[5/5] spark git commit: [SPARK-16421][EXAMPLES][ML] Improve ML Example Outputs

2016-08-05 Thread srowen
[SPARK-16421][EXAMPLES][ML] Improve ML Example Outputs ## What changes were proposed in this pull request? Improve example outputs to better reflect the functionality that is being presented. This mostly consisted of modifying what was printed at the end of the example, such as calling show()

[1/5] spark git commit: [SPARK-16421][EXAMPLES][ML] Improve ML Example Outputs

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2460f03ff -> 180fd3e0a http://git-wip-us.apache.org/repos/asf/spark/blob/180fd3e0/examples/src/main/scala/org/apache/spark/examples/ml/NormalizerExample.scala -- diff --git

[4/5] spark git commit: [SPARK-16421][EXAMPLES][ML] Improve ML Example Outputs

2016-08-05 Thread srowen
http://git-wip-us.apache.org/repos/asf/spark/blob/180fd3e0/data/mllib/lr_data.txt -- diff --git a/data/mllib/lr_data.txt b/data/mllib/lr_data.txt deleted file mode 100644 index d4df063..000 --- a/data/mllib/lr_data.txt +++

spark git commit: [SPARK-16796][WEB UI] Mask spark.authenticate.secret on Spark environ…

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 55d6dad6f -> 14dba4520 [SPARK-16796][WEB UI] Mask spark.authenticate.secret on Spark environ… ## What changes were proposed in this pull request? Mask `spark.authenticate.secret` on Spark environment page (Web UI). This is addition to

spark git commit: [SPARK-16847][SQL] Prevent to potentially read corrupt statstics on binary in Parquet vectorized reader

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master e679bc3c1 -> 55d6dad6f [SPARK-16847][SQL] Prevent to potentially read corrupt statstics on binary in Parquet vectorized reader ## What changes were proposed in this pull request? This problem was found in

spark git commit: [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Update "important classes"

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/master 14dba4520 -> 2dd038861 [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Update "important classes" ## Proposed Changes * Update the list of "important classes" in `pyspark.sql` to match 2.0. * Fix references to

spark git commit: document that Mesos cluster mode supports python

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4f5f9b670 -> 7aaa5a01c document that Mesos cluster mode supports python update docs to be consistent with SPARK-14645 https://issues.apache.org/jira/browse/SPARK-14645 Author: Michael Gummelt Closes #14514 from

spark git commit: document that Mesos cluster mode supports python

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 d23343192 -> 58e7038b8 document that Mesos cluster mode supports python update docs to be consistent with SPARK-14645 https://issues.apache.org/jira/browse/SPARK-14645 Author: Michael Gummelt Closes #14514

spark git commit: [SPARK-16932][DOCS] Changed programming guide to not reference old accumulator API in Scala

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 58e7038b8 -> c0364485e [SPARK-16932][DOCS] Changed programming guide to not reference old accumulator API in Scala ## What changes were proposed in this pull request? In the programming guide, the accumulator section mixes up both

spark git commit: [SPARK-16932][DOCS] Changed programming guide to not reference old accumulator API in Scala

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7aaa5a01c -> b1ebe182c [SPARK-16932][DOCS] Changed programming guide to not reference old accumulator API in Scala ## What changes were proposed in this pull request? In the programming guide, the accumulator section mixes up both the

spark git commit: [SPARK-16870][DOCS] Summary:add "spark.sql.broadcastTimeout" into docs/sql-programming-gu…

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 c0364485e -> 3f8a95be8 [SPARK-16870][DOCS] Summary:add "spark.sql.broadcastTimeout" into docs/sql-programming-gu… ## What changes were proposed in this pull request? default value for spark.sql.broadcastTimeout is 300s. and this

spark git commit: [SPARK-16870][DOCS] Summary:add "spark.sql.broadcastTimeout" into docs/sql-programming-gu…

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master b1ebe182c -> 1275f6469 [SPARK-16870][DOCS] Summary:add "spark.sql.broadcastTimeout" into docs/sql-programming-gu… ## What changes were proposed in this pull request? default value for spark.sql.broadcastTimeout is 300s. and this

spark git commit: [SPARK-16911] Fix the links in the programming guide

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 3f8a95be8 -> 739a333f6 [SPARK-16911] Fix the links in the programming guide ## What changes were proposed in this pull request? Fix the broken links in the programming guide of the Graphx Migration and understanding closures ## How

spark git commit: [SPARK-16911] Fix the links in the programming guide

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1275f6469 -> 6c1ecb191 [SPARK-16911] Fix the links in the programming guide ## What changes were proposed in this pull request? Fix the broken links in the programming guide of the Graphx Migration and understanding closures ## How was

spark git commit: [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Update "important classes"

2016-08-05 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b5d65b45d -> 90e046024 [SPARK-16772][PYTHON][DOCS] Fix API doc references to UDFRegistration + Update "important classes" ## Proposed Changes * Update the list of "important classes" in `pyspark.sql` to match 2.0. * Fix references to

spark git commit: [SPARK-16909][SPARK CORE] Streaming for postgreSQL JDBC driver

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master 6c1ecb191 -> bdfab9f94 [SPARK-16909][SPARK CORE] Streaming for postgreSQL JDBC driver As per the postgreSQL JDBC driver

spark git commit: [SPARK-16409][SQL] regexp_extract with optional groups causes NPE

2016-08-07 Thread srowen
hes but the requested optional group did not. This makes it return an empty string, as apparently designed. ## How was this patch tested? Additional unit test Author: Sean Owen <so...@cloudera.com> Closes #14504 from srowen/SPARK-16409. (cherry picked from commit 8d8725208771a8815a60160a5a30dc6ea87a7e6

spark git commit: [SPARK-16409][SQL] regexp_extract with optional groups causes NPE

2016-08-07 Thread srowen
hes but the requested optional group did not. This makes it return an empty string, as apparently designed. ## How was this patch tested? Additional unit test Author: Sean Owen <so...@cloudera.com> Closes #14504 from srowen/SPARK-16409. (cherry picked from commit 8d8725208771a8815a60160a5a30dc6ea87a7e6

spark git commit: [SPARK-16409][SQL] regexp_extract with optional groups causes NPE

2016-08-07 Thread srowen
ted optional group did not. This makes it return an empty string, as apparently designed. ## How was this patch tested? Additional unit test Author: Sean Owen <so...@cloudera.com> Closes #14504 from srowen/SPARK-16409. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: h

spark git commit: [SPARK-16809] enable history server links in dispatcher UI

2016-08-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2154345b6 -> 62e621244 [SPARK-16809] enable history server links in dispatcher UI ## What changes were proposed in this pull request? Links the Spark Mesos Dispatcher UI to the history server UI - adds

spark git commit: [SPARK-16606][CORE] Misleading warning for SparkContext.getOrCreate "WARN SparkContext: Use an existing SparkContext, some configuration may not take effect."

2016-08-09 Thread srowen
pull request? SparkContext.getOrCreate shouldn't warn about ignored config if - it wasn't ignored because a new context is created with it or - no config was actually provided ## How was this patch tested? Jenkins + existing tests. Author: Sean Owen <so...@cloudera.com> Closes #14533 from

spark git commit: [SPARK-16522][MESOS] Spark application throws exception on exit.

2016-08-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master 801e4d097 -> af710e5bd [SPARK-16522][MESOS] Spark application throws exception on exit. ## What changes were proposed in this pull request? Spark applications running on Mesos throw exception upon exit. For details, refer to

spark git commit: [SPARK-16940][SQL] `checkAnswer` should raise `TestFailedException` for wrong results

2016-08-09 Thread srowen
Repository: spark Updated Branches: refs/heads/master af710e5bd -> 2154345b6 [SPARK-16940][SQL] `checkAnswer` should raise `TestFailedException` for wrong results ## What changes were proposed in this pull request? This PR fixes the following to make `checkAnswer` raise

spark git commit: [SPARK-16919] Configurable update interval for console progress bar

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master a16983c97 -> e076fb05a [SPARK-16919] Configurable update interval for console progress bar ## What changes were proposed in this pull request? Currently the update interval for the console progress bar is hardcoded. This PR makes it

spark git commit: [SPARK-16939][SQL] Fix build error by using `Tuple1` explicitly in StringFunctionsSuite

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 1a5e762ff -> a3b06ae0b [SPARK-16939][SQL] Fix build error by using `Tuple1` explicitly in StringFunctionsSuite ## What changes were proposed in this pull request? This PR aims to fix a build error on branch 1.6 at

spark git commit: [SPARK-16939][SQL] Fix build error by using `Tuple1` explicitly in StringFunctionsSuite

2016-08-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 fd828e14b -> f37ed6e70 [SPARK-16939][SQL] Fix build error by using `Tuple1` explicitly in StringFunctionsSuite ## What changes were proposed in this pull request? This PR aims to fix a build error on branch 1.6 at

spark git commit: [SPARK-16945] Fix Java Lint errors

2016-08-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1db1c6567 -> e10ca8de4 [SPARK-16945] Fix Java Lint errors ## What changes were proposed in this pull request? This PR is to fix the minor Java linter errors as following: [ERROR]

spark git commit: [SPARK-16774][SQL] Fix use of deprecated timestamp constructor & improve timezone handling

2016-08-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 338a98d65 -> ab1e761f9 [SPARK-16774][SQL] Fix use of deprecated timestamp constructor & improve timezone handling ## What changes were proposed in this pull request? Removes the deprecated timestamp constructor and incidentally fixes the

spark git commit: [SPARK-16774][SQL] Fix use of deprecated timestamp constructor & improve timezone handling

2016-08-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 1523bf69a -> 4e73cb8eb [SPARK-16774][SQL] Fix use of deprecated timestamp constructor & improve timezone handling ## What changes were proposed in this pull request? Removes the deprecated timestamp constructor and incidentally fixes

spark git commit: [SPARK-16561][MLLIB] fix multivarOnlineSummary min/max bug

2016-07-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master e10b8741d -> 25db51675 [SPARK-16561][MLLIB] fix multivarOnlineSummary min/max bug ## What changes were proposed in this pull request? renaming var names to make code more clear: nnz => weightSum weightSum => totalWeightSum and add a new

spark git commit: [SPARK-16194] Mesos Driver env vars

2016-07-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 69626addd -> 235cb256d [SPARK-16194] Mesos Driver env vars ## What changes were proposed in this pull request? Added new configuration namespace: spark.mesos.env.* This allows a user submitting a job in cluster mode to set arbitrary

spark git commit: [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more consistent with Scala API

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 fb944a1e8 -> 28bb2b044 [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more consistent with Scala API ## What changes were proposed in this pull request? `withColumnRenamed` and `drop` is a no-op if the given

spark git commit: [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more consistent with Scala API

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 6c56fff11 -> 47f5b88db [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop` description more consistent with Scala API ## What changes were proposed in this pull request? `withColumnRenamed` and `drop` is a no-op if the given column

spark git commit: [SPARK-16662][PYSPARK][SQL] fix HiveContext warning bug

2016-07-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master 25db51675 -> ab6e4aea5 [SPARK-16662][PYSPARK][SQL] fix HiveContext warning bug ## What changes were proposed in this pull request? move the `HiveContext` deprecate warning printing statement into `HiveContext` constructor. so that this

spark git commit: [SPARK-16287][HOTFIX][BUILD][SQL] Fix annotation argument needs to be a constant

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 70bf8ce72 -> 0cc36cad1 [SPARK-16287][HOTFIX][BUILD][SQL] Fix annotation argument needs to be a constant ## What changes were proposed in this pull request? Build fix for [SPARK-16287][SQL] Implement str_to_map SQL function that has

spark git commit: [SPARK-16287][HOTFIX][BUILD][SQL] Fix annotation argument needs to be a constant

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 94f14b52a -> e1bd70f44 [SPARK-16287][HOTFIX][BUILD][SQL] Fix annotation argument needs to be a constant ## What changes were proposed in this pull request? Build fix for [SPARK-16287][SQL] Implement str_to_map SQL function that has

spark git commit: [SPARK-16487][STREAMING] Fix some batches might not get marked as fully processed in JobGenerator

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master e1bd70f44 -> 2c72a4432 [SPARK-16487][STREAMING] Fix some batches might not get marked as fully processed in JobGenerator ## What changes were proposed in this pull request? In `JobGenerator`, the code reads like that some batches might

spark git commit: [GIT] add pydev & Rstudio project file to gitignore list

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2c72a4432 -> b4e16bd58 [GIT] add pydev & Rstudio project file to gitignore list ## What changes were proposed in this pull request? Add Pydev & Rstudio project file to gitignore list, I think the two IEDs are used by many developers. so

spark git commit: [SPARK-16650] Improve documentation of spark.task.maxFailures

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/master b4e16bd58 -> 6c56fff11 [SPARK-16650] Improve documentation of spark.task.maxFailures Clarify documentation on spark.task.maxFailures No tests run as its documentation Author: Tom Graves Closes #14287 from

spark git commit: [SPARK-16650] Improve documentation of spark.task.maxFailures

2016-07-22 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0cc36cad1 -> fb944a1e8 [SPARK-16650] Improve documentation of spark.task.maxFailures Clarify documentation on spark.task.maxFailures No tests run as its documentation Author: Tom Graves Closes #14287 from

spark git commit: [SPARK-16751][HOTFIX] Also update hadoop-1 deps file to reflect derby 10.12.1.1 security fix

2016-07-29 Thread srowen
ote in back-porting to 1.6 that an additional Hadoop 1 deps file would need to be updated. This makes that change. ## How was this patch tested? Jenkins tests. Author: Sean Owen <so...@cloudera.com> Closes #14403 from srowen/SPARK-16751.2. Project: http://git-wip-us.apache.org/repos/

spark git commit: [SPARK-16796][WEB UI] Visible passwords on Spark environment page

2016-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 d9d3504b9 -> 969313bb2 [SPARK-16796][WEB UI] Visible passwords on Spark environment page ## What changes were proposed in this pull request? Mask spark.ssl.keyPassword, spark.ssl.keyStorePassword, spark.ssl.trustStorePassword in Web

spark git commit: [SPARK-16796][WEB UI] Visible passwords on Spark environment page

2016-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 92ee6fbf5 -> 52d8837c6 [SPARK-16796][WEB UI] Visible passwords on Spark environment page ## What changes were proposed in this pull request? Mask spark.ssl.keyPassword, spark.ssl.keyStorePassword, spark.ssl.trustStorePassword in Web

spark git commit: [SPARK-16770][BUILD] Fix JLine dependency management and version (Sca…

2016-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/master 685b08e26 -> 4775eb414 [SPARK-16770][BUILD] Fix JLine dependency management and version (Sca… ## What changes were proposed in this pull request? As of Scala 2.11.x there is no longer a org.scala-lang:jline version aligned to the scala

spark git commit: [SPARK-16814][SQL] Fix deprecated parquet constructor usage

2016-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4775eb414 -> c5eb1df72 [SPARK-16814][SQL] Fix deprecated parquet constructor usage ## What changes were proposed in this pull request? Replace deprecated ParquetWriter with the new builders ## How was this patch tested? Existing tests

spark git commit: [SPARK-16770][BUILD] Fix JLine dependency management and version (Sca…

2016-08-03 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 b44da5b4e -> bb30a3d0f [SPARK-16770][BUILD] Fix JLine dependency management and version (Sca… ## What changes were proposed in this pull request? As of Scala 2.11.x there is no longer a org.scala-lang:jline version aligned to the

spark git commit: [SPARK-16257][BUILD] Update spark_ec2.py to support Spark 1.6.2 and 1.6.3.

2016-06-30 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 1ac830aca -> ccc7fa357 [SPARK-16257][BUILD] Update spark_ec2.py to support Spark 1.6.2 and 1.6.3. ## What changes were proposed in this pull request? - Adds 1.6.2 and 1.6.3 as supported Spark versions within the bundled spark-ec2

spark git commit: [SPARK-16182][CORE] Utils.scala -- terminateProcess() should call Process.destroyForcibly() if and only if Process.destroy() fails

2016-07-01 Thread srowen
es #13973 from srowen/SPARK-16182. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/2075bf8e Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/2075bf8e Diff: http://git-wip-us.apache.org/repos/asf/spark/diff/2075bf8e Bra

spark git commit: [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3

2016-07-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 2075bf8ef -> 66283ee0b [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3 ## What changes were proposed in this pull request? I would like to use IPython with Python 3.5. It is annoying when it fails with IPython

spark git commit: [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3

2016-07-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 972106dd3 -> 0b64543c5 [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3 ## What changes were proposed in this pull request? I would like to use IPython with Python 3.5. It is annoying when it fails with

spark git commit: [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3

2016-07-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-1.6 83f860448 -> 1026aba16 [SPARK-15761][MLLIB][PYSPARK] Load ipython when default python is Python3 ## What changes were proposed in this pull request? I would like to use IPython with Python 3.5. It is annoying when it fails with

spark git commit: [SPARK-16222][SQL] JDBC Sources - Handling illegal input values for `fetchsize` and `batchsize`

2016-07-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0b64543c5 -> 3665927c6 [SPARK-16222][SQL] JDBC Sources - Handling illegal input values for `fetchsize` and `batchsize` What changes were proposed in this pull request? For JDBC data sources, users can specify `batchsize` for

spark git commit: [SPARK-16222][SQL] JDBC Sources - Handling illegal input values for `fetchsize` and `batchsize`

2016-07-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 66283ee0b -> 0ad6ce7e5 [SPARK-16222][SQL] JDBC Sources - Handling illegal input values for `fetchsize` and `batchsize` What changes were proposed in this pull request? For JDBC data sources, users can specify `batchsize` for

spark git commit: [GRAPHX][EXAMPLES] move graphx test data directory and update graphx document

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master bad0f7dbb -> 192d1f9cf [GRAPHX][EXAMPLES] move graphx test data directory and update graphx document ## What changes were proposed in this pull request? There are two test data files used for graphx examples existing in directory

spark git commit: [GRAPHX][EXAMPLES] move graphx test data directory and update graphx document

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 ab4303800 -> f3a359939 [GRAPHX][EXAMPLES] move graphx test data directory and update graphx document ## What changes were proposed in this pull request? There are two test data files used for graphx examples existing in directory

spark git commit: [SPARK-16345][DOCUMENTATION][EXAMPLES][GRAPHX] Extract graphx programming guide example snippets from source files instead of hard code them

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 192d1f9cf -> 0bd7cd18b [SPARK-16345][DOCUMENTATION][EXAMPLES][GRAPHX] Extract graphx programming guide example snippets from source files instead of hard code them ## What changes were proposed in this pull request? I extract 6 example

spark git commit: [SPARK-16345][DOCUMENTATION][EXAMPLES][GRAPHX] Extract graphx programming guide example snippets from source files instead of hard code them

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 f3a359939 -> 0d0b41609 [SPARK-16345][DOCUMENTATION][EXAMPLES][GRAPHX] Extract graphx programming guide example snippets from source files instead of hard code them ## What changes were proposed in this pull request? I extract 6

spark git commit: [MINOR][BUILD] Fix Java linter errors

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/master 0bd7cd18b -> 3000b4b29 [MINOR][BUILD] Fix Java linter errors ## What changes were proposed in this pull request? This PR fixes the minor Java linter errors like the following. ``` -public int read(char cbuf[], int off, int len) throws

spark git commit: [MINOR][BUILD] Fix Java linter errors

2016-07-02 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 0d0b41609 -> 0c6fd03fa [MINOR][BUILD] Fix Java linter errors This PR fixes the minor Java linter errors like the following. ``` -public int read(char cbuf[], int off, int len) throws IOException { +public int read(char[] cbuf,

spark git commit: [SPARK-15858][ML] Fix calculating error by tree stack over flow prob…

2016-06-29 Thread srowen
Repository: spark Updated Branches: refs/heads/master 21385d02a -> 393db655c [SPARK-15858][ML] Fix calculating error by tree stack over flow prob… ## What changes were proposed in this pull request? What changes were proposed in this pull request? Improving evaluateEachIteration function

spark git commit: [MINOR] Closing stale pull requests.

2016-06-20 Thread srowen
Repository: spark Updated Branches: refs/heads/master 359c2e827 -> 92514232e [MINOR] Closing stale pull requests. Closes #13114 Closes #10187 Closes #13432 Closes #13550 Author: Sean Owen <so...@cloudera.com> Closes #13781 from srowen/CloseStalePR. Project: http://git-wip-us.a

spark git commit: [SPARK-15942][REPL] Unblock `:reset` command in REPL.

2016-06-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 001a58960 -> 1b3a9b966 [SPARK-15942][REPL] Unblock `:reset` command in REPL. ## What changes were proposed in this pull (Paste from JIRA issue.) As a follow up for SPARK-15697, I have following semantics for `:reset` command. On `:reset`

spark git commit: [SPARK-16040][MLLIB][DOC] spark.mllib PIC document extra line of refernece

2016-06-19 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.0 2c1c337ba -> 80c6d4e3a [SPARK-16040][MLLIB][DOC] spark.mllib PIC document extra line of refernece ## What changes were proposed in this pull request? In the 2.0 document, Line "A full example that produces the experiment described in

spark git commit: [SPARK-16040][MLLIB][DOC] spark.mllib PIC document extra line of refernece

2016-06-19 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1b3a9b966 -> 5930d7a2e [SPARK-16040][MLLIB][DOC] spark.mllib PIC document extra line of refernece ## What changes were proposed in this pull request? In the 2.0 document, Line "A full example that produces the experiment described in the

spark git commit: [MINOR][DOCS] Remove unused images; crush PNGs that could use it for good measure

2016-07-04 Thread srowen
hor: Sean Owen <so...@cloudera.com> Closes #14029 from srowen/RemoveCompressImages. (cherry picked from commit 18fb57f58a04685823408f3a174a8722f155fd4d) Signed-off-by: Sean Owen <so...@cloudera.com> Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.a

spark git commit: [MINOR][DOCS] Remove unused images; crush PNGs that could use it for good measure

2016-07-04 Thread srowen
ean Owen <so...@cloudera.com> Closes #14029 from srowen/RemoveCompressImages. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/18fb57f5 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/18fb57f5 Diff: http:

svn commit: r1751226 - in /spark: _includes/ images/ site/images/

2016-07-04 Thread srowen
Author: srowen Date: Mon Jul 4 08:31:32 2016 New Revision: 1751226 URL: http://svn.apache.org/viewvc?rev=1751226=rev Log: Remove unused images from Spark site; crush large PNGs; remove obsolete .html _includes Removed: spark/_includes/footer.html spark/_includes/navbar.html spark

spark git commit: [MINOR][DOC] Correct code snippet results in quick start documentation

2016-08-16 Thread srowen
Repository: spark Updated Branches: refs/heads/master 8fdc6ce40 -> 6f0988b12 [MINOR][DOC] Correct code snippet results in quick start documentation ## What changes were proposed in this pull request? As README.md file is updated over time. Some code snippet outputs are not correct based on

spark git commit: [SPARK-19402][DOCS] Support LaTex inline formula correctly and fix warnings in Scala/Java APIs generation

2017-02-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master 9ac05225e -> f1a1f2607 [SPARK-19402][DOCS] Support LaTex inline formula correctly and fix warnings in Scala/Java APIs generation ## What changes were proposed in this pull request? This PR proposes three things as below: - Support LaTex

spark git commit: [SPARK-19410][DOC] Fix brokens links in ml-pipeline and ml-tuning

2017-02-01 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 d35a1268d -> 61cdc8c7c [SPARK-19410][DOC] Fix brokens links in ml-pipeline and ml-tuning ## What changes were proposed in this pull request? Fix brokens links in ml-pipeline and ml-tuning `` -> `` ## How was this patch tested?

spark git commit: [SPARK-19410][DOC] Fix brokens links in ml-pipeline and ml-tuning

2017-02-01 Thread srowen
Repository: spark Updated Branches: refs/heads/master f1a1f2607 -> 04ee8cf63 [SPARK-19410][DOC] Fix brokens links in ml-pipeline and ml-tuning ## What changes were proposed in this pull request? Fix brokens links in ml-pipeline and ml-tuning `` -> `` ## How was this patch tested? manual

spark git commit: [SPARK-19444][ML][DOCUMENTATION] Fix imports not being present in documentation

2017-02-07 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 62fab5bee -> dd1abef13 [SPARK-19444][ML][DOCUMENTATION] Fix imports not being present in documentation ## What changes were proposed in this pull request? SPARK-19444 imports not being present in documentation ## How was this patch

spark git commit: [SPARK-19260] Spaces or "%20" in path parameter are not correctly handled with…

2017-02-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master aee2bd2c7 -> 8fd178d21 [SPARK-19260] Spaces or "%20" in path parameter are not correctly handled with… JIRA Issue: https://issues.apache.org/jira/browse/SPARK-19260 ## What changes were proposed in this pull request? 1.

spark git commit: [SPARK-19444][ML][DOCUMENTATION] Fix imports not being present in documentation

2017-02-07 Thread srowen
Repository: spark Updated Branches: refs/heads/master a97edc2cf -> aee2bd2c7 [SPARK-19444][ML][DOCUMENTATION] Fix imports not being present in documentation ## What changes were proposed in this pull request? SPARK-19444 imports not being present in documentation ## How was this patch

spark git commit: [SPARK-19464][BUILD][HOTFIX][TEST-HADOOP2.6] Add back mockito test dep in YARN module, as it ends up being required in a Maven build

2017-02-08 Thread srowen
ven build ## How was this patch tested? PR builder again, but also a local `mvn` run using the command that the broken Jenkins job uses Author: Sean Owen <so...@cloudera.com> Closes #16853 from srowen/SPARK-19464.2. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: h

spark git commit: [SPARK-19464][BUILD][HOTFIX] run-tests should use hadoop2.6

2017-02-08 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1aeb9f6cb -> c618ccdbe [SPARK-19464][BUILD][HOTFIX] run-tests should use hadoop2.6 ## What changes were proposed in this pull request? After SPARK-19464, **SparkPullRequestBuilder** fails because it still tries to use hadoop2.3.

[2/2] spark git commit: [SPARK-19464][CORE][YARN][TEST-HADOOP2.6] Remove support for Hadoop 2.5 and earlier

2017-02-08 Thread srowen
newer versions - Remove older versions' builds and profiles. ## How was this patch tested? Existing tests Author: Sean Owen <so...@cloudera.com> Closes #16810 from srowen/SPARK-19464. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/rep

spark git commit: [BUILD] Close stale PRs

2017-02-03 Thread srowen
hor: Sean Owen <so...@cloudera.com> Closes #16778 from srowen/CloseStalePRs. Project: http://git-wip-us.apache.org/repos/asf/spark/repo Commit: http://git-wip-us.apache.org/repos/asf/spark/commit/20b4ca14 Tree: http://git-wip-us.apache.org/repos/asf/spark/tree/20b4ca14 Diff: http:

spark git commit: [SPARK-17724][STREAMING][WEBUI] Unevaluated new lines in tooltip in DAG Visualization of a job

2017-01-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master 3c2ba9fcc -> bcdabaac9 [SPARK-17724][STREAMING][WEBUI] Unevaluated new lines in tooltip in DAG Visualization of a job https://issues.apache.org/jira/browse/SPARK-17724 ## What changes were proposed in this pull request? For unevaluated

spark git commit: [SPARK-19117][SPARK-18922][TESTS] Fix the rest of flaky, newly introduced and missed test failures on Windows

2017-01-21 Thread srowen
Repository: spark Updated Branches: refs/heads/master bcdabaac9 -> 6113fe78a [SPARK-19117][SPARK-18922][TESTS] Fix the rest of flaky, newly introduced and missed test failures on Windows ## What changes were proposed in this pull request? **Failed tests** ```

spark git commit: [SPARK-19146][CORE] Drop more elements when stageData.taskData.size > retainedTasks

2017-01-23 Thread srowen
Repository: spark Updated Branches: refs/heads/master c4a6519c4 -> c99492141 [SPARK-19146][CORE] Drop more elements when stageData.taskData.size > retainedTasks ## What changes were proposed in this pull request? Drop more elements when `stageData.taskData.size > retainedTasks` to reduce

spark git commit: [SPARK-16473][MLLIB] Fix BisectingKMeans Algorithm failing in edge case

2017-01-24 Thread srowen
Repository: spark Updated Branches: refs/heads/branch-2.1 9c04e427d -> d128b6a39 [SPARK-16473][MLLIB] Fix BisectingKMeans Algorithm failing in edge case [SPARK-16473][MLLIB] Fix BisectingKMeans Algorithm failing in edge case where no children exist in updateAssignments ## What changes were

spark git commit: delete useless var “j”

2017-01-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master 7c61c2a1c -> cca868004 delete useless var “j” the var “j” defined in "var j = 0" is useless for “def compress” Author: Souljoy Zhuo Closes #16676 from xiaoyesoso/patch-1. Project:

spark git commit: [DOCS] Fix typo in docs

2017-01-24 Thread srowen
Repository: spark Updated Branches: refs/heads/master f27e02476 -> 7c61c2a1c [DOCS] Fix typo in docs ## What changes were proposed in this pull request? Fix typo in docs ## How was this patch tested? Author: uncleGen Closes #16658 from uncleGen/typo-issue. Project:

spark-website git commit: cyclic -> acyclic

2017-01-30 Thread srowen
Repository: spark-website Updated Branches: refs/heads/asf-site 0fce54176 -> fe49ab1ef cyclic -> acyclic Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/fe49ab1e Tree:

spark git commit: [SPARK-19384][ML] forget unpersist input dataset in IsotonicRegression

2017-01-28 Thread srowen
Repository: spark Updated Branches: refs/heads/master 1b5ee2003 -> 42ad93b2c [SPARK-19384][ML] forget unpersist input dataset in IsotonicRegression ## What changes were proposed in this pull request? unpersist the input dataset if `handlePersistence` = true ## How was this patch tested?

spark git commit: [CORE][DOCS] Update a help message for --files in spark-submit

2017-01-26 Thread srowen
Repository: spark Updated Branches: refs/heads/master d3dcb63b9 -> 7045b8b35 [CORE][DOCS] Update a help message for --files in spark-submit ## What changes were proposed in this pull request? This pr is to update a help message for `--files` in spark-submit because it seems users get

spark git commit: [SPARK-19302][DOC][MINOR] Fix the wrong item format in security.md

2017-01-20 Thread srowen
Repository: spark Updated Branches: refs/heads/master 039ed9fe8 -> d50d12b49 [SPARK-19302][DOC][MINOR] Fix the wrong item format in security.md ## What changes were proposed in this pull request? In docs/security.md, there is a description as follows. ``` steps to configure the key-stores

spark git commit: [SPARK-12970][DOCS] Fix the example in SturctType APIs for Scala and Java

2017-01-27 Thread srowen
Repository: spark Updated Branches: refs/heads/master 4172ff80d -> 4e35c5a3d [SPARK-12970][DOCS] Fix the example in SturctType APIs for Scala and Java ## What changes were proposed in this pull request? This PR fixes both, javadoc8 break ``` [error]

spark-website git commit: Add instructions for running individual tests.

2017-02-22 Thread srowen
Repository: spark-website Updated Branches: refs/heads/asf-site 879303593 -> ca64fac2e Add instructions for running individual tests. This is useful and I often forget how to do it. I learned some new tricks when @squito gave @jinxing64 some tips on how to do this, so I thought it was worth

spark-website git commit: Fix last update to Java pi example

2017-02-22 Thread srowen
Repository: spark-website Updated Branches: refs/heads/asf-site ca64fac2e -> 470b7ed51 Fix last update to Java pi example Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/470b7ed5 Tree:

spark git commit: [SPARK-19646][CORE][STREAMING] binaryRecords replicates records in scala API

2017-02-20 Thread srowen
rns the underlying array, which may be reused when repeated reads don't need a different size, as is the case with binaryRecords APIs ## How was this patch tested? Existing tests Author: Sean Owen <so...@cloudera.com> Closes #16974 from srowen/SPARK-19646. Project: http://git-wip-us.apache.org/

spark git commit: [SPARK-19646][CORE][STREAMING] binaryRecords replicates records in scala API

2017-02-20 Thread srowen
rns the underlying array, which may be reused when repeated reads don't need a different size, as is the case with binaryRecords APIs ## How was this patch tested? Existing tests Author: Sean Owen <so...@cloudera.com> Closes #16974 from srowen/SPARK-19646. (cherry picked fr

<    9   10   11   12   13   14   15   16   17   18   >