spark git commit: [MINOR] Add Sphinx into dev/requirements.txt

2018-07-09 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master eb6e98803 -> 4984f1af7 [MINOR] Add Sphinx into dev/requirements.txt ## What changes were proposed in this pull request? Not a big deal but this PR adds `sphinx` into `dev/requirements.txt` since we found it needed -

spark git commit: [SPARK-24530][PYTHON] Add a control to force Python version in Sphinx via environment variable, SPHINXPYTHON

2018-07-10 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 6078b891d -> 1f94bf492 [SPARK-24530][PYTHON] Add a control to force Python version in Sphinx via environment variable, SPHINXPYTHON ## What changes were proposed in this pull request? This PR proposes to add `SPHINXPYTHON` environment

spark git commit: [SPARK-24530][PYTHON] Add a control to force Python version in Sphinx via environment variable, SPHINXPYTHON

2018-07-10 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 72eb97ce9 -> 19542f5de [SPARK-24530][PYTHON] Add a control to force Python version in Sphinx via environment variable, SPHINXPYTHON ## What changes were proposed in this pull request? This PR proposes to add `SPHINXPYTHON`

spark git commit: [SPARK-24749][SQL] Use sameType to compare Array's element type in ArrayContains

2018-07-06 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 4de0425df -> fc43690d3 [SPARK-24749][SQL] Use sameType to compare Array's element type in ArrayContains ## What changes were proposed in this pull request? We should use `DataType.sameType` to compare element type in `ArrayContains`,

spark git commit: [SPARK-24740][PYTHON][ML] Make PySpark's tests compatible with NumPy 1.14+

2018-07-06 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 74f6a92fc -> 044b33b2e [SPARK-24740][PYTHON][ML] Make PySpark's tests compatible with NumPy 1.14+ ## What changes were proposed in this pull request? This PR proposes to make PySpark's tests compatible with NumPy 0.14+ NumPy 0.14.x

spark git commit: [SPARK-24739][PYTHON] Make PySpark compatible with Python 3.7

2018-07-06 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 e5cc5f699 -> 64c72b4de [SPARK-24739][PYTHON] Make PySpark compatible with Python 3.7 ## What changes were proposed in this pull request? This PR proposes to make PySpark compatible with Python 3.7. There are rather radical change in

spark git commit: [SPARK-24739][PYTHON] Make PySpark compatible with Python 3.7

2018-07-06 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master fc43690d3 -> 74f6a92fc [SPARK-24739][PYTHON] Make PySpark compatible with Python 3.7 ## What changes were proposed in this pull request? This PR proposes to make PySpark compatible with Python 3.7. There are rather radical change in

spark git commit: [SPARK-24537][R] Add array_remove / array_zip / map_from_arrays / array_distinct

2018-07-12 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 75725057b -> e0f4f206b [SPARK-24537][R] Add array_remove / array_zip / map_from_arrays / array_distinct ## What changes were proposed in this pull request? Add array_remove / array_zip / map_from_arrays / array_distinct functions in

spark git commit: [SPARK-24529][BUILD][TEST-MAVEN] Add spotbugs into maven build process

2018-07-11 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 3ab48f985 -> 5ad4735bd [SPARK-24529][BUILD][TEST-MAVEN] Add spotbugs into maven build process ## What changes were proposed in this pull request? This PR enables a Java bytecode check tool [spotbugs](https://spotbugs.github.io/) to avoid

spark git commit: [SPARK-24268][SQL] Use datatype.simpleString in error messages

2018-07-09 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 034913b62 -> 1bd3d61f4 [SPARK-24268][SQL] Use datatype.simpleString in error messages ## What changes were proposed in this pull request? SPARK-22893 tried to unify error messages about dataTypes. Unfortunately, still many places were

[3/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/7b3e459e/site/docs/2.2.2/api/python/pyspark.sql.html -- diff --git a/site/docs/2.2.2/api/python/pyspark.sql.html b/site/docs/2.2.2/api/python/pyspark.sql.html index

[1/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
Repository: spark-website Updated Branches: refs/heads/asf-site 2b5ba2f62 -> 7b3e459e2 http://git-wip-us.apache.org/repos/asf/spark-website/blob/7b3e459e/site/docs/2.2.2/api/python/searchindex.js -- diff --git

[5/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/7b3e459e/site/docs/2.2.2/api/python/pyspark.ml.html -- diff --git a/site/docs/2.2.2/api/python/pyspark.ml.html b/site/docs/2.2.2/api/python/pyspark.ml.html index

[6/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
Fix signature description broken in PySpark API documentation in 2.2.2 Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/7b3e459e Tree: http://git-wip-us.apache.org/repos/asf/spark-website/tree/7b3e459e Diff:

[2/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/7b3e459e/site/docs/2.2.2/api/python/pyspark.streaming.html -- diff --git a/site/docs/2.2.2/api/python/pyspark.streaming.html

[4/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.2.2

2018-07-09 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/7b3e459e/site/docs/2.2.2/api/python/pyspark.mllib.html -- diff --git a/site/docs/2.2.2/api/python/pyspark.mllib.html b/site/docs/2.2.2/api/python/pyspark.mllib.html index

[02/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/pyspark.streaming.html -- diff --git a/site/docs/2.1.2/api/python/pyspark.streaming.html

[04/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/pyspark.mllib.html -- diff --git a/site/docs/2.1.2/api/python/pyspark.mllib.html b/site/docs/2.1.2/api/python/pyspark.mllib.html index

[13/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/ml/feature.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/ml/feature.html

[09/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/mllib/regression.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/mllib/regression.html

[06/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/streaming/dstream.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/streaming/dstream.html

[01/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
Repository: spark-website Updated Branches: refs/heads/asf-site 775127770 -> 6bbac4966 http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/searchindex.js -- diff --git

[10/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/mllib/clustering.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/mllib/clustering.html

[05/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/pyspark.ml.html -- diff --git a/site/docs/2.1.2/api/python/pyspark.ml.html b/site/docs/2.1.2/api/python/pyspark.ml.html index

[03/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/pyspark.sql.html -- diff --git a/site/docs/2.1.2/api/python/pyspark.sql.html b/site/docs/2.1.2/api/python/pyspark.sql.html index

[12/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/ml/param/shared.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/ml/param/shared.html

[07/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/sql/functions.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/sql/functions.html

[14/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
Fix signature description broken in PySpark API documentation in 2.1.2 Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/6bbac496 Tree: http://git-wip-us.apache.org/repos/asf/spark-website/tree/6bbac496 Diff:

[08/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/serializers.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/serializers.html

[11/14] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.2

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/6bbac496/site/docs/2.1.2/api/python/_modules/pyspark/ml/regression.html -- diff --git a/site/docs/2.1.2/api/python/_modules/pyspark/ml/regression.html

[1/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.3

2018-07-03 Thread gurwls223
Repository: spark-website Updated Branches: refs/heads/asf-site 6bbac4966 -> da71a5c1d http://git-wip-us.apache.org/repos/asf/spark-website/blob/da71a5c1/site/docs/2.1.3/api/python/searchindex.js -- diff --git

[3/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.3

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/da71a5c1/site/docs/2.1.3/api/python/pyspark.sql.html -- diff --git a/site/docs/2.1.3/api/python/pyspark.sql.html b/site/docs/2.1.3/api/python/pyspark.sql.html index

[5/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.3

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/da71a5c1/site/docs/2.1.3/api/python/pyspark.ml.html -- diff --git a/site/docs/2.1.3/api/python/pyspark.ml.html b/site/docs/2.1.3/api/python/pyspark.ml.html index

[2/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.3

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/da71a5c1/site/docs/2.1.3/api/python/pyspark.streaming.html -- diff --git a/site/docs/2.1.3/api/python/pyspark.streaming.html

[4/6] spark-website git commit: Fix signature description broken in PySpark API documentation in 2.1.3

2018-07-03 Thread gurwls223
http://git-wip-us.apache.org/repos/asf/spark-website/blob/da71a5c1/site/docs/2.1.3/api/python/pyspark.mllib.html -- diff --git a/site/docs/2.1.3/api/python/pyspark.mllib.html b/site/docs/2.1.3/api/python/pyspark.mllib.html index

spark git commit: [SPARK-23177][SQL][PYSPARK] Extract zero-parameter UDFs from aggregate

2018-01-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 15adcc827 -> a3911cf89 [SPARK-23177][SQL][PYSPARK] Extract zero-parameter UDFs from aggregate ## What changes were proposed in this pull request? We extract Python UDFs in logical aggregate which depends on aggregate expression or

spark git commit: [SPARK-22959][PYTHON] Configuration to select the modules for daemon and worker in PySpark

2018-01-13 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 0066d6f6f -> afae8f2bc [SPARK-22959][PYTHON] Configuration to select the modules for daemon and worker in PySpark ## What changes were proposed in this pull request? We are now forced to use `pyspark/daemon.py` and `pyspark/worker.py` in

spark git commit: [SPARK-7721][PYTHON][TESTS] Adds PySpark coverage generation script

2018-01-22 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 5d680cae4 -> 87ffe7add [SPARK-7721][PYTHON][TESTS] Adds PySpark coverage generation script ## What changes were proposed in this pull request? Note that this PR was made based on the top of https://github.com/apache/spark/pull/20151. So,

spark git commit: [MINOR][SQL] Fix wrong comments on org.apache.spark.sql.parquet.row.attributes

2018-01-21 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 11daeb833 -> 8142a3b88 [MINOR][SQL] Fix wrong comments on org.apache.spark.sql.parquet.row.attributes ## What changes were proposed in this pull request? This PR fixes the wrong comment on `org.apache.spark.sql.parquet.row.attributes`

spark git commit: [MINOR][SQL] Fix wrong comments on org.apache.spark.sql.parquet.row.attributes

2018-01-21 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 5781fa79e -> 36af73b59 [MINOR][SQL] Fix wrong comments on org.apache.spark.sql.parquet.row.attributes ## What changes were proposed in this pull request? This PR fixes the wrong comment on `org.apache.spark.sql.parquet.row.attributes`

spark git commit: [SPARK-23177][SQL][PYSPARK][BACKPORT-2.3] Extract zero-parameter UDFs from aggregate

2018-01-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 d656be74b -> 84a189a34 [SPARK-23177][SQL][PYSPARK][BACKPORT-2.3] Extract zero-parameter UDFs from aggregate ## What changes were proposed in this pull request? We extract Python UDFs in logical aggregate which depends on aggregate

spark git commit: [SPARK-23148][SQL] Allow pathnames with special characters for CSV / JSON / text

2018-01-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 84a189a34 -> 17317c8fb [SPARK-23148][SQL] Allow pathnames with special characters for CSV / JSON / text …JSON / text ## What changes were proposed in this pull request? Fix for JSON and CSV data sources when file names include

spark git commit: [SPARK-23148][SQL] Allow pathnames with special characters for CSV / JSON / text

2018-01-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 7af1a325d -> de36f65d3 [SPARK-23148][SQL] Allow pathnames with special characters for CSV / JSON / text …JSON / text ## What changes were proposed in this pull request? Fix for JSON and CSV data sources when file names include

spark git commit: [SPARK-23174][BUILD][PYTHON] python code style checker update

2018-01-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 4e7b49041 -> 7af1a325d [SPARK-23174][BUILD][PYTHON] python code style checker update ## What changes were proposed in this pull request? Referencing latest python code style checking from PyPi/pycodestyle Removed pending TODO For now, in

spark git commit: [SPARK-23380][PYTHON] Adds a conf for Arrow fallback in toPandas/createDataFrame with Pandas DataFrame

2018-03-08 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 9bb239c8b -> d6632d185 [SPARK-23380][PYTHON] Adds a conf for Arrow fallback in toPandas/createDataFrame with Pandas DataFrame ## What changes were proposed in this pull request? This PR adds a configuration to control the fallback of

spark git commit: [SPARK-23159][PYTHON] Update cloudpickle to v0.4.3

2018-03-08 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master fe22f3204 -> 9bb239c8b [SPARK-23159][PYTHON] Update cloudpickle to v0.4.3 ## What changes were proposed in this pull request? The version of cloudpickle in PySpark was close to version 0.4.0 with some additional backported fixes and some

spark git commit: [SPARK-23011][SQL][PYTHON] Support alternative function form with group aggregate pandas UDF

2018-03-08 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master d6632d185 -> 2cb23a8f5 [SPARK-23011][SQL][PYTHON] Support alternative function form with group aggregate pandas UDF ## What changes were proposed in this pull request? This PR proposes to support an alternative function from with group

spark git commit: [SPARK-23522][PYTHON] always use sys.exit over builtin exit

2018-03-08 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 2cb23a8f5 -> 7013eea11 [SPARK-23522][PYTHON] always use sys.exit over builtin exit The exit() builtin is only for interactive use. applications should use sys.exit(). ## What changes were proposed in this pull request? All usage of the

spark git commit: [SPARK-23462][SQL] improve missing field error message in `StructType`

2018-03-12 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 265e61ee9 -> a8e357ada [SPARK-23462][SQL] improve missing field error message in `StructType` ## What changes were proposed in this pull request? The error message ```s"""Field "$name" does not exist."""``` is thrown when looking up

spark git commit: [SPARK-23462][SQL] improve missing field error message in `StructType`

2018-03-12 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master b6f837c9d -> b304e07e0 [SPARK-23462][SQL] improve missing field error message in `StructType` ## What changes were proposed in this pull request? The error message ```s"""Field "$name" does not exist."""``` is thrown when looking up an

spark git commit: [MINOR][SQL][TEST] Create table using `dataSourceName` in `HadoopFsRelationTest`

2018-03-13 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 23370554d -> 9ddd1e2ce [MINOR][SQL][TEST] Create table using `dataSourceName` in `HadoopFsRelationTest` ## What changes were proposed in this pull request? This PR fixes a minor issue in `HadoopFsRelationTest`, that you should create

spark git commit: [PYTHON] Changes input variable to not conflict with built-in function

2018-03-10 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 1a54f48b6 -> b6f837c9d [PYTHON] Changes input variable to not conflict with built-in function Signed-off-by: DylanGuedes ## What changes were proposed in this pull request? Changes variable name conflict: [input is a built-in python

spark git commit: [PYTHON] Changes input variable to not conflict with built-in function

2018-03-10 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 5bd306c38 -> 265e61ee9 [PYTHON] Changes input variable to not conflict with built-in function Signed-off-by: DylanGuedes ## What changes were proposed in this pull request? Changes variable name conflict: [input is a built-in python

spark git commit: [SPARK-23642][DOCS] AccumulatorV2 subclass isZero scaladoc fix

2018-03-14 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 0663b6119 -> a9d0784e6 [SPARK-23642][DOCS] AccumulatorV2 subclass isZero scaladoc fix Added/corrected scaladoc for isZero on the DoubleAccumulator, CollectionAccumulator, and LongAccumulator subclasses of AccumulatorV2, particularly

spark git commit: [SPARK-23642][DOCS] AccumulatorV2 subclass isZero scaladoc fix

2018-03-14 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 279b3db89 -> 4f5bad615 [SPARK-23642][DOCS] AccumulatorV2 subclass isZero scaladoc fix Added/corrected scaladoc for isZero on the DoubleAccumulator, CollectionAccumulator, and LongAccumulator subclasses of AccumulatorV2, particularly

spark git commit: [SPARK-23436][SQL][BACKPORT-2.3] Infer partition as Date only if it can be casted to Date

2018-03-09 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 404f7e201 -> 8ff8e16e2 [SPARK-23436][SQL][BACKPORT-2.3] Infer partition as Date only if it can be casted to Date This PR is to backport https://github.com/apache/spark/pull/20621 to branch 2.3 --- ## What changes were proposed in

[2/2] spark git commit: [MINOR][DOC] Fix some typos and grammar issues

2018-04-05 Thread gurwls223
[MINOR][DOC] Fix some typos and grammar issues ## What changes were proposed in this pull request? Easy fix in the documentation. ## How was this patch tested? N/A Closes #20948 Author: Daniel Sakuma Closes #20928 from dsakuma/fix_typo_configuration_docs. Project:

[1/2] spark git commit: [MINOR][DOC] Fix some typos and grammar issues

2018-04-05 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 249007e37 -> 6ade5cbb4 http://git-wip-us.apache.org/repos/asf/spark/blob/6ade5cbb/docs/structured-streaming-programming-guide.md -- diff --git

spark git commit: [SPARK-23847][FOLLOWUP][PYTHON][SQL] Actually test [desc|acs]_nulls_[first|last] functions in PySpark

2018-04-11 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 7c7570d46 -> c7622befd [SPARK-23847][FOLLOWUP][PYTHON][SQL] Actually test [desc|acs]_nulls_[first|last] functions in PySpark ## What changes were proposed in this pull request? There was a mistake in `tests.py` missing `assertEquals`.

spark git commit: [MINOR][DOCS] Fix R documentation generation instruction for roxygen2

2018-04-11 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master c7622befd -> 87611bba2 [MINOR][DOCS] Fix R documentation generation instruction for roxygen2 ## What changes were proposed in this pull request? This PR proposes to fix `roxygen2` to `5.0.1` in `docs/README.md` for SparkR documentation

spark git commit: [MINOR][DOCS] Fix R documentation generation instruction for roxygen2

2018-04-11 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 0f2aabc6b -> 320269e49 [MINOR][DOCS] Fix R documentation generation instruction for roxygen2 ## What changes were proposed in this pull request? This PR proposes to fix `roxygen2` to `5.0.1` in `docs/README.md` for SparkR

spark git commit: [SPARK-23942][PYTHON][SQL][BRANCH-2.3] Makes collect in PySpark as action for a query executor listener

2018-04-14 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 dfdf1bb9b -> d4f204c53 [SPARK-23942][PYTHON][SQL][BRANCH-2.3] Makes collect in PySpark as action for a query executor listener ## What changes were proposed in this pull request? This PR proposes to add `collect` to a query executor

spark git commit: [SPARK-23861][SQL][DOC] Clarify default window frame with and without orderBy clause

2018-04-06 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 945240193 -> d766ea2ff [SPARK-23861][SQL][DOC] Clarify default window frame with and without orderBy clause ## What changes were proposed in this pull request? Add docstring to clarify default window frame boundaries with and without

spark git commit: [SPARK-24029][CORE] Set SO_REUSEADDR on listen sockets.

2018-04-21 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 1d758dc73 -> 32b4bcd6d [SPARK-24029][CORE] Set SO_REUSEADDR on listen sockets. This allows sockets to be bound even if there are sockets from a previous application that are still pending closure. It avoids bind issues when, for example,

spark git commit: [SPARK-21811][SQL] Fix the inconsistency behavior when finding the widest common type

2018-04-19 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 9e10f69df -> d96c3e33c [SPARK-21811][SQL] Fix the inconsistency behavior when finding the widest common type ## What changes were proposed in this pull request? Currently we find the wider common type by comparing the two types from left

spark git commit: [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId

2018-04-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 281c1ca0d -> c303b1b67 [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId ## What changes were proposed in this pull request? Fix comment. Change `BroadcastHashJoin.broadcastFuture` to `BroadcastExchangeExec.relationFuture`:

spark git commit: [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId

2018-04-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.2 041aec4e1 -> e77d62a72 [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId ## What changes were proposed in this pull request? Fix comment. Change `BroadcastHashJoin.broadcastFuture` to `BroadcastExchangeExec.relationFuture`:

spark git commit: [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId

2018-04-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 1c3e8205d -> 096defdd7 [MINOR][DOCS] Fix comments of SQLExecution#withExecutionId ## What changes were proposed in this pull request? Fix comment. Change `BroadcastHashJoin.broadcastFuture` to `BroadcastExchangeExec.relationFuture`:

spark git commit: [SPARK-24054][R] Add array_position function / element_at functions

2018-04-24 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master c303b1b67 -> 87e8a572b [SPARK-24054][R] Add array_position function / element_at functions ## What changes were proposed in this pull request? This PR proposes to add array_position and element_at in R side too. array_position: ```r df

spark git commit: [SPARK-24029][CORE] Follow up: set SO_REUSEADDR on the server socket.

2018-04-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master c8f3ac69d -> 428b90385 [SPARK-24029][CORE] Follow up: set SO_REUSEADDR on the server socket. "childOption" is for the remote connections, not for the server socket that actually listens for incoming connections. Author: Marcelo Vanzin

spark git commit: [SPARK-23770][R] Exposes repartitionByRange in SparkR

2018-03-29 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 641aec68e -> 505480cb5 [SPARK-23770][R] Exposes repartitionByRange in SparkR ## What changes were proposed in this pull request? This PR proposes to expose `repartitionByRange`. ```R > df <- createDataFrame(iris) ... >

spark git commit: [MINOR][DOC] Fix a few markdown typos

2018-04-02 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 6ca6483c1 -> ce1565115 [MINOR][DOC] Fix a few markdown typos ## What changes were proposed in this pull request? Easy fix in the markdown. ## How was this patch tested? jekyII build test manually. Please review

spark git commit: [MINOR][DOC] Fix a few markdown typos

2018-04-02 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 441d0d076 -> 8020f66fc [MINOR][DOC] Fix a few markdown typos ## What changes were proposed in this pull request? Easy fix in the markdown. ## How was this patch tested? jekyII build test manually. Please review

spark git commit: [SPARK-19964][CORE] Avoid reading from remote repos in SparkSubmitSuite.

2018-04-02 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 f1f10da2b -> 6ca6483c1 [SPARK-19964][CORE] Avoid reading from remote repos in SparkSubmitSuite. These tests can fail with a timeout if the remote repos are not responding, or slow. The tests don't need anything from those repos, so use

spark-website git commit: add committer

2018-04-02 Thread gurwls223
Repository: spark-website Updated Branches: refs/heads/asf-site a1d84bcbf -> 114925632 add committer Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/11492563 Tree:

spark-website git commit: Update committer page

2018-04-02 Thread gurwls223
Repository: spark-website Updated Branches: refs/heads/asf-site 114925632 -> f524d4f53 Update committer page Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/f524d4f5 Tree:

spark git commit: [MINOR][CORE] Show block manager id when remove RDD/Broadcast fails.

2018-04-03 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 8020f66fc -> 7cf9fab33 [MINOR][CORE] Show block manager id when remove RDD/Broadcast fails. ## What changes were proposed in this pull request? Address https://github.com/apache/spark/pull/20924#discussion_r177987175, show block manager

spark git commit: [MINOR][CORE] Show block manager id when remove RDD/Broadcast fails.

2018-04-03 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 ce1565115 -> f36bdb401 [MINOR][CORE] Show block manager id when remove RDD/Broadcast fails. ## What changes were proposed in this pull request? Address https://github.com/apache/spark/pull/20924#discussion_r177987175, show block

spark git commit: [SPARK-23700][PYTHON] Cleanup imports in pyspark.sql

2018-03-25 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master e4bec7cb8 -> a9350d709 [SPARK-23700][PYTHON] Cleanup imports in pyspark.sql ## What changes were proposed in this pull request? This cleans up unused imports, mainly from pyspark.sql module. Added a note in function.py that imports

spark git commit: [SPARK-23645][MINOR][DOCS][PYTHON] Add docs RE `pandas_udf` with keyword args

2018-03-25 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 2fd7acabf -> 328dea6f8 [SPARK-23645][MINOR][DOCS][PYTHON] Add docs RE `pandas_udf` with keyword args ## What changes were proposed in this pull request? Add documentation about the limitations of `pandas_udf` with keyword arguments

spark git commit: [SPARK-23645][MINOR][DOCS][PYTHON] Add docs RE `pandas_udf` with keyword args

2018-03-25 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master a9350d709 -> 087fb3142 [SPARK-23645][MINOR][DOCS][PYTHON] Add docs RE `pandas_udf` with keyword args ## What changes were proposed in this pull request? Add documentation about the limitations of `pandas_udf` with keyword arguments and

spark git commit: [SPARK-23706][PYTHON] spark.conf.get(value, default=None) should produce None in PySpark

2018-03-18 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 8a1efe307 -> 61487b308 [SPARK-23706][PYTHON] spark.conf.get(value, default=None) should produce None in PySpark ## What changes were proposed in this pull request? Scala: ``` scala> spark.conf.get("hey", null) res1: String = null ```

spark git commit: [SPARK-23706][PYTHON] spark.conf.get(value, default=None) should produce None in PySpark

2018-03-18 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 6937571ab -> 80e79430f [SPARK-23706][PYTHON] spark.conf.get(value, default=None) should produce None in PySpark Scala: ``` scala> spark.conf.get("hey", null) res1: String = null ``` ``` scala>

spark git commit: [SPARK-23691][PYTHON][BRANCH-2.3] Use sql_conf util in PySpark tests where possible

2018-03-20 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 2f82c037d -> c854b6ca7 [SPARK-23691][PYTHON][BRANCH-2.3] Use sql_conf util in PySpark tests where possible ## What changes were proposed in this pull request? This PR backports https://github.com/apache/spark/pull/20830 to reduce the

spark git commit: [SPARK-23769][CORE] Remove comments that unnecessarily disable Scalastyle check

2018-03-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 92e952557 -> 6ac4fba69 [SPARK-23769][CORE] Remove comments that unnecessarily disable Scalastyle check ## What changes were proposed in this pull request? We re-enabled the Scalastyle checker on a line of code. It was previously

spark git commit: [SPARK-23769][CORE] Remove comments that unnecessarily disable Scalastyle check

2018-03-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 45761ceb2 -> ce0fbec68 [SPARK-23769][CORE] Remove comments that unnecessarily disable Scalastyle check ## What changes were proposed in this pull request? We re-enabled the Scalastyle checker on a line of code. It was previously

spark git commit: [MINOR][R] Fix R lint failure

2018-03-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 1d0d0a5fc -> 45761ceb2 [MINOR][R] Fix R lint failure ## What changes were proposed in this pull request? The lint failure bugged me: ```R R/SQLContext.R:715:97: style: Trailing whitespace is superfluous. #'file-based

spark git commit: [MINOR][R] Fix R lint failure

2018-03-23 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 5fa438471 -> 92e952557 [MINOR][R] Fix R lint failure ## What changes were proposed in this pull request? The lint failure bugged me: ```R R/SQLContext.R:715:97: style: Trailing whitespace is superfluous. #'file-based streaming

spark git commit: [SQL][MINOR] XPathDouble prettyPrint should say 'double' not 'float'

2018-03-04 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 486f99eef -> a89cdf55f [SQL][MINOR] XPathDouble prettyPrint should say 'double' not 'float' ## What changes were proposed in this pull request? It looks like this was incorrectly copied from `XPathFloat` in the class above. ## How was

spark git commit: [SQL][MINOR] XPathDouble prettyPrint should say 'double' not 'float'

2018-03-04 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 f12fa13f1 -> 26a8a675a [SQL][MINOR] XPathDouble prettyPrint should say 'double' not 'float' ## What changes were proposed in this pull request? It looks like this was incorrectly copied from `XPathFloat` in the class above. ## How

spark git commit: [SPARK-23569][PYTHON] Allow pandas_udf to work with python3 style type-annotated functions

2018-03-04 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master a89cdf55f -> 7965c91d8 [SPARK-23569][PYTHON] Allow pandas_udf to work with python3 style type-annotated functions ## What changes were proposed in this pull request? Check python version to determine whether to use `inspect.getargspec`

spark git commit: [SPARK-23569][PYTHON] Allow pandas_udf to work with python3 style type-annotated functions

2018-03-04 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 26a8a675a -> c8aa6fbb0 [SPARK-23569][PYTHON] Allow pandas_udf to work with python3 style type-annotated functions ## What changes were proposed in this pull request? Check python version to determine whether to use

spark git commit: [SPARK-23329][SQL] Fix documentation of trigonometric functions

2018-03-05 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 5ff72ffcf -> a366b950b [SPARK-23329][SQL] Fix documentation of trigonometric functions ## What changes were proposed in this pull request? Provide more details in trigonometric function documentations. Referenced `java.lang.Math` for

spark git commit: [SPARK-23329][SQL] Fix documentation of trigonometric functions

2018-03-05 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 88dd335f6 -> 232b9f81f [SPARK-23329][SQL] Fix documentation of trigonometric functions ## What changes were proposed in this pull request? Provide more details in trigonometric function documentations. Referenced `java.lang.Math` for

spark git commit: [SPARK-23566][MINOR][DOC] Argument name mismatch fixed

2018-03-05 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 42cf48e20 -> 5ff72ffcf [SPARK-23566][MINOR][DOC] Argument name mismatch fixed Argument name mismatch fixed. ## What changes were proposed in this pull request? `col` changed to `new` in doc string to match the argument list. Patch file

spark git commit: [SPARK-23448][SQL] Clarify JSON and CSV parser behavior in document

2018-02-27 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 23ac3aaba -> b14993e1f [SPARK-23448][SQL] Clarify JSON and CSV parser behavior in document ## What changes were proposed in this pull request? Clarify JSON and CSV reader behavior in document. JSON doesn't support partial results for

spark git commit: [SPARK-23448][SQL] Clarify JSON and CSV parser behavior in document

2018-02-27 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 30242b664 -> fe9cb4afe [SPARK-23448][SQL] Clarify JSON and CSV parser behavior in document ## What changes were proposed in this pull request? Clarify JSON and CSV reader behavior in document. JSON doesn't support partial results for

spark git commit: [SPARK-23517][PYTHON] Make `pyspark.util._exception_message` produce the trace from Java side by Py4JJavaError

2018-02-28 Thread gurwls223
Repository: spark Updated Branches: refs/heads/branch-2.3 dfa43792f -> a4eb1e47a [SPARK-23517][PYTHON] Make `pyspark.util._exception_message` produce the trace from Java side by Py4JJavaError ## What changes were proposed in this pull request? This PR proposes for

spark git commit: [SPARK-23517][PYTHON] Make `pyspark.util._exception_message` produce the trace from Java side by Py4JJavaError

2018-02-28 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 6a8abe29e -> fab563b9b [SPARK-23517][PYTHON] Make `pyspark.util._exception_message` produce the trace from Java side by Py4JJavaError ## What changes were proposed in this pull request? This PR proposes for

spark git commit: [SPARK-23417][PYTHON] Fix the build instructions supplied by exception messages in python streaming tests

2018-02-27 Thread gurwls223
Repository: spark Updated Branches: refs/heads/master 598446b74 -> 23ac3aaba [SPARK-23417][PYTHON] Fix the build instructions supplied by exception messages in python streaming tests ## What changes were proposed in this pull request? Fix the build instructions supplied by exception

<    1   2   3   4   5   6   7   8   9   10   >