Repository: spark
Updated Branches:
refs/heads/master 7b2dca5b1 -> 0cf59fcbe
[SPARK-24303][PYTHON] Update cloudpickle to v0.4.4
## What changes were proposed in this pull request?
cloudpickle 0.4.4 is released -
https://github.com/cloudpipe/cloudpickle/releases/tag/v0.4.4
There's no
Repository: spark
Updated Branches:
refs/heads/master b142157dc -> ec6f971dc
[SPARK-23161][PYSPARK][ML] Add missing APIs to Python GBTClassifier
## What changes were proposed in this pull request?
Add featureSubsetStrategy in GBTClassifier and GBTRegressor. Also make
GBTClassificationModel
Repository: spark
Updated Branches:
refs/heads/master 6d16b9885 -> d48803bf6
[SPARK-24324][PYTHON][FOLLOWUP] Grouped Map positional conf should have
deprecation note
## What changes were proposed in this pull request?
Followup to the discussion of the added conf in SPARK-24324 which allows
Repository: spark
Updated Branches:
refs/heads/master ce2f919f8 -> 4f1e38649
[SPARK-24057][PYTHON] put the real data type in the AssertionError message
## What changes were proposed in this pull request?
Print out the data type in the AssertionError message to make it more
meaningful.
##
Repository: spark
Updated Branches:
refs/heads/master 4f1e38649 -> f7435bec6
[SPARK-24044][PYTHON] Explicitly print out skipped tests from unittest module
## What changes were proposed in this pull request?
This PR proposes to remove duplicated dependency checking logic and also print
out
Repository: spark-website
Updated Branches:
refs/heads/asf-site 3f874c90a -> 6853fd7c6
Update committer pages
Project: http://git-wip-us.apache.org/repos/asf/spark-website/repo
Commit: http://git-wip-us.apache.org/repos/asf/spark-website/commit/6853fd7c
Tree:
Repository: spark
Updated Branches:
refs/heads/branch-2.3 fc3df4517 -> 5b187a85a
[SPARK-24976][PYTHON] Allow None for Decimal type conversion (specific to
PyArrow 0.9.0)
## What changes were proposed in this pull request?
See [ARROW-2432](https://jira.apache.org/jira/browse/ARROW-2432).
Repository: spark
Updated Branches:
refs/heads/master 42dfe4f15 -> f4772fd26
[SPARK-24976][PYTHON] Allow None for Decimal type conversion (specific to
PyArrow 0.9.0)
## What changes were proposed in this pull request?
See [ARROW-2432](https://jira.apache.org/jira/browse/ARROW-2432). Seems
Repository: spark
Updated Branches:
refs/heads/master 92fd7f321 -> ed075e1ff
[SPARK-23874][SQL][PYTHON] Upgrade Apache Arrow to 0.10.0
## What changes were proposed in this pull request?
Upgrade Apache Arrow to 0.10.0
Version 0.10.0 has a number of bug fixes and improvements with the
Repository: spark
Updated Branches:
refs/heads/master ba84bcb2c -> 10f2b6fa0
[SPARK-23555][PYTHON] Add BinaryType support for Arrow in Python
## What changes were proposed in this pull request?
Adding `BinaryType` support for Arrow in pyspark, conditional on using pyarrow
>= 0.10.0. Earlier
Repository: spark
Updated Branches:
refs/heads/master e75488718 -> 71f38ac24
[SPARK-23698][PYTHON] Resolve undefined names in Python 3
## What changes were proposed in this pull request?
Fix issues arising from the fact that builtins __file__, __long__,
__raw_input()__, __unicode__,
Repository: spark
Updated Branches:
refs/heads/master 71f38ac24 -> 2381953ab
[SPARK-25105][PYSPARK][SQL] Include PandasUDFType in the import all of
pyspark.sql.functions
## What changes were proposed in this pull request?
Include PandasUDFType in the import all of pyspark.sql.functions
##
Repository: spark
Updated Branches:
refs/heads/master f5817d8bb -> 7ef6d1daf
[SPARK-25328][PYTHON] Add an example for having two columns as the grouping key
in group aggregate pandas UDF
## What changes were proposed in this pull request?
This PR proposes to add another example for multiple
Repository: spark
Updated Branches:
refs/heads/branch-2.4 085f731ad -> f2d502223
[SPARK-25328][PYTHON] Add an example for having two columns as the grouping key
in group aggregate pandas UDF
## What changes were proposed in this pull request?
This PR proposes to add another example for
Repository: spark
Updated Branches:
refs/heads/branch-2.4 f2d502223 -> 3682d29f4
[SPARK-25072][PYSPARK] Forbid extra value for custom Row
## What changes were proposed in this pull request?
Add value length check in `_create_row`, forbid extra value for custom Row in
PySpark.
## How was
Repository: spark
Updated Branches:
refs/heads/master 3b6591b0b -> c84bc40d7
[SPARK-25072][PYSPARK] Forbid extra value for custom Row
## What changes were proposed in this pull request?
Add value length check in `_create_row`, forbid extra value for custom Row in
PySpark.
## How was this
Repository: spark
Updated Branches:
refs/heads/branch-2.3 9db81fd86 -> 31dab7140
[SPARK-25072][PYSPARK] Forbid extra value for custom Row
## What changes were proposed in this pull request?
Add value length check in `_create_row`, forbid extra value for custom Row in
PySpark.
## How was
Repository: spark
Updated Branches:
refs/heads/branch-2.4 5d98c3194 -> ffd036a6d
[SPARK-23672][PYTHON] Document support for nested return types in scalar with
arrow udfs
## What changes were proposed in this pull request?
Clarify docstring for Scalar functions
## How was this patch tested?
Repository: spark
Updated Branches:
refs/heads/master b6935ffb4 -> e99825058
[SPARK-23828][ML][PYTHON] PySpark StringIndexerModel should have constructor
from labels
## What changes were proposed in this pull request?
The Scala StringIndexerModel has an alternate constructor that will
Repository: spark
Updated Branches:
refs/heads/master c68ec4e6a -> ed72badb0
[SPARK-23699][PYTHON][SQL] Raise same type of error caught with Arrow enabled
## What changes were proposed in this pull request?
When using Arrow for createDataFrame or toPandas and an error is encountered
with
Repository: spark
Updated Branches:
refs/heads/master 529f84710 -> 44a9f8e6e
[SPARK-15009][PYTHON][FOLLOWUP] Add default param checks for
CountVectorizerModel
## What changes were proposed in this pull request?
Adding test for default params for `CountVectorizerModel` constructed from
Repository: spark
Updated Branches:
refs/heads/master b30a7d28b -> 3e778f5a9
[SPARK-23162][PYSPARK][ML] Add r2adj into Python API in LinearRegressionSummary
## What changes were proposed in this pull request?
Adding r2adj in LinearRegressionSummary for Python API.
## How was this patch
Repository: spark
Updated Branches:
refs/heads/master 5f4deff19 -> 566321852
[SPARK-23691][PYTHON] Use sql_conf util in PySpark tests where possible
## What changes were proposed in this pull request?
https://github.com/apache/spark/commit/d6632d185e147fcbe6724545488ad80dce20277e
added an
Repository: spark
Updated Branches:
refs/heads/master 95c03cbd2 -> a33655348
[SPARK-23615][ML][PYSPARK] Add maxDF Parameter to Python CountVectorizer
## What changes were proposed in this pull request?
The maxDF parameter is for filtering out frequently occurring terms. This param
was
Repository: spark
Updated Branches:
refs/heads/master e1d3f8010 -> 2224861f2
[SPARK-24439][ML][PYTHON] Add distanceMeasure to BisectingKMeans in PySpark
## What changes were proposed in this pull request?
add distanceMeasure to BisectingKMeans in Python.
## How was this patch tested?
Repository: spark
Updated Branches:
refs/heads/master b19a28dea -> 7251be0c0
[SPARK-25798][PYTHON] Internally document type conversion between Pandas data
and SQL types in Pandas UDFs
## What changes were proposed in this pull request?
We are facing some problems about type conversions
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 75d8449 [SPARK-26676][PYTHON] Make
Repository: spark
Updated Branches:
refs/heads/master 3b8ae2373 -> 20278e719
[SPARK-24333][ML][PYTHON] Add fit with validation set to spark.ml GBT: Python
API
## What changes were proposed in this pull request?
Add validationIndicatorCol and validationTol to GBT Python.
## How was this
Repository: spark
Updated Branches:
refs/heads/master 187bb7d00 -> 518a3d10c
[SPARK-26033][SPARK-26034][PYTHON][FOLLOW-UP] Small cleanup and deduplication
in ml/mllib tests
## What changes were proposed in this pull request?
This PR is a small follow up that puts some logic and functions
Repository: spark
Updated Branches:
refs/heads/master ab76900fe -> ecaa495b1
[SPARK-25274][PYTHON][SQL] In toPandas with Arrow send un-ordered record
batches to improve performance
## What changes were proposed in this pull request?
When executing `toPandas` with Arrow enabled, partitions
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 32515d2 [SPARK-26349][PYSPARK] Forbid insecure
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 594be7a [SPARK-27240][PYTHON] Use pandas
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new ddc2052 [SPARK-23836][PYTHON] Add support
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new f9ca8ab [SPARK-27805][PYTHON] Propagate
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 9df7587 [MINOR][CORE] Fix line too long in TransportClientFactory
add d0fbc4d [SPARK-28003][PYTHON] Allow NaT
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 54da3bb [SPARK-28127][SQL] Micro optimization on TreeNode's
mapChildren method
add 9b9d81b [SPARK-28131
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 9b9d81b [SPARK-28131][PYTHON] Update document type conversion between
Python data and SQL types in normal UDFs
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 8375103 [SPARK-27557][DOC] Add copy button to Python API docs for
easier copying of code-blocks
add 9623420
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 2f55809 [SPARK-27294][SS] Add multi-cluster Kafka delegation token
add 5e79ae3 [SPARK-23961][SPARK-27548
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new c277afb [SPARK-27992][PYTHON] Allow Python
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 7858e53 [SPARK-28323][SQL][PYTHON] PythonUDF
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from d25cbd4 [SPARK-28839][CORE] Avoids NPE in context cleaner when
dynamic allocation and shuffle service
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 197732e [SPARK-29125][INFRA] Add Hadoop 2.7 combination to GitHub
Action
add 05988b2 [SPARK-27463][PYTHON
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 0bdadba [SPARK-29790][DOC] Note required port for Kube API
add 7fc9db0 [SPARK-29798][PYTHON][SQL] Infers
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 00347a3 [SPARK-28762][CORE] Read JAR main class if JAR is not located
in local file system
add 901ff92
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 3d2a6f4 [SPARK-29906][SQL] AQE should not introduce extra shuffle for
outermost limit
add e804ed5 [SPARK
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 6390f02 [SPARK-29367][DOC] Add compatibility note for Arrow 0.15.0 to
SQL guide
add beb8d2f [SPARK-29402
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from d0800fc [SPARK-30314] Add identifier and catalog information to
DataSourceV2Relation
add 43d9c7e [SPARK
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from b5bc3e1 [SPARK-30312][SQL] Preserve path permission and acl when
truncate table
add f372d1c [SPARK-29748
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from e3a88a9 [SPARK-32516][SQL] 'path' option cannot coexist with load()'s
path parameters
add 41cf1d0 [SPARK
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 339b0eca [SPARK-25351][SQL][PYTHON] Handle
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from 7aed81d [SPARK-33202][CORE] Fix BlockManagerDecommissioner to return
the correct migration status
add 66005a3
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch branch-2.4
in repository https://gitbox.apache.org/repos/asf/spark.git.
from a4854d6 [MINOR][DOCS] Fix typo in PySpark example in ml-datasource.md
add 5084c71 [SPARK-32300][PYTHON
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch branch-2.4
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/branch-2.4 by this push:
new 5084c71 [SPARK-32300][PYTHON][2.4
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git.
from d06604f [SPARK-32078][DOC] Add a redirect to sql-ref from
sql-reference
add 1af19a7 [SPARK-32098][PYTHON
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch branch-3.0
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/branch-3.0 by this push:
new 2d6232a [SPARK-32098][PYTHON] Use iloc
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch branch-2.4
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/branch-2.4 by this push:
new a295003 [SPARK-32098][PYTHON] Use iloc
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 00d06ca [SPARK-31915][SQL][PYTHON] Resolve
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 9b875ce [SPARK-32953][PYTHON][SQL] Add Arrow
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push:
new 816aba3 [SPARK-34521][PYTHON][SQL] Fix
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
from f5c3f0c228f [SPARK-39164][SQL] Wrap asserts/illegal state exceptions
by the INTERNAL_ERROR exception in actions
This is an automated email from the ASF dual-hosted git repository.
cutlerb pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/spark.git
from 08678456d16 [SPARK-40476][ML][SQL] Reduce the shuffle size of ALS
add 7b8016a578f [SPARK-38098][PYTHON] Add