commits
Messages by Thread
(spark-docker) branch master updated: [SPARK-55549] Update PR template according to the ASF Generative Tooling Guidance recommendations
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55545] Improve `create_spark_jira.py` to support `-p` to set the parent JIRA ID
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55537] Check `spark.dynamicAllocation.enabled` before overriding deleteOnTermination
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55543] Change `SentinelManager.getSentinelResources` to `package private`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55542] Upgrade `gRPC Swift Protobuf` to 2.2.0
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55538] Add IDEs and MacOS settings to `.gitignore`
dongjoon
(spark) branch master updated: [SPARK-55533][SQL] Support IGNORE NULLS / RESPECT NULLS for collect_set
dongjoon
[PR] Highlight Declarative Pipeline support on homepage [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55129][SS] Introduce new key encoders for timestamp as a first class (UnsafeRow)
kabhwan
(spark) branch master updated: [SPARK-55479][SQL] Fix style issues in SparkShreddingUtils
dongjoon
(spark-website) branch asf-site updated: Make `site/docs/latest` up-to-date with 4.1.1 (#677)
dongjoon
[PR] Make `site/docs/latest` up-to-date with 4.1.1 [spark-website]
via GitHub
Re: [PR] Make `site/docs/latest` up-to-date with 4.1.1 [spark-website]
via GitHub
Re: [PR] Make `site/docs/latest` up-to-date with 4.1.1 [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55522][WEBUI] Allow inline scripts, event handlers and styles in Spark UI with Content-Security-Policy
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55527] Remove `labeler` GitHub Actions job
dongjoon
(spark) branch master updated (f2459419b481 -> 5593f92a9503)
dongjoon
(spark) branch master updated (b3e6d97a06f0 -> f2459419b481)
dongjoon
(spark-kubernetes-operator) branch main updated: [MINOR] Add Java version badge and reorder the badges
dongjoon
(spark) branch master updated: [SPARK-55385][CORE][SQL][FOLLOW-UP] `getAncestorWithSamePartitionSizes` should stop at checkpointed ancestor
dongjoon
(spark) branch master updated: [SPARK-55333][PYTHON] Enable `DateType` and `TimeType` in `convert_numpy`
dongjoon
(spark) branch master updated (c42f0de7b714 -> a252c5b520ab)
dongjoon
(spark) branch master updated (a252c5b520ab -> a0c6265891d6)
dongjoon
(spark) branch master updated: [SPARK-55516][BUILD] Upgrade `ap-loader` to 4.3-12
dongjoon
(spark) branch master updated: [SPARK-55515][BUILD] Upgrade `jjwt` to 0.13.0
dongjoon
(spark) branch master updated: [SPARK-55449][GEO][SQL] Enable WKB parsing and writing for Geography
wenchen
(spark) branch master updated (9c7da4cd1738 -> 4a5efe6ab629)
yangjie01
(spark) branch master updated (75424734ea11 -> 9c7da4cd1738)
yangjie01
(spark-connect-rust) branch spark-4 created (now 3171a90)
yangjie01
(spark) branch master updated: [SPARK-55416][SS][PYTHON] Streaming Python Data Source memory leak when end-offset is not updated
kabhwan
(spark) branch master updated (b11ab86bce62 -> f391539c7536)
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-55512] Update `README.md` with YuniKorn 1.8.0
dongjoon
(spark) branch master updated (4463ba6edd88 -> b11ab86bce62)
ruifengz
(spark) branch master updated: [SPARK-55511][K8S][DOCS][INFRA] Upgrade Volcano to 1.14.0
dongjoon
(spark) branch master updated: [SPARK-55508][BUILD] Upgrade `compress-lzf` to 1.2.0
dongjoon
(spark) branch master updated: [SPARK-55156][PS] Deal with `include_groups` for `groupby.apply`
ruifengz
(spark) branch master updated (ba291a619042 -> c4935cf18e81)
ruifengz
(spark) branch master updated: [SPARK-55509][K8S][DOCS] Update `YuniKorn` docs with `1.8.0`
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55497][BUILD][TESTS] Use `jupyterTestFramework` instead of `TestFrameworks.JUnit`
chengpan
(spark) branch branch-4.1 updated: [SPARK-55497][BUILD][TESTS] Use `jupyterTestFramework` instead of `TestFrameworks.JUnit`
chengpan
(spark) branch master updated: [SPARK-55497][BUILD][TESTS] Use `jupyterTestFramework` instead of `TestFrameworks.JUnit`
chengpan
(spark-website) branch asf-site updated: [SPARK-54784] Document the security policy on ml models (#676)
ruifengz
(spark-connect-swift) branch main updated: [MINOR] Add release version badge and link to `README.md`
dongjoon
(spark) branch master updated (ac9b01ecff6f -> 30825c3dae73)
ruifengz
svn commit: r82463 - release/spark/spark-4.0.1
dongjoon
svn commit: r82462 - dev/spark/v4.2.0-preview2-rc1-docs
dongjoon
svn commit: r82461 - dev/spark/v4.2.0-preview1-rc1-docs
dongjoon
svn commit: r82460 - dev/spark/v4.1.1-rc2-docs
dongjoon
svn commit: r82459 - dev/spark/v4.1.1-rc1-docs
dongjoon
svn commit: r82458 - dev/spark/v4.1.1-rc1-bin
dongjoon
(spark) branch master updated: [SPARK-55020][PYTHON][FOLLOW-UP] Move release into disable gc protection to prevent deadlock
gurwls223
(spark) branch master updated: [SPARK-54740][PYTHON] Start faulthandler early in daemon mode
gurwls223
(spark) branch master updated: [SPARK-54784][ML][DOCS] Document the security policy on ml models
dongjoon
(spark) branch branch-3.5 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch master updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch branch-4.1 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch master updated: [SPARK-55498][BUILD][TESTS] Upgrade `oracle-free` docker image to `23.26.1-slim`
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55499] Update `pi-with-eventlog` to generate multiple log files
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55486] Fix `StatusRecorder.patchAndStatusWithVersionLocked` not to log errors
dongjoon
(spark) branch master updated (5b80958c0b01 -> 17bdbde20acd)
ruifengz
(spark) branch master updated (59f3a16590d8 -> 5b80958c0b01)
ruifengz
(spark) branch master updated (15ca64ddc90c -> 59f3a16590d8)
chengpan
(spark) branch master updated (4e1cb88bba0c -> 15ca64ddc90c)
wenchen
(spark) branch master updated: [SPARK-54805][SS][PYTHON][FOLLOW-UP] Add test_tws_tester to modules
ruifengz
(spark) branch master updated (77980546e305 -> 2538cc832bdd)
gurwls223
(spark) branch branch-4.1 updated: [SPARK-52407][SQL][FOLLOW-UP] Remove Theta Sketch aggregation buffer re-wrapping
dtenedor
(spark) branch master updated (58fbd7f6b1b0 -> 6112a0bfc481)
dtenedor
(spark) branch master updated: [SPARK-54173][K8S][FOLLOWUP] Fix `spark.kubernetes.executor.podDeletionCost` config doc
dongjoon
(spark) branch master updated: [SPARK-55484][K8S] Simplify `KubernetesClusterSchedulerBackend` by reducing private class variables
dongjoon
(spark) branch master updated: [SPARK-55485][K8S] Add `Constants.POD_DELETION_COST` for reuse
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55411][SQL][4.0] SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
ptoth
(spark) branch master updated: [SPARK-55480][PYTHON] Remove all unused noqa for ruff
ruifengz
(spark) branch master updated: [MINOR] Remove unused `InMemoryRelation.convertToColumnarIfPossible` method
viirya
(spark) branch branch-4.1 updated (3b797bc169a0 -> d4d034699464)
ptoth
(spark) branch master updated: [MINOR][INFRA] Add `build_infra_images_cache` into `Build Pipeline Status`
ruifengz
(spark) branch branch-4.0 updated (9f7325c349de -> 964a3efa854d)
chengpan
[PR] [WIP] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55405][PYTHON][TESTS][FOLLOWUP] Skip PyArrow array cast tests when numpy < 2.0
ruifengz
(spark) branch master updated (6ced6b477625 -> 935e5cd146c8)
ruifengz
(spark) branch master updated: [SPARK-55475][BUILD] Disable Maven Parallel PUT
dongjoon
(spark) branch master updated: [SPARK-55020][PYTHON][FOLLOW-UP] Disable gc only when we communicate through gRPC for ExecutePlan
gurwls223
(spark) branch master updated: [SPARK-55473][PYTHON] Replace itertools.tee with chain in applyInPandasWithState
gurwls223
(spark) branch master updated: [SPARK-55395][SQL][FOLLOW-UP] Delete obsolete `withSequenceColumn`
ruifengz
(spark) branch master updated: [SPARK-55472][PS] Raise `AttributeError` from methods removed in pandas 3
ruifengz
(spark) branch master updated: [SPARK-55451][SQL] Cursors must start collecting results on OPEN, not first FETCH
gengliang
(spark-website) branch asf-site updated: improve build script and instruction (#675)
wenchen
(spark-kubernetes-operator) branch main updated: [SPARK-55470] Add a `Checkstyle` rule to enforce symbolic placeholder for logging
dongjoon
(spark) branch master updated: [SPARK-55411][SQL] SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
ptoth
(spark-website) branch asf-site updated: Add Cheng Pan to committers (#673)
chengpan
[PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
[PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
Re: [PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
Re: [PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55458][PYTHON][TESTS] Apply the new test pattern for newly added tests
ruifengz
(spark) branch master updated: [SPARK-55460][PYTHON] Remove E203 from ruff's ignore list
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-55468] Log `Built-in Spark Version`
dongjoon
(spark) branch master updated (7a0abe4f0859 -> 8a74912251e3)
ruifengz
[PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
(spark) branch master updated (238efa134ceb -> 7a0abe4f0859)
ruifengz
(spark) branch master updated: [SPARK-55459][PYTHON] Fix 3x performance regression in applyInPandas for large groups
ruifengz
(spark) branch master updated (d353b4706647 -> 3cf6e6b1020e)
dongjoon
(spark) branch master updated (deb09eec6176 -> d353b4706647)
ruifengz
(spark) branch master updated: [SPARK-55385][CORE][SQL][FOLLOWUP] Rename preservesDistribution to preservesPartitionSizes
ruifengz
(spark) branch master updated (378e74a9efe3 -> b4b8165b39d9)
gurwls223
(spark) branch master updated: [SPARK-55455][BUILD] Upgrade `RoaringBitmap` to 1.6.0
dongjoon
(spark) branch master updated: [SPARK-55402][SS] Move streamingSourceIdentifyingName from CatalogTable to DataSource
ashrigondekar
(spark-connect-swift) branch main updated: [SPARK-55454] Use `4.2.0-preview2` for Spark 4.2 integration tests
dongjoon
[PR] add Apache Iceberg to index.md; add alt text to logos [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55432][K8S] Support built-in K8s `ExecutorResizePlugin`
dongjoon
(spark) branch master updated (a6787fd8cc12 -> 6757f7877401)
ruifengz
(spark) branch master updated: [SPARK-55437][INFRA][R] Upgrade SparkR test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55229][SPARK-55231][PYTHON] Implement DataFrame.zipWithIndex in PySpark
ruifengz
(spark) branch master updated (2121a5a31d69 -> f3ad0f6db854)
yangjie01
(spark) branch master updated: [SPARK-55436][INFRA] Upgrade lint and doc test images to Ubuntu 24.04
gurwls223
(spark) branch master updated: [MINOR][INFRA] Use `lsb_release -a` to display the container os version
ruifengz
(spark) branch master updated: [SPARK-55366][SQL][PYTHON][FOLLOW-UP] Relax the duplicated field name check
ruifengz
(spark) branch master updated: [SPARK-55431][K8S] Set `resizePolicy` to `NotRequired` explicitly for executor pods
dongjoon
(spark) branch master updated: [SPARK-55408][PS] Handle unexpected keyword argument errors related to datetime with pandas 3
ruifengz
(spark) branch master updated (26384d7de53f -> 8d46ddb251b8)
ruifengz
(spark) branch master updated: [SPARK-55224][PYTHON][FOLLOWUP] Remove redundant `use_legacy_pandas_udf_conversion` condition in serializer setup
ruifengz
(spark) branch branch-3.5 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch branch-4.0 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch branch-4.1 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch master updated (ee58e0e17501 -> f6031fef94f3)
gurwls223
svn commit: r82371 - release/spark/spark-4.2.0-preview1
gurwls223
(spark) branch master updated: [SPARK-55433][INFRA] Remove labeler in GitHub Actions
gurwls223
svn commit: r82369 - dev/spark/v4.2.0-preview2-rc1-docs/_site release/spark/docs/4.2.0-preview2
gurwls223
svn commit: r82370 - dev/spark/v4.2.0-preview2-rc1-bin release/spark/spark-4.2.0-preview2
gurwls223
(spark) tag v4.2.0-preview2 created (now a2edb559299d)
gurwls223
(spark) branch master updated: [SPARK-54860][INFRA] Followup of the revert to set the permission correctly
gurwls223
(spark) branch master updated: Revert "[SPARK-54860][INFRA] Add JIRA Ticket Validating in GHA"
gurwls223
(spark) branch master updated: [SPARK-55429][K8S][TESTS] Improve `VolcanoTestsSuite` to use `Server-Side Apply` pattern
gurwls223
(spark) branch master updated: [SPARK-55424][PYTHON] Explicitly pass the series name in `convert_numpy`
gurwls223
(spark) branch master updated: [SPARK-55175][PYTHON][FOLLOW-UP] Remove unused `arrow_to_pandas` method
gurwls223
(spark) branch master updated: [SPARK-55414][PYTHON][INFRA] Upgrade Python 3.12 test images for classic-only and pandas 3 to Ubuntu 24.04
gurwls223
(spark) branch master updated: [SPARK-55358][PYTHON][INFRA][FOLLOW-UP] Do not apt-get install `python3-xxx`
gurwls223
(spark) branch master updated (d54498861119 -> 668b2c5860ed)
gurwls223
(spark) branch master updated (4c336897859c -> d54498861119)
gurwls223
(spark) branch master updated: [SPARK-55404][PYTHON] Always raise KeyboardInterrupt from SIGINT handler
gurwls223
(spark) branch master updated: [SPARK-55395][SQL] Disable RDD cache in `DataFrame.zipWithIndex`
gurwls223
(spark) branch master updated: [SPARK-55385][CORE][SQL] Mitigate the recomputation in `zipWithIndex`
gurwls223
(spark) branch master updated: [SPARK-55383][INFRA] Only send test report to codecov in coverage run
gurwls223
(spark) branch master updated: [SPARK-55413][PYTHON][INFRA] Upgrade Python minimum dep test images to Ubuntu 24.04
dongjoon
(spark) branch master updated (d1dbcdab1af9 -> e72ddacc568f)
dongjoon
(spark) branch master updated: [SPARK-55428][BUILD] Sync Netty Java options everywhere
dongjoon
(spark) branch master updated: [SPARK-55407][PYSPARK] Replace logger.warn with logger.warning
dongjoon
(spark) branch master updated: [SPARK-54881][SQL][FOLLOWUP] Extract simplifyNot method in BooleanSimplification
wenchen
(spark) branch branch-4.1 updated: [SPARK-55337][SS] Fix MemoryStream backward compatibility
wenchen
(spark) branch master updated: [SPARK-55337][SS] Fix MemoryStream backward compatibility
wenchen
(spark) branch pr-54140-update deleted (was 32afc45731d7)
dongjoon
(spark) branch master updated: [SPARK-55420][BUILD] Upgrade Netty to `4.2.10.Final`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55418] Add `create_spark_jira.py` script
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55417] Add `create_spark_jira.py` script
dongjoon
(spark) branch master updated (474e07efed0a -> a1c41a819f8d)
ruifengz
(spark) branch master updated (de345288830c -> 474e07efed0a)
ruifengz
(spark-connect-swift) branch main updated: [SPARK-55426] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55425] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-3.5 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-4.1 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch master updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch master updated: [SPARK-55410][K8S] Improve `SparkKubernetesDiagnosticsSetter` to use `patch` instead of `edit` API
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55422] Fix the default value of `readinessProbe.failureThreshold` to 1
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55421] Increase `livenessProbe.failureThreshold` to 3
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55419] Upgrade Netty to `4.2.10.Final`
dongjoon
(spark) branch master updated: [SPARK-55180][PYTHON][INFRA][FOLLOW-UP] Delete unused yml file
dongjoon
(spark) branch master updated: [MINOR][DOCS] Update Maven version and MAVEN_OPTS setting in `building-spark.md` docs
dongjoon
(spark) branch master updated: [SPARK-55401][PYTHON] Add retry logic and timeout handling to pyspark install download
yao
(spark) branch branch-4.0 updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch branch-4.1 updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch master updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch master updated: [SPARK-55394][PYTHON][INFRA] Upgrade Python 3.10 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55393][PYTHON][INFRA] Upgrade Python 3.11 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55392][PYTHON][INFRA] Upgrade Python 3.14 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55391][PYTHON][INFRA] Upgrade Python 3.13 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55399][K8S] Improve `KubernetesDriverEndpoint` to use `patch` instead of `edit` API
dongjoon
(spark) branch master updated: [SPARK-55304][SS][PYTHON] Introduce support of Admission Control and Trigger.AvailableNow in Python data source - streaming reader
kabhwan
(spark) branch master updated: [SPARK-55317][SQL] Add SequentialUnion logical plan node and planning rule
dtenedor
(spark) branch master updated: [SPARK-55131][SS] Change the default merge operator delimiter for RocksDB to empty string to concat without delimiter
kabhwan
(spark) branch pr-54140-update updated (76c8e0b10195 -> 32afc45731d7)
yao
(spark) 01/01: [SPARK-XXXXX][SQL] Add cost-based guard to CrossJoinArrayContainsToInnerJoin
yao
(spark) branch master updated: [SPARK-55334][PYTHON] Enable `TimestampType` and `TimestampNTZType` in `convert_numpy`
ruifengz
(spark) branch master updated (ee324696f916 -> ec29abb3033d)
ruifengz
(spark) branch master updated (861ba537250d -> ee324696f916)
ruifengz
(spark) branch master updated (f0d9f993fc3e -> 861ba537250d)
kabhwan
(spark) branch master updated: [SPARK-55386][INFRA] Run `Java 17/25` Maven install tests on PR build only
dongjoon
(spark) branch master updated: [SPARK-55376][PS] Make numeric_only argument in groupby functions accept only boolean with pandas 3
ruifengz
(spark) branch master updated: [SPARK-55382][CORE] Make `Executor` to log `Running Spark version`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55381] Use Spark `4.0.2` instead of `4.0.1`
dongjoon