commits
Thread
Date
Earlier messages
Messages by Thread
(spark) branch branch-3.5 updated (b1338f450441 -> 22d4d3de0856)
dongjoon
(spark) branch master updated (b4592c431153 -> 06f7ad2fbe8f)
dongjoon
(spark) branch branch-4.0 updated: [SPARK-53689][BUILD] Respect RELEASE_VERSION environment variable if already defined
gurwls223
(spark) branch branch-3.5 updated: [SPARK-53689][BUILD] Respect RELEASE_VERSION environment variable if already defined
gurwls223
(spark-connect-swift) branch main updated: [SPARK-53685] Upgrade `gRPC Swift NIO Transport` to 2.1.1
dongjoon
(spark) branch master updated (2a9999fe5bf0 -> fa9e787db7b3)
allisonwang
(spark) branch master updated: [SPARK-53671][PYTHON] Exclude 0-args from `@udf` eval type inference
ruifengz
svn commit: r79485 - dev/spark/v3.5.7-rc1-bin release/spark/spark-3.5.7
dongjoon
(spark) tag v3.5.7 created (now ed00d046951a)
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53679] Fix typos in Spark Kubernetes Operator documentation
dongjoon
[PR] Fix TOC of release process documentation [spark-website]
via GitHub
Re: [PR] Fix TOC of release process documentation [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53591][SDP] Simplify Pipeline Spec Pattern Glob Matching
sandy
(spark) branch master updated (b6993cbbcaa0 -> a13187c2fa46)
dongjoon
(spark) branch master updated: [SPARK-53516][SDP] Fix `spark.api.mode` arg process in SparkPipelines
sandy
(spark) branch master updated: [SPARK-53673][CONNECT][TESTS] Fix a flaky test failure in `SparkSessionE2ESuite - interrupt tag` caused by the usage of `ForkJoinPool`
sarutak
(spark) branch master updated (1e7169ed8c1d -> 33196fe1b725)
ruifengz
(spark) branch master updated: [SPARK-53629][SQL] Implement type widening for MERGE INTO WITH SCHEMA EVOLUTION
wenchen
(spark) branch master updated: [SPARK-47110][INFRA] Reenble AmmoniteTest tests in Maven builds
dongjoon
(spark) branch master updated: [SPARK-53651][SDP] Add support for persistent views in pipelines
wenchen
(spark-kubernetes-operator) branch main updated: [SPARK-53670] Use `Gradle Java Toolchain`
dongjoon
svn commit: r79464 - dev/spark/v4.1.0-preview1-rc1-bin
gurwls223
(spark) branch master updated (c37ab6e573d7 -> 69031c918167)
dongjoon
(spark-website) branch dependabot/bundler/rexml-3.4.2 deleted (was 120307d8f2)
github-bot
(spark-website) branch asf-site updated: Update `rexml` to version 3.4.4 (#635)
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53664] Update `updateResponseMetrics` to handle valid responses only
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53663] Add `spark-history-server-preview.yaml`
dongjoon
(spark) branch master updated: [SPARK-53653][DOC] Update `rexml` gem version to 3.4.4
dongjoon
(spark) branch master updated: [SPARK-53655][SQL][TESTS] Fix the intention of 'read parquet footers in parallel' test
dongjoon
(spark) branch master updated (984e16b60862 -> ed2692fe885c)
ruifengz
(spark) branch branch-4.0 updated: [SPARK-53553][CONNECT][4.0] Fix handling of null values in LiteralValueProtoConverter
wenchen
(spark) branch master updated: [SPARK-53657][PYTHON][TESTS] Enable doctests for `GroupedData.agg`
ruifengz
(spark) branch master updated: [SPARK-53592][PYTHON][TESTS][FOLLOW-UP] Remove unused config in the parity test
ruifengz
(spark) branch master updated: [SPARK-53429][PYTHON] Support Direct Passthrough Partitioning in the PySpark Dataframe API
ruifengz
(spark) branch master updated: [SPARK-53641][DOCS] Add PARTITION BY support in Arrow Python UDTF docs
ruifengz
[PR] Update `rexml` to version 3.4.4 and bundle with Ruby 2.6.3 [spark-website]
via GitHub
Re: [PR] Update `rexml` to version 3.4.4 and bundle with Ruby 2.6.3 [spark-website]
via GitHub
Re: [PR] Update `rexml` to version 3.4.4 and bundle with Ruby 2.6.3 [spark-website]
via GitHub
Re: [PR] Update `rexml` to version 3.4.4 [spark-website]
via GitHub
Re: [PR] Update `rexml` to version 3.4.4 [spark-website]
via GitHub
Re: [PR] Update `rexml` to version 3.4.4 [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53522][SDP][TEST] Simplify PipelineTest
dongjoon
(spark) branch master updated: [SPARK-52601][SQL] Support primitive types in TransformingEncoder
hvanhovell
(spark) branch master updated (feaf659d0cc0 -> 6dd4001c29af)
ruifengz
(spark) branch master updated: [SPARK-53632][PYTHON][DOCS][TESTS] Reenable doctest for `DataFrame.pandas_api`
ruifengz
(spark) branch master updated: [SPARK-53479][PS] Align `==` behavior with pandas when comparing against scalar under ANSI
xinrong
[PR] Update `GraphX` menu according to the deprecation [spark-website]
via GitHub
Re: [PR] Update `GraphX` menu according to the deprecation [spark-website]
via GitHub
Re: [PR] Update `GraphX` menu according to the deprecation [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53592][PYTHON] Make `@udf` support vectorized UDF
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-53644] Upgrade `Gradle` to 9.1.0
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53650] Make `build.gradle` and `libs.versions.toml` up-to-date with `okhttp3` usage
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53646] Improve `KubernetesMetricsInterceptorTest` to verify `http.request` metric
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-52997] Fixes wrong worker assignment if multiple clusters are deployed to the same namespace
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53647][SPARK-53648] Use `VertxHttpClientFactory` and `io.fabric8.kubernetes.client.http.Interceptor`
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53649] Remove `logging-interceptor` dependency
dongjoon
(spark-website) branch asf-site updated: Update gpg key generation
ptoth
(spark) branch master updated: [SPARK-53578][CONNECT] Simplify data type handling in LiteralValueProtoConverter
wenchen
(spark) branch master updated: [SPARK-53233][SQL][FOLLOWUP] Add compatibility class/object for org.apache.spark.sql.execution.streaming
wenchen
(spark) branch master updated (db13a38e565b -> 4f10262391f8)
wenchen
(spark) branch master updated: [SPARK-53625][SS] Propagate metadata columns through projections to address ApplyCharTypePadding incompatibility
kabhwan
(spark) branch branch-4.0 updated: [SPARK-53625][SS] Propagate metadata columns through projections to address ApplyCharTypePadding incompatibility
kabhwan
(spark) branch master updated (fb46424cd112 -> 552effccce9a)
dongjoon
(spark) branch master updated: [SPARK-53590][BUILD] Add `huaweicloud-provided` profile
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53639] Use `spark` consistently for `release-name` of Helm installation
dongjoon
(spark) branch master updated: [SPARK-53626][DOCS] Add invalid mixed-type operations to ANSI migration guide
dongjoon
(spark) branch master updated (49a3c132e1cb -> 263979277198)
dongjoon
[PR] Update GPG key generation [spark-website]
via GitHub
Re: [PR] Update GPG key generation [spark-website]
via GitHub
Re: [PR] Update gpg key generation [spark-website]
via GitHub
Re: [PR] Update gpg key generation [spark-website]
via GitHub
Re: [PR] Update gpg key generation [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53598][SQL] Check the existence of numParts before reading large table property
wenchen
(spark) branch branch-4.0 updated: [SPARK-53598][SQL] Check the existence of numParts before reading large table property
wenchen
(spark) branch branch-4.0 updated: [SPARK-53581][CORE] Fix potential thread-safety issue for mapTaskIds.add()
dongjoon
(spark) branch master updated (1ec647ea17d8 -> 1795306078e0)
ruifengz
(spark) branch master updated: [SPARK-53630][PYTHON][DOCS][TESTS] Reenable doctest for `Dataframe.freqItems`
ruifengz
(spark-website) branch asf-site updated: Update `.asf.yaml` with new `README.md` link (#630)
dongjoon
(spark) branch master updated: [SPARK-53543][SQL][TEST-ONLY] Add more test coverage for `Window` on top of `Aggregate`
wenchen
(spark) branch master updated: [MINOR][TESTS] Restore classic-only python tests
ruifengz
(spark) branch dependabot/bundler/docs/rexml-3.4.2 deleted (was 37d4ff0a4f63)
github-bot
(spark) branch master updated: [SPARK-53355][PYTHON][SQL] fix numpy 1.x repr in type tests
ruifengz
(spark) branch master updated (3080e61e5ff0 -> 551e7f2e1e82)
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-53588] Upgrade `kubernetes-client` to 7.4.0
dongjoon
(spark) branch master updated: [SPARK-53526][SQL] Enable SQL scripting by default
wenchen
(spark) branch branch-4.0 updated: [SPARK-52601][SQL][4.0] Support primitive types in TransformingEncoder
dongjoon
(spark) branch master updated (010d36f21940 -> 3080e61e5ff0)
allisonwang
(spark-kubernetes-operator) branch main updated: [SPARK-53628] Upgrade `PMD/SpotBugs` and enable checking on Java 25
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53627] Update docs to recommend K8s 1.32+
dongjoon
(spark) branch master updated (f49047130fdb -> 010d36f21940)
gurwls223
(spark) branch master updated (3990b0f1d983 -> f49047130fdb)
gengliang
(spark) branch master updated (87a71fabb097 -> 3990b0f1d983)
dongjoon
svn commit: r79378 - in dev/spark/v3.5.7-rc1-docs: . _site _site/api _site/api/R _site/api/R/articles _site/api/R/deps _site/api/R/deps/bootstrap-5.3.1 _site/api/R/deps/bootstrap-toc-1.0.1 _site/api/R/deps/clipboard.js-2.0.11 _site/api/R/deps/font-awes...
ptoth
(spark-kubernetes-operator) branch main updated: [SPARK-53607] Support Java 25
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53624] Use `bitnamisecure/kubectl:latest` for Helm Chart testing
dongjoon
(spark) branch branch-4.0 updated (2d2a0d95a69f -> 82351703526b)
dongjoon
svn commit: r79376 - dev/spark/v3.5.7-rc1-bin
ptoth
(spark) tag v3.5.7-rc1 created (now ed00d046951a)
ptoth
(spark) 01/01: Preparing Spark release v3.5.7-rc1
ptoth
(spark) branch branch-3.5 updated (9c325e421a37 -> 7468ecf5517a)
ptoth
(spark) 01/01: Preparing development version 3.5.8-SNAPSHOT
ptoth
(spark) branch master updated (1817e676f1b7 -> 8edc7685b971)
dongjoon
(spark) branch master updated (adef6f30fe34 -> 2f305b6817bc)
wenchen
svn commit: r79375 - release/spark
dongjoon
svn commit: r79374 - dev/spark
ptoth
(spark) branch dependabot/bundler/docs/rexml-3.4.2 created (now 37d4ff0a4f63)
github-bot
(spark-website) branch dependabot/bundler/rexml-3.4.2 created (now 120307d8f2)
github-bot
[PR] Bump rexml from 3.4.1 to 3.4.2 [spark-website]
via GitHub
svn commit: r79372 - release/spark
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53617] Use `zulu` Java distribution in GitHub Action jobs
dongjoon
(spark) branch master updated: [SPARK-53438][CONNECT][SQL] Use CatalystConverter in LiteralExpressionProtoConverter
wenchen
(spark) branch master updated: [SPARK-53372][SDP] SDP End to End Testing Suite
wenchen
(spark) branch master updated: [SPARK-53600][SQL] Revise `SessionHolder` last access time log message
dongjoon
svn commit: r79370 - dev/spark
ptoth
[PR] [DOC] Add ts-spark-connector to Spark Connect index and third-party projects [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53606][DOCS] Fix MapInPandas/MapInArrow examples with barrier
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-53613] Upgrade `google-java-format` to 1.28.0 to support Java 25
dongjoon
[PR] added solace spark connector to third party projects [spark-website]
via GitHub
Re: [PR] added solace spark connector to third party projects [spark-website]
via GitHub
(spark) branch master updated: [SPARK-53546][TESTS][FOLLOW-UP] Fix nested array schema evolution and style for InMemoryBaseTable
wenchen
(spark) branch master updated: [SPARK-52991][SQL][FOLLOW-UP] Revise `MergeIntoTable` to use `lazy val` and add a new test
dongjoon
(spark) branch master updated: [SPARK-53604][INFRA] Temporarily increase PySpark job execution time to 150 minutes
dongjoon
(spark) branch master updated (5d1ad6187733 -> aaf9308307cc)
dongjoon
(spark) branch master updated: [SPARK-53603][BUILD] Upgrade Checkstyle to 11.0.1
dongjoon
(spark) branch master updated: [SPARK-53594][PYTHON] Make arrow UDF respect user-specified eval type
ruifengz
(spark) branch master updated: [SPARK-53581][CORE] Fix potential thread-safety issue for mapTaskIds.add()
dongjoon
(spark) branch master updated: [SPARK-53599][BUILD] Upgrade `Netty` to 4.1.127.Final
dongjoon
(spark) branch master updated: [SPARK-53602][PYTHON] Profile dump improvement and profiler doc fix
gurwls223
(spark) branch branch-4.0 updated: [SPARK-53560][SS][SQL] Crash looping when retrying uncommitted batch in Kafka source and AvailableNow trigger
ashrigondekar
(spark) branch branch-3.5 updated: [SPARK-53560][SS][SQL] Crash looping when retrying uncommitted batch in Kafka source and AvailableNow trigger
ashrigondekar
(spark) branch branch-3.5 updated: [SPARK-53581][CORE] Fix potential thread-safety issue for mapTaskIds.add()
dongjoon
(spark) branch master updated: [SPARK-53601][INFRA] Use Java 25 instead of 25-ea
dongjoon
(spark) branch master updated: [MINOR][PYTHON][DOCS] Correct the examples of `toPandas` and `toArrow`
dongjoon
(spark) branch master updated: [SPARK-53559][SQL][CATALYST] Fix HLL sketch updates to use raw collation key bytes
dtenedor
(spark) branch master updated: Fix: SparkML-connect can't load SparkML (legacy mode) saved model
weichenxu123
(spark) branch master updated: [SPARK-53582][SQL] Extend `isExtractable` so it can be applied on `UnresolvedExtractValue`
wenchen
(spark) branch master updated: [MINOR][PYTHON][DOCS] Update the doctests to check the default column names
ruifengz
(spark-connect-go) branch dependabot/go_modules/google.golang.org/protobuf-1.36.9 created (now 668a33a)
github-bot
(spark) branch master updated: [SPARK-53584][PYTHON] Improve process_column_param validation and column parameter docstring
gurwls223
(spark) branch master updated (8c422f974a68 -> 10b27f3a5ad9)
gurwls223
(spark) branch branch-3.5 updated: [SPARK-53577][DOCS] Fix Scaladoc source links for java sources
yao
(spark) branch branch-4.0 updated: [SPARK-53577][DOCS] Fix Scaladoc source links for java sources
yao
(spark) branch master updated (10b27f3a5ad9 -> 6c9e7503f3ac)
dongjoon
(spark) branch master updated (10c634fe8cc4 -> 2844705f4885)
wenchen
(spark) branch branch-4.0 updated: [SPARK-53539][INFRA][4.0] Add `libwebp-dev` to recover `spark-rm/Dockerfile` building
dongjoon
(spark-connect-go) branch dependabot/go_modules/google.golang.org/protobuf-1.36.8 deleted (was 003249b)
github-bot
(spark-connect-go) branch dependabot/go_modules/google.golang.org/grpc-1.75.1 created (now 4f9c24b)
github-bot
(spark) branch master updated: [SPARK-53572][SQL] Avoid throwing from ExtractValue.isExtractable
wenchen
(spark) branch master updated: [SPARK-53574] Fix AnalysisContext being wiped during nested plan resolution
wenchen
(spark) branch master updated: [SPARK-53361][SS][1/2] Optimizing JVM–Python Communication in TWS by Grouping Multiple Keys into One Arrow Batch
kabhwan
(spark) branch master updated (ce94cb3132f9 -> fbdad297f542)
wenchen
(spark) branch master updated (9bd844b87c84 -> ce94cb3132f9)
wenchen
(spark) branch master updated: [SPARK-53558][SQL] Show fully qualified table name including the catalog name in the exception message when the table is not found
wenchen
(spark) branch branch-3.5 updated: [SPARK-53518][SQL][3.5] No truncation for catalogString of User Defined Type
yao
(spark) branch branch-4.0 updated: [SPARK-53524][CONNECT][SQL][4.0] Fix temporal value conversion in LiteralValueProtoConverter
wenchen
(spark) branch master updated: [SPARK-53553][CONNECT] Fix handling of null values in LiteralValueProtoConverter
wenchen
(spark) branch master updated: [SPARK-43579][PYTHON] optim: Cache the converter between Arrow and pandas for reuse
gurwls223
(spark) branch master updated: [SPARK-53568][CONNECT][PYTHON] Fix several small bugs in Spark Connect Python client error handling logic
gurwls223
(spark) branch master updated: [SPARK-53563][PS] Optimize: sql_processor by avoiding inefficient string concatenation
gurwls223
(spark) branch master updated: [SPARK-53182][PYTHON][DOCS] Fix broken and missing links in PySpark DataFrames user guide
gurwls223
(spark) branch master updated: [SPARK-53544][PYTHON] Support complex types on observations
gurwls223
(spark-connect-swift) branch main updated: [SPARK-53571] Add `integration-test-mac-spark4-iceberg` GitHub Action job
dongjoon
(spark) branch master updated: [SPARK-53521] Refactor Star expression
wenchen
(spark) branch master updated: [SPARK-53537][CORE] Adding Support for Parsing CONTINUE HANDLER
wenchen
(spark) branch master updated: [SPARK-53444][SQL] Rework execute immediate
wenchen
(spark-connect-swift) branch main updated: [SPARK-53570] Update `integration-test-token` to use Spark `4.1.0-preview1`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-53569] Use `Iceberg` 1.10 for `Spark 3`-based Iceberg integration test
dongjoon
(spark) branch master updated: [SPARK-53523][SQL] Named parameters respect `spark.sql.caseSensitive`
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-53545] Enforce style check at project level for operator
ptoth
(spark) branch master updated (19735de5b46e -> f210e979f017)
dongjoon
(spark) branch master updated: [SPARK-53550][SQL][FOLLOWUP] Union output partitioning should compare canonicalized attributes
viirya
(spark) branch master updated: [SPARK-53491][SS] Fix exponential formatting of inputRowsPerSecond and processedRowsPerSecond in progress metrics JSON
kabhwan
[PR] Update `.asf.yaml` with new `README.md` link [spark-website]
via GitHub
Re: [PR] Update `.asf.yaml` with new `README.md` link [spark-website]
via GitHub
Re: [PR] Update `.asf.yaml` with new `README.md` link [spark-website]
via GitHub
Re: [PR] Update `.asf.yaml` with new `README.md` link [spark-website]
via GitHub
(spark) branch master updated: [SPARK-52449][CONNECT][PYTHON][ML] Make datatypes for Expression.Literal.Map/Array optional
ruifengz
(spark) branch branch-4.0 updated (b92662194f32 -> aaa7fb3a675c)
dongjoon
(spark) branch master updated: [SPARK-53399][SQL] Merge Python UDFs
ptoth
(spark) branch master updated (29626507fa02 -> faa1aaa64f49)
dongjoon
(spark) branch master updated (07d987a0cc6e -> 1dab449da5c8)
dongjoon
(spark) branch master updated (faa1aaa64f49 -> 07d987a0cc6e)
dongjoon
(spark) branch master updated (f465eca7e954 -> 29626507fa02)
yao
(spark) branch master updated: [SPARK-53561][SS] Catch Interruption Exception in TransformWithStateInPySparkStateServer during outputStream.flush to avoid the worker crash
ashrigondekar
(spark) branch branch-4.0 updated: [SPARK-53538][SQL][4.0] `ExpandExec` should initialize the unsafe projections
dongjoon
(spark) branch branch-3.5 updated: [SPARK-53557][INFRA] Reduce automated vote email deadline from 4 days to 73 hours
dongjoon
(spark) branch branch-4.0 updated: [SPARK-53557][INFRA] Reduce automated vote email deadline from 4 days to 73 hours
dongjoon
(spark) branch master updated: [SPARK-53557][INFRA] Reduce automated vote email deadline from 4 days to 73 hours
dongjoon
(spark) branch master updated (b9848ac61a71 -> 4575b1da9e1d)
ashrigondekar
(spark) branch master updated (bb41e19ae9ee -> f90333d109ba)
ashrigondekar
(spark) branch master updated: [SPARK-53525][CONNECT] Spark Connect ArrowBatch Result Chunking
hvanhovell
(spark) branch master updated (d0177795bbbe -> 1817e676f1b7)
wenchen
(spark) branch branch-4.0 updated: [SPARK-53434][SQL][4.0] ColumnarRow's get should also check isNullAt
wenchen
(spark) branch master updated: [MINOR][PYTHON][TESTS] Skip some tests if numpy not installed
ruifengz
(spark) branch master updated: [SPARK-53512][SQL] Better unification of DSv2 PushDownUtils
gengliang
(spark) branch master updated: [SPARK-52238][SDP] Rename Pipeline Spec Field "definitions" to 'libraries'
wenchen
(spark) branch master updated: [SPARK-53029][FOLLOWUP] Update PyArrow StructType usage to avoid .names attribute for compatibility with PyArrow
gurwls223
(spark) branch master updated: [SPARK-53541][K8S][INFRA] Update K8s IT CI to use K8s 1.34
yangjie01
(spark) branch master updated: [SPARK-53531][CORE] Better error message for HadoopRDD.getInputFormat
yangjie01
(spark) branch branch-3.5 updated (195d81b42142 -> 7ab7b7cde154)
yangjie01
Earlier messages