Messages by Thread
-
[GitHub] [spark] LuciferYang opened a new pull request, #36571: [SPARK-39202][SQL] Introduce a `putByteArrays` method to `WritableColumnVector` to support setting multiple duplicate `byte[]`
GitBox
-
[GitHub] [spark] HyukjinKwon opened a new pull request, #36570: [SPARK-39096][SQL][FOLLOW-UP] Fix "MERGE INTO TABLE" test to pass with ANSI mode on
GitBox
-
[GitHub] [spark] github-actions[bot] commented on pull request #34359: [SPARK-36986][SQL] Improving schema filtering flexibility
GitBox
-
[GitHub] [spark] github-actions[bot] commented on pull request #35342: [SPARK-38043][SQL] Refactor FileBasedDataSourceSuite and add DataSourceSuite for each data source
GitBox
-
[GitHub] [spark] warrenzhu25 commented on a diff in pull request #35498: [SPARK-34777][UI] StagePage input/output size records not show when r…
GitBox
-
[GitHub] [spark] xinrong-databricks opened a new pull request, #36569: Implement `ignore_index` of `DataFrame.explode` and `DataFrame.drop_duplicates`
GitBox
-
[GitHub] [spark] eswardhinakaran-toast opened a new pull request, #36568: Analytics flavor spark
GitBox
-
[GitHub] [spark] dcoliversun opened a new pull request, #36567: [SPARK-39196][CORE][SQL][K8S] replace `getOrElse(null)` with `orNull`
GitBox
-
[GitHub] [spark] EnricoMi commented on pull request #36150: [SPARK-38864][SQL] Add melt / unpivot to Dataset
GitBox
-
[GitHub] [spark] AnywalkerGiser opened a new pull request, #36566: [SPARK-39176][PYSPARK] Fixed a problem with pyspark serializing pre-1970 datetime in windows
GitBox
-
[GitHub] [spark] AnywalkerGiser opened a new pull request, #36565: [SPARK-39176][PYSPARK] Fixed a problem with pyspark serializing pre-1970 datetime in windows
GitBox
-
[GitHub] [spark] AngersZhuuuu opened a new pull request, #36564: [WIP][SPARK-39195][SQL] Spark should use two step update of outputCommitCoordinator
GitBox
-
[GitHub] [spark] yaooqinn opened a new pull request, #36563: [SPARK-39194][SQL] Add a pre resolution builder for spark session extensions
GitBox
-
[GitHub] [spark] gengliangwang opened a new pull request, #36562: [SPARK-39193][SQL] Fasten Timestamp type inference of JSON/CSV data sources
GitBox
-
[GitHub] [spark] panbingkun opened a new pull request, #36561: [SPARK-37939][SQL] Use error classes in the parsing errors of properties
GitBox
-
[GitHub] [spark] zhengruifeng opened a new pull request, #36560: [SPARK-39192][PYTHON] make pandas-on-spark's kurt consistent with pandas
GitBox
-
[GitHub] [spark] AnywalkerGiser opened a new pull request, #36559: [SPARK-39176][PYSPARK] Fixed a problem with pyspark serializing pre-1970 datetime in windows
GitBox
-
[GitHub] [spark] MaxGekk opened a new pull request, #36558: [SPARK-39187][SQL][3.3] Remove `SparkIllegalStateException`
GitBox
-
[GitHub] [spark] cloud-fan commented on pull request #36121: [SPARK-38836][SQL] Improve the performance of ExpressionSet
GitBox
-
[GitHub] [spark] cloud-fan closed pull request #36121: [SPARK-38836][SQL] Improve the performance of ExpressionSet
GitBox
-
[GitHub] [spark] gengliangwang opened a new pull request, #36557: [SPARK-39190][SQL] Provide query context for decimal precision overflow error when WSCG is off
GitBox
-
[GitHub] [spark] AngersZhuuuu commented on pull request #36056: [SPARK-36571][SQL] Add an SQLOverwriteHadoopMapReduceCommitProtocol to support all SQL overwrite write data to staging dir
GitBox
-
[GitHub] [spark] AngersZhuuuu commented on pull request #35799: [SPARK-38498][STREAM] Support customized StreamingListener by configuration
GitBox
-
[GitHub] [spark] beliefer opened a new pull request, #36556: [SPARK-39162][SQL][3.3] Jdbc dialect should decide which function could be pushed down
GitBox
-
[GitHub] [spark] github-actions[bot] closed pull request #35357: [SPARK-21195][CORE] MetricSystem should pick up dynamically registered metrics in sources
GitBox
-
[GitHub] [spark] zhengruifeng opened a new pull request, #36555: [SPARK-39189][PYTHON] interpolate supports limit_area
GitBox
-
[GitHub] [spark] LuciferYang commented on pull request #36078: [SPARK-38814][BUILD][TESTS] Migrate Junit 4 to Junit 5
GitBox
-
[GitHub] [spark] zhengruifeng opened a new pull request, #36554: [SPARK-39186][PYTHON][FOLLOWUP] Improve the numerical stability of skewness
GitBox
-
[GitHub] [spark] github-actions[bot] commented on pull request #35357: [SPARK-21195][CORE] MetricSystem should pick up dynamically registered metrics in sources
GitBox
-
[GitHub] [spark] HyukjinKwon closed pull request #36267: [SPARK-38953][PYTHON][DOC] Document PySpark common exceptions / errors
GitBox
-
[GitHub] [spark] HyukjinKwon commented on pull request #36267: [SPARK-38953][PYTHON][DOC] Document PySpark common exceptions / errors
GitBox
-
[GitHub] [spark] MaxGekk opened a new pull request, #36553: [WIP][SQL] Improve errors related to casts
GitBox
-
[GitHub] [spark] wangyum opened a new pull request, #36552: [SPARK-38506][SQL] Push partial aggregation through join
GitBox
-
[GitHub] [spark] panbingkun opened a new pull request, #36551: [SPARK-38463][CORE] Use error classes in org.apache.spark.input
GitBox
-
[GitHub] [spark] MaxGekk opened a new pull request, #36550: [SPARK-39187][SQL] Remove `SparkIllegalStateException`
GitBox
-
[GitHub] [spark] zhengruifeng opened a new pull request, #36549: [SPARK-39186][PYTHON] make skew consistent with pandas
GitBox
-
[GitHub] [spark] panbingkun opened a new pull request, #36548: [SPARK-38470][SQL] Use error classes in org.apache.spark.partial
GitBox
-
[GitHub] [spark] xinrong-databricks opened a new pull request, #36547: Implement `skipna` parameter of `Groupby.all`
GitBox
-
[GitHub] [spark] bersprockets opened a new pull request, #36546: [SPARK-37544][SQL] Correct date arithmetic in sequences
GitBox
-
[GitHub] [spark] physinet opened a new pull request, #36545: [WIP][SPARK-39168] Use all values in a python list when inferring ArrayType schema
GitBox