Messages by Thread
-
[PR] feat: add JVM UDF framework for native execution [datafusion-comet]
via GitHub
-
[PR] fix: Correct the number of pruned/matched Parquet pages [datafusion]
via GitHub
-
[I] `EXPLAIN ANALYZE` returns a wrong number of pruned and matched Parquet pages [datafusion]
via GitHub
-
[PR] perf: Add `append_with` to string builders, use in `replace` [datafusion]
via GitHub
-
Re: [PR] feat: support factorial, pmod, and rint expressions [datafusion-comet]
via GitHub
-
Re: [I] Add encoding + compression metrics to columnar shuffle [datafusion-comet]
via GitHub
-
Re: [PR] bench: improve Iceberg TPC workflow and plan capture [datafusion-comet]
via GitHub
-
Re: [PR] feat: Use single spill file for multiple partitions in native shuffle [datafusion-comet]
via GitHub
-
[PR] feat: add support for url_encode, url_decode, and try_url_decode [datafusion-comet]
via GitHub
-
Re: [PR] fix(writer): spark 38811 insert alter table add columns [datafusion-comet]
via GitHub
-
Re: [PR] fix: insert overwrite directory native writer [datafusion-comet]
via GitHub
-
Re: [PR] fix: support literal sha2() with 'Unsupported argument types' [datafusion-comet]
via GitHub
-
Re: [PR] fix: Support scalar inputs for datetime expressions when constant folding is disabled [datafusion-comet]
via GitHub
-
Re: [PR] feat: Support Spark expression window_time [datafusion-comet]
via GitHub
-
Re: [PR] feat: Support Spark expression seconds_of_time [datafusion-comet]
via GitHub
-
Re: [PR] fix: Prevent String to TimestampNTZ cast from incorrectly adding UTC timezone metadata [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add native support for max_by and min_by [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add native support for mode fn [datafusion-comet]
via GitHub
-
[PR] docs: replace project logos with updated branding [datafusion-comet]
via GitHub
-
[I] Stats/interval propagation should degrade to unbounded intervals on internal errors, not fail the query [datafusion]
via GitHub
-
[PR] fix: reject disallowed type promotions in native_datafusion scan [datafusion-comet]
via GitHub
-
[PR] fix: coerce operand types in Interval mul/div/intersect/union/contains [datafusion]
via GitHub
-
[PR] fix: compilation issue after merge [datafusion-ballista]
via GitHub
-
Re: [I] [Shuffle] Support cache remote shuffle reader client in executor. [datafusion-ballista]
via GitHub
-
Re: [PR] chore: add favicon [datafusion-site]
via GitHub
-
Re: [I] [Incompatibility] Document array_repeat negative count handling [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add support for Spark Pi math expression [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add support for Spark Cbrt math expression [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add support for Spark Acosh, Asinh, Atanh math expressions [datafusion-comet]
via GitHub
-
Re: [PR] feat: Add support for Spark ToDegrees and ToRadians math expressions [datafusion-comet]
via GitHub
-
[PR] feat: Plumb Parquet virtual columns (row_number) through TableSchema and ParquetOpener [datafusion]
via GitHub
-
Re: [I] [EPIC] A collection of support for metadata columns in ListingTable [datafusion]
via GitHub
-
Re: [I] Skip defensive copy when unpacking dictionary arrays in UnpackOrClone mode [datafusion-comet]
via GitHub
-
Re: [I] Avoid unpacking dictionaries for inputs to SortExec [datafusion-comet]
via GitHub
-
[I] Preserve dictionary encoding through native expressions where possible [datafusion-comet]
via GitHub
-
Re: [I] Add with_virtual_columns to ParquetSource for reading virtual columns [datafusion]
via GitHub
-
[PR] fix: resolve Scala compiler warnings for auto-tupling and bare try [datafusion-comet]
via GitHub
-
[PR] Make Expr::alias and alias_qualified smarter by calling unalias [datafusion]
via GitHub
-
Re: [PR] Reduce cloning in LogicalPlanBuilder [datafusion]
via GitHub
-
[PR] test: skip flaky StateStoreSuite under Comet and disambiguate JDK matrix names [datafusion-comet]
via GitHub
-
Re: [PR] perf: reduce per-node allocations in to_native_metric_node [datafusion-comet]
via GitHub
-
Re: [I] Expr. simplification / rewrite: regex `.*foo.*` [datafusion]
via GitHub
-
Re: [PR] fix: UNIQUE constraint with NULLs incorrectly collapses GROUP BY groups [datafusion]
via GitHub
-
Re: [PR] Add configurable UNION DISTINCT to FILTER rewrite optimization [datafusion]
via GitHub
-
Re: [PR] Postgres regression 7b [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] Add MERGE INTO types to datafusion-expr [datafusion]
via GitHub
-
Re: [PR] ci: add a CI job that builds without the lockfile [datafusion]
via GitHub
-
Re: [PR] fix(substrait): normalize table names from Substrait NamedTable for Calcite interop [datafusion]
via GitHub
-
[PR] fix: JNI local reference cleanup in JVMClasses::with_env [datafusion-comet]
via GitHub
-
[I] Support higher-order array functions via JVM UDF bridge [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump ctor from 0.10.1 to 1.0.1 [datafusion]
via GitHub
-
[PR] chore(deps): bump the all-other-cargo-deps group with 2 updates [datafusion]
via GitHub
-
[PR] feat: implement array_exists with lambda support via JVM UDF bridge [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump the arrow-parquet group with 9 updates [datafusion]
via GitHub
-
[PR] chore(deps): bump github/codeql-action from 4.35.2 to 4.35.3 [datafusion]
via GitHub
-
[PR] chore(deps): bump taiki-e/install-action from 2.74.0 to 2.77.0 [datafusion]
via GitHub
-
Re: [PR] perf : experiment roaring bitmap for int32 anti and semi joins [datafusion]
via GitHub
-
[PR] fix: drop input plan early in `CoalescePartitionsExec` [datafusion]
via GitHub
-
Re: [PR] chore: Add existence (semi / anti ) benchmarks for hashjoinexec [datafusion]
via GitHub
-
[I] `CoalescePartitionsExec` delays cancellation of child operators [datafusion]
via GitHub
-
Re: [I] [EPIC] Make DataFusion the top of the ClickBench Parquet leaderboard [datafusion]
via GitHub
-
[PR] fix: support unhex on dictionary strings [datafusion-comet]
via GitHub
-
[PR] chore(deps): bump tokio from 1.52.1 to 1.52.2 [datafusion-ballista]
via GitHub
-
[PR] chore(deps): bump ctor from 0.12.0 to 1.0.1 [datafusion-ballista]
via GitHub
-
[PR] feat: implement retract_batch for array_agg sliding window support [datafusion]
via GitHub
-
[PR] Support '0' value for parse_capacity_limit() [datafusion]
via GitHub
-
[I] Spark SQL `maintenance` test fails intermittently [datafusion-comet]
via GitHub
-
Re: [PR] feat: Support Spark expression hours_of_time [datafusion-comet]
via GitHub
-
Re: [PR] Add `rust-required-checks` [datafusion]
via GitHub
-
Re: [PR] implement `preimage` for date_trunc [datafusion]
via GitHub
-
Re: [PR] OptimizeProjections: safely prune struct-only UNNEST when outputs are unused [datafusion]
via GitHub
-
Re: [PR] feat: Improve `partition_statistics()` for `AggregateExec` using `distinct_count` [datafusion]
via GitHub
-
[PR] test: add INT96 TimestampNTZ correctness tests [datafusion-comet]
via GitHub
-
[I] native_datafusion more permissive than Spark 3.x when reading Parquet TimestampNTZ columns [datafusion-comet]
via GitHub
-
[PR] feat: add array_normalize scalar function [datafusion]
via GitHub
-
[D] [datafusion-spark] Add physical implementations for functions that only have simplify() [datafusion]
via GitHub
-
[I] Native DataFusion scan silently returns wrong values reading INT96 as TimestampNTZ [datafusion-comet]
via GitHub
-
[PR] deps: Bump OpenDAL to 0.56.0 [datafusion-comet]
via GitHub
-
[PR] feat: support Parquet field ID matching in native_datafusion scan [datafusion-comet]
via GitHub
-
Re: [I] [native_datafusion] Add support for reading row index metadata columns [datafusion-comet]
via GitHub
-
Re: [I] `lower`, `upper` could be further optimized for ASCII-only inputs [datafusion]
via GitHub
-
Re: [I] Support dict encoded structs in `get_field` [datafusion]
via GitHub
-
[PR] feat: support AQE DPP broadcast reuse for Iceberg native scans [datafusion-comet]
via GitHub
-
Re: [I] Write a wikipedia article for Apache DataFusion [datafusion]
via GitHub
-
Re: [I] Unsupported aggregation mode PartialMerge [datafusion-comet]
via GitHub
-
Re: [PR] Add support for PostgreSQL's ORDER BY ... USING <operator> clause [datafusion-sqlparser-rs]
via GitHub
-
Re: [PR] test: extend SPARK-43402 plan-match to CometNativeScanExec and retag to #4042 [datafusion-comet]
via GitHub
-
[PR] fix: include per-column details in exportBatch row count mismatch error [datafusion-comet]
via GitHub
-
[PR] Map ProfileCredentialsProvider to profiel credential chain [datafusion-comet]
via GitHub
-
[I] Support AWS ProfileCredentialsProvider in native S3 object store [datafusion-comet]
via GitHub
-
[I] Number of rows in each column should be the same, but got [ArrayBuffer(8192, 0)] [datafusion-comet]
via GitHub
-
[PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
Re: [PR] proto: serialize dynamic filters on Sort, Aggregate, HashJoin plan nodes [datafusion]
via GitHub
-
[PR] docs: document Spark version labels in bug triage guide [datafusion-comet]
via GitHub
-
[PR] perf: coalesce batches before sending to distributor channels in RepartitionExec [datafusion]
via GitHub