(incubator-gluten) branch main updated: [GLUTEN-6887][VL] Daily Update Velox Version (2026_03_08) (#11717)

yuanzhou Mon, 09 Mar 2026 01:43:42 -0700

This is an automated email from the ASF dual-hosted git repository.

yuanzhou pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/incubator-gluten.git



The following commit(s) were added to refs/heads/main by this push:
     new fb7ab0e46f [GLUTEN-6887][VL] Daily Update Velox Version (2026_03_08) 
(#11717)
fb7ab0e46f is described below

commit fb7ab0e46fae582ca26c137658afb39228771470
Author: Gluten Performance Bot 
<[email protected]>
AuthorDate: Mon Mar 9 08:43:30 2026 +0000

    [GLUTEN-6887][VL] Daily Update Velox Version (2026_03_08) (#11717)
    
    * [GLUTEN-6887][VL] Daily Update Velox Version (dft-2026_03_08)
    
    Upstream Velox's New Commits:
    d085d37b6 by Ke Wang, feat: Add hive footer speculative IO size config 
(#16641)
    bb7b75c93 by Yedidya Feldblum, refactor: Use explicit types in 
Cursor::write (#16585)
    1367a7516 by Zac Wen, misc: Fix typo `duckDbVefifySql` in test parameter 
(#16668)
    2dff55a1b by Zoltan Arnold Nagy, feat(cudf): Add varchar support to cuDF 
min/max aggregation registry (#16651)
    ba2f8a491 by Chengcheng Jin, fix(cudf): Fix Cudf spark aggregation function 
registeration validation (#16614)
    1df51f9df by Raymond Lin, feat(connector): Use named constants for 
fileReadOps table identity keys in SplitReader (#16656)
    3beb70d97 by Raymond Lin, feat(connector): Add dbName field to 
HiveTableHandle and inject table identity into fileReadOps (#16647)
    ade18bd8c by Shruti Shivakumar, fix(cudf): Set functionEngine to spark in 
ExpressionEvaluatorSelection test (#16666)
    fe8703d84 by Jimmy Lu, refactor: Remove VELOX_ENABLE_BACKWARD_COMPATIBILITY 
shim for filterPushdownEnabled in HiveTableHandle (#16665)
    bf98064e1 by Pratik Pugalia, build: Disable build-metrics workflow due to 
defunct conbench service (#16648)
    24ac0df5d by Pedro Eugenio Rocha Pedreira, test: Support null map rows in 
createMapOfArraysVector() (#16657)
    5bf47a4a5 by Kent Yao, feat(spark): Add index lambda support for Spark 
filter function (#16334)
    a9c30b5d7 by Amit Dutta, fix: Fix crashes, memory leaks, and format string 
bugs in NumericHistogramAggregate (#16640)
    3b263a79f by Han Yan, refactor: Refactor VectorSerde named serde API from 
Kind to string (#16558)
    c3061087d by hikey-dj, feat(cuDF): Add GPU batch concatenation for small 
cuDF batches (#16201)
    85f8d00c7 by Yizhou Yang, feat: Add Spark approx_percentile aggregate 
function (#16320)
    e4e6dc48c by Simon Krueger, build: Move StringCore include and 
computeAndSetIsAscii to cpp (#16638)
    a6f53d492 by Shruti Shivakumar, feat(cudf): Apply stream-related fixes to 
cuDF (#16572)
    e2a7f7dff by Pramod Satya, feat: Add variant comparison with null handling 
mode (#15726)
    d807c6e60 by Wangyang Guo, perf(dwio): Avoid redundant outputBuffer 
clearing (#15998)
    48d371d8a by Jimmy Lu, feat: Column extraction pushdown design proposal 
(#16627)
    885364593 by Ke Wang, feat: Add option to collect column decoding time for 
selective readers (#16635)
    7bc1be16a by n0r0shi, feat: Support decimal type for the Spark checked_add 
and checked_subtract functions (#16302)
    0c2b4c488 by Matt Gara, fix(cudf): Fix stream synchronization issue with 
CudfTopN (#16431)
    f2c269936 by Jimmy Lu, fix: Fix build failure caused by merge conflict 
(#16639)
    f6b7e8372 by lingbin, feat: Expose 'runningIsBlockedWallNanos' operator 
runtime metric (#16021)
    4d185d130 by Chengcheng Jin, feat(cudf): Register the functions of specific 
engine separately (#16524)
    f063ee7d4 by Jialiang Tan, refactor: Rename join terminology to more 
generic terms in HiveIndexReader/HiveIndexSource (#16631)
    6cea60a45 by Jimmy Lu, refactor: Remove deprecated filterPushdownEnabled 
from HiveTableHandle (#16593)
    6585e3575 by Andrii Rosa, fix: Sanitize file name parts when generating 
file name in HiveDataSink (#16603)
    6bbd589e5 by Shakyan Kushwaha, feat: Add current_time Presto function 
(#15603)
    23b491ec0 by Kent Yao, feat(sparksql): Add Spark-specific sequence function 
for integer types (#16339)
    13656603a by wuyuhao28, perf: Fix readRegions prefetch incorrectly 
submitting non-prefetch loads (#16574)
    2042c2a18 by Ke Wang, feat: Add column type info to decompression metrics 
for dwrf reader (#16602)
    c34853c59 by Krishna Pai, fix: Make document publishing job more robust 
(#16619)
    3a20d75a1 by Chengcheng Jin, feat(spark): Respect Spark bloom_filter_agg 
max num items (#16526)
    0f070c60b by Ke Wang, fix: Make cacheable default to false in FileIoContext 
and BufferedInput (#16621)
    e83521455 by Xiaoxuan Meng, fix: Add IoStatistics tracking for 
findCachedRegion and use emplace_back in BufferedInput (#16604)
    6b362a5dd by Xiaoxuan Meng, feat: Reland D94871321: [velox] feat: Add cache 
lookup and zero-copy APIs to BufferedInput (#16606)
    2e0e810b0 by Suryadev Sahadevan Rajesh, refactor(encoding): Merge 
EncodingLayoutCapture into EncodingLayout (#16582)
    d525b0d7b by Rohit Jain, refactor: Migrate remaining string constants to 
string_view (#16455)
    a8548d60d by zhli1142015, perf(expr): Remove batch size limit for 
FlatNoNulls fast path (#16607)
    e4a431e01 by Ping Liu, fix(parquet): Max target file size not working 
(#16389)
    a4a0ddb1d by Patrick Sullivan, refactor(geo): Use envelope checks to skip 
deserialization (#16368)
    ee9f5395a by Peter Enescu, feat: Add template information to 
createEmptyLike (#16453)
    6b9604173 by infvg, test: Revamp Spark cast tests to support ANSI on/off 
differentiation (#16058)
    d64d78858 by David Goode, test(operator): Add velox mixedunion hive 
connector test (#16449)
    4514fee81 by Pratik Pugalia, feat: Add IPADDRESS support for Fuzzers 
(#16020)
    1a3478f06 by Pratik Pugalia, feat: Add HUGEINT key support to 
MapUnionSumAggregate (#16196)
    296571388 by Pratik Pugalia, feat: Add IPADDRESS support to Arbitrary 
aggregate function (#16195)
    fb5b29684 by Ping Liu, fix: Iceberg positional delete compare base64 
encoded value with raw value (#16592)
    
    Signed-off-by: glutenperfbot <[email protected]>
    
    * fix
    
    Signed-off-by: Yuan <[email protected]>
    
    * fix
    
    Signed-off-by: Yuan <[email protected]>
    
    * fix
    
    Signed-off-by: Yuan <[email protected]>
    
    * fix
    
    Signed-off-by: Yuan <[email protected]>
    
    * fix
    
    Signed-off-by: Yuan <[email protected]>
    
    ---------
    
    Signed-off-by: glutenperfbot <[email protected]>
    Signed-off-by: Yuan <[email protected]>
    Co-authored-by: glutenperfbot <[email protected]>
    Co-authored-by: Yuan <[email protected]>
---
 cpp/velox/compute/VeloxBackend.cc           | 2 +-
 cpp/velox/shuffle/VeloxShuffleReader.cc     | 2 +-
 cpp/velox/substrait/SubstraitToVeloxPlan.cc | 4 +---
 cpp/velox/utils/ConfigExtractor.cc          | 4 +---
 ep/build-velox/src/get-velox.sh             | 4 ++--
 5 files changed, 6 insertions(+), 10 deletions(-)

diff --git a/cpp/velox/compute/VeloxBackend.cc 
b/cpp/velox/compute/VeloxBackend.cc
index 45a64908e1..de9e9385f8 100644
--- a/cpp/velox/compute/VeloxBackend.cc
+++ b/cpp/velox/compute/VeloxBackend.cc
@@ -201,7 +201,7 @@ void VeloxBackend::init(
     // serde, for spill
     
facebook::velox::serializer::presto::PrestoVectorSerde::registerVectorSerde();
   }
-  if 
(!isRegisteredNamedVectorSerde(facebook::velox::VectorSerde::Kind::kPresto)) {
+  if (!isRegisteredNamedVectorSerde("Presto")) {
     // RSS shuffle serde.
     
facebook::velox::serializer::presto::PrestoVectorSerde::registerNamedVectorSerde();
   }
diff --git a/cpp/velox/shuffle/VeloxShuffleReader.cc 
b/cpp/velox/shuffle/VeloxShuffleReader.cc
index bfe11ef3e9..e5435aee52 100644
--- a/cpp/velox/shuffle/VeloxShuffleReader.cc
+++ b/cpp/velox/shuffle/VeloxShuffleReader.cc
@@ -732,7 +732,7 @@ 
VeloxRssSortShuffleReaderDeserializer::VeloxRssSortShuffleReaderDeserializer(
       rowType_(rowType),
       batchSize_(batchSize),
       veloxCompressionType_(veloxCompressionType),
-      serde_(getNamedVectorSerde(facebook::velox::VectorSerde::Kind::kPresto)),
+      serde_(getNamedVectorSerde("Presto")),
       deserializeTime_(deserializeTime) {
   serdeOptions_ = {false, veloxCompressionType_};
 }
diff --git a/cpp/velox/substrait/SubstraitToVeloxPlan.cc 
b/cpp/velox/substrait/SubstraitToVeloxPlan.cc
index 727f4882e1..d71ab12528 100644
--- a/cpp/velox/substrait/SubstraitToVeloxPlan.cc
+++ b/cpp/velox/substrait/SubstraitToVeloxPlan.cc
@@ -1447,8 +1447,6 @@ core::PlanNodePtr 
SubstraitToVeloxPlanConverter::toVeloxPlan(const ::substrait::
     SubstraitParser::parseColumnTypes(baseSchema, columnTypes);
   }
 
-  // Velox requires Filter Pushdown must being enabled.
-  bool filterPushdownEnabled = true;
   auto names = colNameList;
   auto types = veloxTypeList;
 
@@ -1468,7 +1466,7 @@ core::PlanNodePtr 
SubstraitToVeloxPlanConverter::toVeloxPlan(const ::substrait::
   }
   common::SubfieldFilters subfieldFilters;
   tableHandle = std::make_shared<connector::hive::HiveTableHandle>(
-      connectorId, "hive_table", filterPushdownEnabled, 
std::move(subfieldFilters), remainingFilter, tableSchema);
+      connectorId, "hive_table", std::move(subfieldFilters), remainingFilter, 
tableSchema);
 
   // Get assignments and out names.
   std::vector<std::string> outNames;
diff --git a/cpp/velox/utils/ConfigExtractor.cc 
b/cpp/velox/utils/ConfigExtractor.cc
index 52e42d606d..fe35ce9f8e 100644
--- a/cpp/velox/utils/ConfigExtractor.cc
+++ b/cpp/velox/utils/ConfigExtractor.cc
@@ -292,9 +292,7 @@ std::shared_ptr<facebook::velox::config::ConfigBase> 
createHiveConnectorConfig(
       conf->get<std::string>(kPrefetchRowGroups, "1");
   hiveConfMap[facebook::velox::connector::hive::HiveConfig::kLoadQuantum] =
       conf->get<std::string>(kLoadQuantum, "268435456"); // 256M
-  auto footerEstimatedSize = conf->get<std::string>(kDirectorySizeGuess, 
"32768"); // 32K
-  
hiveConfMap[facebook::velox::connector::hive::HiveConfig::kFooterEstimatedSize] 
=
-      conf->get<std::string>(kFooterEstimatedSize, footerEstimatedSize); // 32K
+
   
hiveConfMap[facebook::velox::connector::hive::HiveConfig::kFilePreloadThreshold]
 =
       conf->get<std::string>(kFilePreloadThreshold, "1048576"); // 1M
 
diff --git a/ep/build-velox/src/get-velox.sh b/ep/build-velox/src/get-velox.sh
index dbfe9f2d5b..558f988e5c 100755
--- a/ep/build-velox/src/get-velox.sh
+++ b/ep/build-velox/src/get-velox.sh
@@ -18,8 +18,8 @@ set -exu
 
 CURRENT_DIR=$(cd "$(dirname "$BASH_SOURCE")"; pwd)
 VELOX_REPO=https://github.com/IBM/velox.git
-VELOX_BRANCH=dft-2026_03_03-iceberg
-VELOX_ENHANCED_BRANCH=ibm-2026_03_03
+VELOX_BRANCH=dft-2026_03_08-iceberg
+VELOX_ENHANCED_BRANCH=ibm-2026_03_08
 VELOX_HOME=""
 RUN_SETUP_SCRIPT=ON
 ENABLE_ENHANCED_FEATURES=OFF


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

(incubator-gluten) branch main updated: [GLUTEN-6887][VL] Daily Update Velox Version (2026_03_08) (#11717)

Reply via email to