gongxun0928 commented on code in PR #1324:
URL: https://github.com/apache/cloudberry/pull/1324#discussion_r2297005813


##########
contrib/pax_storage/src/cpp/access/pax_access_handle.cc:
##########
@@ -453,6 +453,7 @@ uint32 PaxAccessMethod::ScanFlags(Relation relation) {
   flags |= SCAN_FORCE_BIG_WRITE_LOCK;
 #endif
 
+  flags |= SCAN_SUPPORT_RUNTIME_FILTER;

Review Comment:
   If gp_enable_runtime_filter_pushdown is enabled by default, runtime filter 
pushdown applies to every query on PAX-format tables, so it is expected to 
affect TPC-DS/TPC-H performance. 
   
   IN and min-max filters rely on column-level statistics, whereas bloom 
filters perform row-level filtering. Even without IN or min-max filtering, 
simply pushing the bloom filter from SeqNext() down to the TableAM level can 
reduce unnecessary memory copies. For example, if a row has already been 
filtered out by the bloom filter expression, there is no need to construct and 
return the corresponding TupleTableSlot for that row.
   
   This leads to a minor performance improvement.
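
   To make the memory-copy argument concrete, here is a minimal, self-contained 
sketch. It is not the PAX or Cloudberry code: ToyBloomFilter, ToySlot, 
ScanThenFilter and ScanWithPushdown are hypothetical names invented for 
illustration, and a "slot" here is just a small struct standing in for a 
materialized tuple. It only counts how many slots each strategy has to build.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <vector>

// Hypothetical toy bloom filter (two hash functions); an assumption for
// illustration, not the runtime filter used by the executor.
class ToyBloomFilter {
 public:
  explicit ToyBloomFilter(std::size_t nbits) : bits_(nbits, false) {
    assert(nbits > 0);  // a zero-sized filter would divide by zero below
  }
  void Add(std::int64_t key) {
    bits_[Hash1(key)] = true;
    bits_[Hash2(key)] = true;
  }
  // May return true for keys never added (false positive), never the reverse.
  bool MightContain(std::int64_t key) const {
    return bits_[Hash1(key)] && bits_[Hash2(key)];
  }

 private:
  std::size_t Hash1(std::int64_t key) const {
    return std::hash<std::int64_t>{}(key) % bits_.size();
  }
  std::size_t Hash2(std::int64_t key) const {
    std::uint64_t k = static_cast<std::uint64_t>(key) * 0x9E3779B97F4A7C15ULL;
    return static_cast<std::size_t>(k >> 32) % bits_.size();
  }
  std::vector<bool> bits_;
};

// Hypothetical stand-in for a materialized tuple slot.
struct ToySlot {
  std::int64_t key;
  std::int64_t payload;
};

struct ScanResult {
  std::vector<ToySlot> rows;   // rows that passed the filter
  std::size_t slots_built = 0; // how many slots were materialized
};

// Filter applied above the scan: every row is materialized, then tested.
ScanResult ScanThenFilter(const std::vector<std::int64_t>& keys,
                          const std::vector<std::int64_t>& payloads,
                          const ToyBloomFilter& bf) {
  ScanResult res;
  for (std::size_t i = 0; i < keys.size(); ++i) {
    ToySlot slot{keys[i], payloads[i]};  // copy happens unconditionally
    ++res.slots_built;
    if (bf.MightContain(slot.key)) res.rows.push_back(slot);
  }
  return res;
}

// Filter pushed into the scan: the key column is tested first, and a slot
// is built only for rows that survive.
ScanResult ScanWithPushdown(const std::vector<std::int64_t>& keys,
                            const std::vector<std::int64_t>& payloads,
                            const ToyBloomFilter& bf) {
  ScanResult res;
  for (std::size_t i = 0; i < keys.size(); ++i) {
    if (!bf.MightContain(keys[i])) continue;  // skip before any copy
    res.rows.push_back(ToySlot{keys[i], payloads[i]});
    ++res.slots_built;
  }
  return res;
}
```

   Both functions return the same rows; the difference is that the pushdown 
variant materializes a slot only for rows that pass the bloom filter, which is 
the saving described above.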



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@cloudberry.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

