LiaCastaneda opened a new issue, #17523:
URL: https://github.com/apache/datafusion/issues/17523

   ### Is your feature request related to a problem or challenge?
   
   Currently, DataFusion only supports bounds dynamic filters, which work well 
when the data sources have metadata or are sorted (e.g., Parquet). However, for 
unsorted data sources, we would benefit from supporting list-based dynamic 
filters.
   
   The idea is to take the build side values, collect them into a List, and 
then convert that list into a predicate (similar to an IN clause).
   
   
   ### Describe the solution you'd like
   
   For example if we have the following query:
   ```
   SELECT *
   FROM fact_table f
   JOIN dim_table d
     ON f.key = d.key
   WHERE d.region = 'US';
   ```
   
   With list dynamic filter, we could collect the f.key values into a List and 
push down a predicate like:
   
   `d.key IN (1, 42, 77, 103, ...)`
   
   into the probe side.
   
   I understand we should put a limit on how big this List can get, maybe make 
this configurable by the users though an option and use a safe default limit.
   
   
   ### Describe alternatives you've considered
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to