Dandandan commented on PR #15697:
URL: https://github.com/apache/datafusion/pull/15697#issuecomment-2807089262

   > I think this would be nicer (and tie in better with future work 😉) if we 
essentially followed the structure of #15301 but do the filtering in `TopK` or 
`SortExec`:
   > 
   > 1. Keep track of a `thresholds: Arc<RwLock<Vec<Option<ScalarValue>>>>` and 
`filter: `Option<Arc>`on`TopK`.
   > 2. For each batch check pass it through the existing filter, if any, and 
exit early if no rows remain.
   > 3. If we updated our heap propagate the update to `thresholds` and 
`filter`.
   
   Yeah this sounds like a good idea, let me have a look to make some changes 
in this direction within the scope of this PR.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to