alamb commented on issue #17211:
URL: https://github.com/apache/datafusion/issues/17211#issuecomment-3272260490

   > These first queries really begin to show discrepancies. In spite of the 
GET requests being identical between the two queries, the partitioned table 
executes 22 LIST operations compared to the flat table's 1 LIST operation.
   
   Yes this seems like a quite compelling reason to:
   1. Cache the listing results
   2. Get the datafusion-cli listing feature polished up. 
   
   
   > This data also suggests that if a ListFilesCache were to be implemented 
and enabled it could eliminate all the IO for simple queries that can be 
computed entirely from metadata.
   
   This sounds great and a worthy goal for sure


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to