alamb commented on issue #133:
URL: 
https://github.com/apache/arrow-datafusion/issues/133#issuecomment-828446299


   > Is there any reason to limit this to parquet files? 
   
   I do not think there is any reason to limit to parquet files. Parquet files 
are probably the most important usecase initially but the functionality would 
be useful for everyone
   
   I think the first thing to do might be to write up a high level proposal (we 
have used google docs to good effect in the past). The first work needed (for 
this ticket) is probably to do a recursive directory traversal and find all 
parquet (or other) formats in subdirectories.
   
   Then there is probably work to interpret paths as their relevant partition 
keys, and then implement partition pruning (based on the existing row group 
pruning code, I would think) 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to