[GitHub] [arrow-datafusion] alamb commented on issue #2445: ObjectStore Directory Semantics

GitBox Fri, 06 May 2022 09:18:32 -0700


alamb commented on issue #2445:
URL: https://github.com/apache/arrow-datafusion/issues/2445#issuecomment-1119783979


   It would help me significantly to understand the globbing use case better: 
when exactly are you selecting a subset of files in a directory via a 
glob? Most analytic systems I have seen tend to assume data has been 
pre-grouped into directories (or equivalent).
   
   AWS Redshift does offer the ability to specify a subset of files that are 
not all in the same directory, but it does so by taking a manifest file: 
https://docs.aws.amazon.com/redshift/latest/dg/loading-data-files-using-manifest.html
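
   To make the contrast concrete, here is a minimal Python sketch of the two 
selection styles. The manifest uses the `entries` / `url` / `mandatory` layout 
from the Redshift documentation linked above; the bucket name, object paths, 
and the helper names `select_by_glob` and `select_by_manifest` are hypothetical 
and only for illustration. This is not DataFusion's ObjectStore API.

```python
import json
from fnmatch import fnmatchcase

# Hypothetical object listing; a real one would come from an object store.
OBJECTS = [
    "s3://example-bucket/sales/2022/01/part-0.parquet",
    "s3://example-bucket/sales/2022/02/part-0.parquet",
    "s3://example-bucket/returns/2022/01/part-0.parquet",
]

# Manifest in the style of a Redshift COPY manifest: an explicit list of
# URLs, which may live under unrelated prefixes ("directories").
MANIFEST = json.dumps({
    "entries": [
        {"url": OBJECTS[0], "mandatory": True},
        {"url": OBJECTS[2], "mandatory": True},
    ]
})


def select_by_glob(objects, pattern):
    """Keep objects whose full path matches a shell-style pattern.

    Note: fnmatch's `*` also matches `/`, unlike many shell globs.
    """
    return [o for o in objects if fnmatchcase(o, pattern)]


def select_by_manifest(manifest_text):
    """Return exactly the URLs listed in the manifest, in order."""
    return [entry["url"] for entry in json.loads(manifest_text)["entries"]]


if __name__ == "__main__":
    print(select_by_glob(OBJECTS, "s3://example-bucket/sales/*"))
    print(select_by_manifest(MANIFEST))
```

   The glob needs a listing to filter, while the manifest names each file 
up front and so can span prefixes without any listing at all.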


