tustvold commented on issue #3261:
URL: 
https://github.com/apache/arrow-datafusion/issues/3261#issuecomment-1227360620

   The reason I didn't do this is glob characters aren't URL-safe, so something 
like `s3://bucket/path/*.parquet` isn't a valid URL. I could only find examples 
of systems that supported glob expressions to local filesystem, and so I wasn't 
really sure how best to encode globs in URLs and opted to just punt on it.
   
   Some possible ideas:
   
   * Just ignore that it isn't a valid URL and accept the fact it is 
potentially very confusing (what this ticket proposes)
   * Provide a programmatic interface to construct a `ListingTableUrl` with a 
custom scheme and glob
   * Encode the glob expression as a URL-encoded query parameter
   * Something else
   
   It is also potentially worth highlighting that IIRC the logical plan 
serialization currently doesn't handle glob expressions and just drops them on 
the floor.
   
   I think it would really help move this forward if we could find an example 
of a system that supports glob expressions to object stores, otherwise we end 
up having to design something custom which we will inevitably get wrong


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to