aditanase commented on issue #9280: URL: https://github.com/apache/datafusion/issues/9280#issuecomment-2120467689
I was recently trying to query the NYC dataset from ballista. Path looks something like https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-01.parquet What would be the correct way to register this path with DF? Trying this: ``` create external table nyc_trip_data stored as parquet location 'https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-01.parquet'; ``` Results in: `Error building plan: Execution error: No object store available for: https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-01.parquet` I believe this is probably s3 configured for distribution with cloudfront (educated guess). Duckdb can handle this via the `httpfs` extension: https://duckdb.org/docs/extensions/httpfs/https Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
