alamb commented on PR #8753: URL: https://github.com/apache/arrow-datafusion/pull/8753#issuecomment-1879274741
Thank you @kcolford and @Jefffrey for the review. It would be great to address @Jefffrey 's comments I tried this out on the ClickBench file fetched via http and it worked like a charm ```shell ❯ create external table hits stored as parquet location 'https://datasets.clickhouse.com/hits_compatible/athena_partitioned/hits_1.parquet'; 0 rows in set. Query took 0.178 seconds. ❯ describe hits; +-----------------------+-----------+-------------+ | column_name | data_type | is_nullable | +-----------------------+-----------+-------------+ | WatchID | Int64 | YES | | JavaEnable | Int16 | YES | | Title | Binary | YES | | GoodEvent | Int16 | YES | | EventTime | Int64 | YES | | EventDate | UInt16 | YES | ... | RefererHash | Int64 | YES | | URLHash | Int64 | YES | | CLID | Int32 | YES | +-----------------------+-----------+-------------+ 105 rows in set. Query took 0.003 seconds. ``` This is a really nice improvement. Let us know if you need help updating the docs or tests -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
