cube2222 edited a comment on issue #2109: URL: https://github.com/apache/arrow-datafusion/issues/2109#issuecomment-1081461585
Hey @matthewmturner, sure:

1. I've installed datafusion-cli through homebrew, I'm assuming that's compiled with --release.
2. Yes, it does.
3. Running interactively it's already the CREATE EXTERNAL TABLE statement that takes ages.

But you're right, the final query didn't run. With the semicolon, the final query takes 0.4 seconds (so as expected).

So it's basically 100 second CREATE EXTERNAL TABLE + 0.4 second query.

Here's the full output:

```
datafusion-cli
DataFusion CLI v7.0.0
❯ CREATE EXTERNAL TABLE taxi STORED AS CSV WITH HEADER ROW LOCATION './taxi.csv';
0 rows in set. Query took 106.102 seconds.
❯ SELECT passenger_count, COUNT(*), AVG(total_amount) FROM taxi GROUP BY passenger_count;
+-----------------+-----------------+------------------------+
| passenger_count | COUNT(UInt8(1)) | AVG(taxi.total_amount) |
+-----------------+-----------------+------------------------+
| 4               | 25510           | 18.452774990199917     |
| 9               | 1               | 113.6                  |
| 0               | 42228           | 17.021401676612687     |
| 5               | 50291           | 17.27092481756182      |
| 8               | 2               | 95.705                 |
|                 | 128020          | 32.237151148258164     |
| 2               | 286461          | 18.097587071189274     |
| 3               | 72852           | 17.915395871081138     |
| 1               | 1533197         | 17.6418833065818       |
| 6               | 32623           | 17.600296416638567     |
| 7               | 2               | 87.17                  |
+-----------------+-----------------+------------------------+
11 rows in set. Query took 0.385 seconds.
❯
```

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org