cube2222 edited a comment on issue #2109:
URL: 
https://github.com/apache/arrow-datafusion/issues/2109#issuecomment-1081461585


   Hey @matthewmturner, sure:
   1. I've installed datafusion-cli through homebrew, I'm assuming that's 
compiled with --release.
   2. Yes, it does.
   3. Running interactively it's already the CREATE EXTERNAL TABLE statement 
that takes ages. But you're right, the final query didn't run. With the 
semicolon, the final query takes 0.4 seconds (so as expected). So it's 
basically 100 second CREATE EXTERNAL TABLE + 0.4 second query. 
   
   Here's the full output:
   ```
   datafusion-cli
   DataFusion CLI v7.0.0
   ❯ CREATE EXTERNAL TABLE taxi
   STORED AS CSV
   WITH HEADER ROW
   LOCATION './taxi.csv';
   0 rows in set. Query took 106.102 seconds.
   ❯ SELECT passenger_count, COUNT(*), AVG(total_amount) FROM taxi GROUP BY 
passenger_count;
   +-----------------+-----------------+------------------------+
   | passenger_count | COUNT(UInt8(1)) | AVG(taxi.total_amount) |
   +-----------------+-----------------+------------------------+
   | 4               | 25510           | 18.452774990199917     |
   | 9               | 1               | 113.6                  |
   | 0               | 42228           | 17.021401676612687     |
   | 5               | 50291           | 17.27092481756182      |
   | 8               | 2               | 95.705                 |
   |                 | 128020          | 32.237151148258164     |
   | 2               | 286461          | 18.097587071189274     |
   | 3               | 72852           | 17.915395871081138     |
   | 1               | 1533197         | 17.6418833065818       |
   | 6               | 32623           | 17.600296416638567     |
   | 7               | 2               | 87.17                  |
   +-----------------+-----------------+------------------------+
   11 rows in set. Query took 0.385 seconds.
   ❯
   ``` 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to