alamb opened a new issue #199:
URL: https://github.com/apache/arrow-datafusion/issues/199


   *Note*: migrated from original JIRA: 
https://issues.apache.org/jira/browse/ARROW-12312
   
   Running a `COUNT(DISTINCT ..)` query on a float column panics with the 
following error:
   
   thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', 
datafusion/src/scalar.rs:342:22
   
   Reproducer:
   {code}
    echo "foo,1.23" > /tmp/foo.csv
    ./target/debug/datafusion-cli
   
   > CREATE EXTERNAL TABLE t (a varchar, b float) STORED AS CSV LOCATION 
'/tmp/foo.csv';
   0 rows in set. Query took 0 seconds.
   > select count(distinct a) from t;
   +-------------------+
   | COUNT(DISTINCT a) |
   +-------------------+
   | 1                 |
   +-------------------+
   1 rows in set. Query took 0 seconds.
   > select count(distinct b) from t;
   thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', 
datafusion/src/scalar.rs:342:22
   note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
   ArrowError(ExternalError(Canceled))
   {code}
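
For reference, the expected result of the failing query is 1, the same as for the varchar column. The sketch below shows one common way to count distinct float values in plain Rust. It is not taken from DataFusion's source and does not describe the actual cause or fix: the helper name `count_distinct_f32` is hypothetical, and keying on the bit pattern is an assumption chosen because `f32` implements neither `Eq` nor `Hash`.

```rust
use std::collections::HashSet;

/// Hypothetical helper (not a DataFusion API): counts distinct non-null
/// f32 values by hashing their raw bit patterns.
/// Caveat of this approach: 0.0 and -0.0 count as two distinct values,
/// as do NaNs with different payloads.
fn count_distinct_f32(values: &[Option<f32>]) -> usize {
    let mut seen: HashSet<u32> = HashSet::new();
    // `flatten` skips the `None` (null) entries.
    for v in values.iter().flatten() {
        seen.insert(v.to_bits());
    }
    seen.len()
}

fn main() {
    // Mirrors the single-row reproducer: column b holds one value, 1.23.
    let column = [Some(1.23_f32)];
    println!("{}", count_distinct_f32(&column));
}
```

Whether nulls, signed zeros, and NaNs should be treated this way is a design choice that the real aggregate would need to settle separately.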



