alamb opened a new issue #199: URL: https://github.com/apache/arrow-datafusion/issues/199
*Note*: migrated from original JIRA: https://issues.apache.org/jira/browse/ARROW-12312

If you try to run a `COUNT (DISTINCT ..)` query on a float column you get the following error:

thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', datafusion/src/scalar.rs:342:22

Reproducer:

{code}
echo "foo,1.23" > /tmp/foo.csv
./target/debug/datafusion-cli
> CREATE EXTERNAL TABLE t (a varchar, b float) STORED AS CSV LOCATION '/tmp/foo.csv';
0 rows in set. Query took 0 seconds.
> select count(distinct a) from t;
+-------------------+
| COUNT(DISTINCT a) |
+-------------------+
| 1                 |
+-------------------+
1 rows in set. Query took 0 seconds.
> select count(distinct b) from t;
thread 'tokio-runtime-worker' panicked at 'Unexpected DataType for list', datafusion/src/scalar.rs:342:22
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace
ArrowError(ExternalError(Canceled))
{code}

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]
