mkleen commented on issue #22989: URL: https://github.com/apache/datafusion/issues/22989#issuecomment-4787791956
I had a look at DuckDB. DuckDB doesn't list the supported types for approx_count_distinct in its docs (https://duckdb.org/docs/lts/sql/functions/aggregates). I did some manual testing, and it seems to support every data type:

| Category | Types |
|---|---|
| Signed int | `TINYINT`, `SMALLINT`, `INTEGER`, `BIGINT`, `HUGEINT` |
| Unsigned int | `UTINYINT`, `USMALLINT`, `UINTEGER`, `UBIGINT` |
| Float | `FLOAT`, `DOUBLE`, `DECIMAL` |
| String/binary | `VARCHAR`, `BLOB`, `BIT`, `UUID` |
| Temporal | `DATE`, `TIME`, `TIMESTAMP`, `TIMESTAMPTZ`, `INTERVAL` |
| Other | `BOOLEAN`, `JSON` |
| Nested | `LIST`, `STRUCT`, `MAP` |

As a start, maybe let's focus on the data types that are straightforward to support.

--
This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at: [email protected]
