andygrove commented on PR #2273: URL: https://github.com/apache/datafusion-comet/pull/2273#issuecomment-3249846325
I understand the issue now. For `COUNT(DISTINCT bool_col)`, the partial count outputs all of the distinct values for `bool_col`, so we have a boolean list vector containing `[[true], [false]]`. The columnar shuffle writer does not understand this and assumes that the schema is a single `LongType` column, which is the type of the final output from the count rather than of the partial state.
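To illustrate why the partial state has a different type than the final result, here is a toy model in plain Python. It is only a sketch of the two-stage idea and is not Comet or DataFusion code; the function names `partial_count_distinct` and `final_count_distinct` and the sample partitions are made up for illustration.

```python
from typing import Iterable, List, Optional, Set


def partial_count_distinct(values: Iterable[Optional[bool]]) -> Set[bool]:
    """Partial stage: keep the distinct non-null values seen in one partition.

    The state is a collection of booleans, not an integer count.
    """
    return {v for v in values if v is not None}


def final_count_distinct(partial_states: List[Set[bool]]) -> int:
    """Final stage: merge the partial states, then count the merged values."""
    merged: Set[bool] = set()
    for state in partial_states:
        merged |= state
    return len(merged)


# Hypothetical input: three partitions of a nullable boolean column.
partitions = [[True, False, True], [False, None], [True]]

# What would cross the shuffle: one set of distinct values per partition.
states = [partial_count_distinct(p) for p in partitions]

# Only the final stage produces a single integer.
result = final_count_distinct(states)

# Summing per-partition counts instead of merging values overcounts,
# which is why the intermediate data has to carry the values themselves.
naive_sum = sum(len(s) for s in states)
```

In this model, whatever is exchanged between the two stages has to be described as a list of booleans. Describing it with the final integer type would not match the data actually produced by the partial stage, which is the mismatch described above.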