alamb opened a new issue, #16158: URL: https://github.com/apache/datafusion/issues/16158
Today, when creating tables of parquet files using `CREATE EXTERNAL TABLE` or ListingTables, statistics are not gathered. This is good in the sense that creating the table is fast(er) but subsequent queries might be slower The behavior is clarified in - https://github.com/apache/datafusion/pull/16157 @davisp suggests that defaulting to collecting statistics would make more sense (and I agree): > I’ll also note that my personal preference would be to default to true purely because it took a surprising amount of work to figure out how to even report #15908 not knowing that statistics collection was a config option. I do see the rationale around the behavior change, though I’d say either way that flag is defaulted is a behavior change and true seems like a saner default. _Originally posted by @davisp in https://github.com/apache/datafusion/pull/16080#pullrequestreview-2852069846_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org