alamb opened a new issue, #16158:
URL: https://github.com/apache/datafusion/issues/16158

   Today, when creating tables of Parquet files using `CREATE EXTERNAL TABLE` 
or ListingTables, statistics are not gathered by default.
   
   This is good in the sense that creating the table is faster, but subsequent 
queries might be slower because the optimizer has no statistics to use.
   
   The behavior is clarified in 
   - https://github.com/apache/datafusion/pull/16157 
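
For readers who want statistics gathered at table creation time, a minimal sketch of opting in from SQL follows. It assumes the option is `datafusion.execution.collect_statistics` (the config option the quoted comment below alludes to) and that it is set before the table is created. The table name `hits` and the location path are hypothetical placeholders.

```sql
-- Opt in to statistics collection for tables created after this point.
-- Assumption: the setting is read when the table is created, so it
-- must be set before CREATE EXTERNAL TABLE runs.
SET datafusion.execution.collect_statistics = true;

-- Hypothetical table name and path, for illustration only.
CREATE EXTERNAL TABLE hits
STORED AS PARQUET
LOCATION '/path/to/parquet/files/';
```

With the setting on, creating the table may take longer, which is the trade-off described above.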
   
   @davisp suggests that defaulting to collecting statistics would make more 
sense (and I agree):
   
   > I’ll also note that my personal preference would be to default to true 
purely because it took a surprising amount of work to figure out how to even 
report #15908 not knowing that statistics collection was a config option.  I do 
see the rationale around the behavior change, though I’d say either way that 
flag is defaulted is a behavior change and true seems like a saner default.
   
   
   _Originally posted by @davisp in 
https://github.com/apache/datafusion/pull/16080#pullrequestreview-2852069846_
               


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

