alamb opened a new issue, #11682: URL: https://github.com/apache/datafusion/issues/11682
### Is your feature request related to a problem or challenge? As part of https://github.com/apache/datafusion/issues/10918, @XiangpengHao has threaded the use of `StringView` through parquet, arrow-rs and then into DataFusion When the `datafusion.execution.parquet.schema_force_string_view` option is enabled, the DataFusion Parquet reader will read all Utf8 columns as `StringView` instead, which results in significantly faster performance (details TBD but we will write it down in https://github.com/apache/datafusion/issues/11603 ) However, when initially merged https://github.com/apache/datafusion/pull/11667 this setting will be off by default This ticket tracks what it would take to turn the setting on by default ### Describe the solution you'd like Change the default value of `datafusion.execution.parquet.schema_force_string_view` to true ### Describe alternatives you've considered Basically we should enable the flag by default and then run some benchmarks to ensure performance doesn't change by too much ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org