alamb opened a new issue, #11682:
URL: https://github.com/apache/datafusion/issues/11682

   ### Is your feature request related to a problem or challenge?
   
   As part of https://github.com/apache/datafusion/issues/10918, @XiangpengHao 
has threaded the use of `StringView` through  parquet, arrow-rs and then into 
DataFusion
   
   When the `datafusion.execution.parquet.schema_force_string_view` option is 
enabled, the DataFusion Parquet reader will read all Utf8 columns as 
`StringView` instead, which results in significantly faster performance 
(details TBD but we will write it down in  
https://github.com/apache/datafusion/issues/11603 )
   
   However, when initially merged 
https://github.com/apache/datafusion/pull/11667 this setting will be off by 
default 
   
   This ticket tracks what it would take to turn the setting on by default
   
   ### Describe the solution you'd like
   
   Change the default value of 
`datafusion.execution.parquet.schema_force_string_view`  to true
   
   ### Describe alternatives you've considered
   
   Basically we should enable the flag by default and then run some benchmarks 
to ensure performance doesn't change by too much
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org
For additional commands, e-mail: github-h...@datafusion.apache.org

Reply via email to