alamb commented on issue #10918: URL: https://github.com/apache/datafusion/issues/10918#issuecomment-2228800548
An update here on the plan from @XiangpengHao: * He has local changes (that also require some additional features that will be released in arrow `52.2.0`) that show significant performance improvements for TPCH and ClickBench queries * The hope is that these proposals are all up for review by the end of this week * So by the time arrow `52.2.0` is released (early August 2024) we'll be able to add an option that makes DataFusion use StringView when reading from Parquet / filtering * We also plan to write a blog post about this work / adventure I for one am very excited -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: github-unsubscr...@datafusion.apache.org For additional commands, e-mail: github-h...@datafusion.apache.org