alamb commented on issue #6782: URL: https://github.com/apache/arrow-datafusion/issues/6782#issuecomment-1783059578
Update here is that I have spent a non trivial amount of time analyzing the results from ClickBench and TPCH. My conclusion is that DataFusion does quite well. The biggest outliers in TPCH performance are related to join orders (https://github.com/apache/arrow-datafusion/issues/7949, https://github.com/apache/arrow-datafusion/issues/7950) but nothing fundamental to the choice of Arrow / Parquet. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
