ic4y commented on pull request #1556: URL: https://github.com/apache/arrow-datafusion/pull/1556#issuecomment-1012809108
I found that the peak memory usage of this branch increases by 80% compared to the master branch。 sql : select avg(user_id) from parquet_event_1 group by user_name limit 10 test dataset : total 450 million, 50 million users | branch | peak memory | | ---- | ---- | | master(d7e465 and 35d65fc) | 10G | | arrow2_merge | 6G | -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
