avantgardnerio commented on PR #2813: URL: https://github.com/apache/arrow-datafusion/pull/2813#issuecomment-1171624690
I double checked with

```
cargo run --release --bin tpch -- benchmark datafusion --iterations 3 --path ./data --format tbl --query 4 --batch-size 4096
```

and observed that this didn't resolve "the real" query 4, just my stripped down minimal failing test case. I made some adjustments, and I was able to run query 4 with the presently committed code:

```
+-----------------+-------------+
| o_orderpriority | order_count |
+-----------------+-------------+
| 1-URGENT        | 10594       |
| 2-HIGH          | 10476       |
| 3-MEDIUM        | 10410       |
| 4-NOT SPECIFIED | 10556       |
| 5-LOW           | 10487       |
+-----------------+-------------+
Query 4 iteration 2 took 43617.9 ms and returned 5 rows
Query 4 avg time: 45785.94 ms
```

This is slow, but matches my postgres results:

```
+---------------+-----------+
|o_orderpriority|order_count|
+---------------+-----------+
|1-URGENT       |10594      |
|2-HIGH         |10476      |
|3-MEDIUM       |10410      |
|4-NOT SPECIFIED|10556      |
|5-LOW          |10487      |
+---------------+-----------+
```
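
For anyone who wants to repeat the comparison mechanically rather than by eye, here is a minimal sketch in Python. It assumes the two result tables have been pasted in as text; `parse_ascii_table` is a hypothetical helper written for this illustration and is not part of DataFusion or the benchmark tooling.

```python
def parse_ascii_table(text):
    """Parse a pipe-delimited ASCII table into a list of row tuples (header first).

    Hypothetical helper for illustration only; not part of DataFusion.
    """
    rows = []
    for line in text.strip().splitlines():
        line = line.strip()
        # Skip border lines such as +-----+-----+ and anything that is not a row.
        if not line.startswith("|"):
            continue
        # Drop the outer pipes, split on the inner ones, and trim cell padding.
        cells = tuple(cell.strip() for cell in line.strip("|").split("|"))
        rows.append(cells)
    return rows


# Result tables copied from the comment above.
datafusion_output = """
+-----------------+-------------+
| o_orderpriority | order_count |
+-----------------+-------------+
| 1-URGENT        | 10594       |
| 2-HIGH          | 10476       |
| 3-MEDIUM        | 10410       |
| 4-NOT SPECIFIED | 10556       |
| 5-LOW           | 10487       |
+-----------------+-------------+
"""

postgres_output = """
+---------------+-----------+
|o_orderpriority|order_count|
+---------------+-----------+
|1-URGENT       |10594      |
|2-HIGH         |10476      |
|3-MEDIUM       |10410      |
|4-NOT SPECIFIED|10556      |
|5-LOW          |10487      |
+---------------+-----------+
"""

df_rows = parse_ascii_table(datafusion_output)
pg_rows = parse_ascii_table(postgres_output)

# The padding differs between the two tables, so compare the trimmed cells.
results_match = df_rows == pg_rows
print("results match:", results_match)
```

The comparison is order-sensitive, which is appropriate here because query 4 sorts its output by `o_orderpriority`.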