andygrove commented on pull request #8172:
URL: https://github.com/apache/arrow/pull/8172#issuecomment-695803865
I've been testing with the TPC-H benchmark this morning.
Latest from master:
```
Running benchmarks with the following options: TpchOpt { query: 1, debug:
false, iterations: 3, concurrency: 24, batch_size: 4096, path:
"/mnt/tpch/parquet/100-240", file_format: "parquet" }
Query 1 iteration 0 took 16167 ms
Query 1 iteration 1 took 16012 ms
Query 1 iteration 2 took 16229 ms
```
This PR:
```
Running benchmarks with the following options: TpchOpt { query: 1, debug:
false, iterations: 3, concurrency: 24, batch_size: 4096, path:
"/mnt/tpch/parquet/100-240", file_format: "parquet" }
Query 1 iteration 0 took 13638 ms
Query 1 iteration 1 took 13559 ms
Query 1 iteration 2 took 13622 ms
```
That's around 15% speedup :rocket:
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]