lxian commented on pull request #31998:
URL: https://github.com/apache/spark/pull/31998#issuecomment-809650610
I did a simple benchmark on it, and the result looks good
```
Running benchmark: simple filters
Running case: Parquet Vectorized
Stopped after 5 iterations, 6065 ms
Running case: Parquet Vectorized (columnIndex)
Stopped after 27 iterations, 2048 ms
Java HotSpot(TM) 64-Bit Server VM 1.8.0_111-b14 on Mac OS X 10.14.4
Intel(R) Core(TM) i7-4770HQ CPU @ 2.20GHz
simple filters: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Parquet Vectorized 1169 1213
55 13.5 74.3 1.0X
Parquet Vectorized (columnIndex) 61 76
11 258.4 3.9 19.2X
Running benchmark: range filters
Running case: Parquet Vectorized
Stopped after 5 iterations, 6338 ms
Running case: Parquet Vectorized (columnIndex)
Stopped after 6 iterations, 2128 ms
Java HotSpot(TM) 64-Bit Server VM 1.8.0_111-b14 on Mac OS X 10.14.4
Intel(R) Core(TM) i7-4770HQ CPU @ 2.20GHz
range filters: Best Time(ms) Avg Time(ms)
Stdev(ms) Rate(M/s) Per Row(ns) Relative
------------------------------------------------------------------------------------------------------------------------
Parquet Vectorized 1222 1268
79 12.9 77.7 1.0X
Parquet Vectorized (columnIndex) 346 355
8 45.4 22.0 3.5X
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]