GitHub user wangyum commented on the issue:
https://github.com/apache/spark/pull/21623
Benchmark result:
```
###########################[ Pushdown benchmark for StringStartsWith ]###########################
Java HotSpot(TM) 64-Bit Server VM 1.8.0_151-b12 on Mac OS X 10.12.6
Intel(R) Core(TM) i7-7820HQ CPU @ 2.90GHz

StringStartsWith filter: (value like '10%'):      Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
---------------------------------------------------------------------------------------------------------
Parquet Vectorized                                    10104 / 11125          1.6         642.4       1.0X
Parquet Vectorized (Pushdown)                           3002 / 3608          5.2         190.8       3.4X
Native ORC Vectorized                                  9589 / 10454          1.6         609.7       1.1X
Native ORC Vectorized (Pushdown)                       9798 / 10509          1.6         622.9       1.0X

StringStartsWith filter: (value like '1000%'):    Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
---------------------------------------------------------------------------------------------------------
Parquet Vectorized                                      8437 / 8563          1.9         536.4       1.0X
Parquet Vectorized (Pushdown)                             279 / 289         56.3          17.8      30.2X
Native ORC Vectorized                                   7354 / 7568          2.1         467.5       1.1X
Native ORC Vectorized (Pushdown)                        7730 / 7972          2.0         491.4       1.1X

StringStartsWith filter: (value like '786432%'):  Best/Avg Time(ms)    Rate(M/s)   Per Row(ns)   Relative
---------------------------------------------------------------------------------------------------------
Parquet Vectorized                                      8290 / 8510          1.9         527.0       1.0X
Parquet Vectorized (Pushdown)                             260 / 272         60.5          16.5      31.9X
Native ORC Vectorized                                   7361 / 7395          2.1         468.0       1.1X
Native ORC Vectorized (Pushdown)                        7694 / 7811          2.0         489.2       1.1X
```
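For readers checking the numbers, the derived columns appear to follow from the best time alone. The sketch below recomputes Rate, Per Row and Relative for the `value like '1000%'` case. The row count (15 * 1024 * 1024) is an assumption inferred by dividing best time by per-row time; it is not stated in the comment. The helper name `derive` is hypothetical. Because the table rounds times to whole milliseconds, recomputed values can differ from the printed ones in the last digit.

```python
# Best times in ms, copied from the "value like '1000%'" table above.
best_ms = {
    "Parquet Vectorized": 8437,
    "Parquet Vectorized (Pushdown)": 279,
    "Native ORC Vectorized": 7354,
    "Native ORC Vectorized (Pushdown)": 7730,
}

# Assumption: row count inferred from (best time / per-row time);
# the comment itself does not state it.
NUM_ROWS = 15 * 1024 * 1024


def derive(best_ms, baseline, num_rows=NUM_ROWS):
    """Recompute the derived benchmark columns from best times (hypothetical helper)."""
    base = best_ms[baseline]
    out = {}
    for name, ms in best_ms.items():
        out[name] = {
            # Millions of rows processed per second.
            "rate_m_per_s": num_rows / (ms / 1000) / 1e6,
            # Nanoseconds spent per row.
            "per_row_ns": ms * 1e6 / num_rows,
            # Speedup relative to the first (baseline) case.
            "relative": base / ms,
        }
    return out


derived = derive(best_ms, "Parquet Vectorized")
for name, m in derived.items():
    print(f"{name:<34} {m['rate_m_per_s']:6.1f} {m['per_row_ns']:8.1f} {m['relative']:6.1f}X")
```

Under that assumption, the Relative column is simply the baseline best time divided by each case's best time, which is why the Parquet pushdown case stands out while the ORC rows stay close to 1.0X.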