flyrain commented on pull request #3557:
URL: https://github.com/apache/iceberg/pull/3557#issuecomment-974498636


   Resolved the comments and added the benchmark. Here is the benchmark result 
for 10M rows with different percentage of equality deleted rows. About 40% 
percent perf gain on average. Less eq delete rows, larger perf gain.
   ```
   Benchmark                                                    
(percentDeleteRow)  Mode  Cnt   Score   Error  Units
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                            
 0    ss    5   3.261 ± 0.210   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                      
0.000001    ss    5   3.940 ± 0.253   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                          
0.05    ss    5   4.888 ± 0.540   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                          
0.25    ss    5   7.369 ± 3.521   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                           
0.5    ss    5   9.433 ± 6.369   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIceberg                            
 1    ss    5  20.206 ± 4.298   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized                  
 0    ss    5   1.721 ± 0.120   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized            
0.000001    ss    5   2.172 ± 0.112   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized                
0.05    ss    5   3.305 ± 0.301   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized                
0.25    ss    5   5.201 ± 3.418   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized                 
0.5    ss    5   8.869 ± 6.236   s/op
   IcebergSourceParquetEqDeleteBenchmark.readIcebergVectorized                  
 1    ss    5  15.589 ± 7.211   s/op
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to