cyb70289 edited a comment on pull request #11376:
URL: https://github.com/apache/arrow/pull/11376#issuecomment-939657726


   **NOTE**: below results are compared against the non-inlined benchmark (1st 
commit in this pr), not against master branch.
   
   `$ archery benchmark diff --suite-filter="arrow-utf8-util-benchmark" HEAD 
HEAD^`
   
   **clang-10, Intel gold 5218**
   _The improvement of ValidateLargeNonAscii looks not real. It should not be 
affected by this pr._
   ```
   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   Non-regressions: (8)
   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
                  benchmark       baseline      contender  change %             
                                                                                
                   counters
          ValidateTinyAscii  1.421 GiB/sec  2.000 GiB/sec    40.758       
{'run_name': 'ValidateTinyAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 106006292}
      ValidateLargeNonAscii  1.554 GiB/sec  2.032 GiB/sec    30.731       
{'run_name': 'ValidateLargeNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 11668}
         ValidateSmallAscii 14.104 GiB/sec 17.500 GiB/sec    24.082       
{'run_name': 'ValidateSmallAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 75859578}
   ValidateSmallAlmostAscii  2.983 GiB/sec  3.402 GiB/sec    14.066 
{'run_name': 'ValidateSmallAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 15147502}
   ValidateLargeAlmostAscii  3.382 GiB/sec  3.711 GiB/sec     9.736    
{'run_name': 'ValidateLargeAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 25408}
      ValidateSmallNonAscii  1.953 GiB/sec  1.978 GiB/sec     1.280    
{'run_name': 'ValidateSmallNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 10763254}
         ValidateLargeAscii 38.357 GiB/sec 38.436 GiB/sec     0.207         
{'run_name': 'ValidateLargeAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 288231}
       ValidateTinyNonAscii  1.243 GiB/sec  1.242 GiB/sec    -0.105     
{'run_name': 'ValidateTinyNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 84939377}
   ```
   
   **gcc-9, Intel gold 5218**
   _Again, the regression of ValidateLargeNonAscii looks not real._
   ```
   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   Non-regressions: (7)
   
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
                  benchmark       baseline      contender  change %             
                                                                                
                   counters
         ValidateSmallAscii 13.695 GiB/sec 13.943 GiB/sec     1.810       
{'run_name': 'ValidateSmallAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 74956699}
   ValidateSmallAlmostAscii  3.259 GiB/sec  3.303 GiB/sec     1.349 
{'run_name': 'ValidateSmallAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 16556254}
       ValidateTinyNonAscii  1.186 GiB/sec  1.201 GiB/sec     1.270     
{'run_name': 'ValidateTinyNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 81825116}
         ValidateLargeAscii 39.003 GiB/sec 39.047 GiB/sec     0.112         
{'run_name': 'ValidateLargeAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 293159}
   ValidateLargeAlmostAscii  3.489 GiB/sec  3.489 GiB/sec    -0.010    
{'run_name': 'ValidateLargeAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 26201}
      ValidateSmallNonAscii  1.675 GiB/sec  1.674 GiB/sec    -0.061     
{'run_name': 'ValidateSmallNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 9388982}
          ValidateTinyAscii  1.624 GiB/sec  1.616 GiB/sec    -0.523       
{'run_name': 'ValidateTinyAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 122075013}
   
   
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   Regressions: (1)
   
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------
               benchmark      baseline     contender  change %                  
                                                                                
        counters
   ValidateLargeNonAscii 1.747 GiB/sec 1.584 GiB/sec    -9.328 {'run_name': 
'ValidateLargeNonAscii', 'repetitions': 0, 'repetition_index': 0, 'threads': 1, 
'iterations': 13117}
   ```
   
   **Arm Neoverse N1, clang-10**
   _Arm benchmark shows big improvement of tiny and small ascii validation, 
it's reasonable._
   ```
   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   Non-regressions: (8)
   
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
                  benchmark        baseline       contender  change %           
                                                                                
                     counters
          ValidateTinyAscii   1.214 GiB/sec   1.862 GiB/sec    53.331        
{'run_name': 'ValidateTinyAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 91250343}
         ValidateSmallAscii  12.344 GiB/sec  17.937 GiB/sec    45.313       
{'run_name': 'ValidateSmallAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 67719531}
       ValidateTinyNonAscii 817.353 MiB/sec 820.267 MiB/sec     0.357     
{'run_name': 'ValidateTinyNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 54553894}
   ValidateSmallAlmostAscii   2.579 GiB/sec   2.583 GiB/sec     0.161 
{'run_name': 'ValidateSmallAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 13097861}
         ValidateLargeAscii  43.055 GiB/sec  43.101 GiB/sec     0.105         
{'run_name': 'ValidateLargeAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 323527}
   ValidateLargeAlmostAscii   2.738 GiB/sec   2.737 GiB/sec    -0.039    
{'run_name': 'ValidateLargeAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 20551}
      ValidateLargeNonAscii   1.207 GiB/sec   1.205 GiB/sec    -0.195        
{'run_name': 'ValidateLargeNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 9066}
      ValidateSmallNonAscii   1.324 GiB/sec   1.318 GiB/sec    -0.468     
{'run_name': 'ValidateSmallNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 7425228}
   ```
   
   **Arm Neoverse N1, gcc-9**
   ```
   
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
   Non-regressions: (8)
   
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
                  benchmark         baseline       contender  change %          
                                                                                
                      counters
          ValidateTinyAscii    1.637 GiB/sec   2.149 GiB/sec    31.283       
{'run_name': 'ValidateTinyAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 122917255}
         ValidateSmallAscii   15.231 GiB/sec  16.637 GiB/sec     9.236       
{'run_name': 'ValidateSmallAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 83551083}
         ValidateLargeAscii   43.135 GiB/sec  43.175 GiB/sec     0.093         
{'run_name': 'ValidateLargeAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 324160}
      ValidateSmallNonAscii    1.394 GiB/sec   1.395 GiB/sec     0.057     
{'run_name': 'ValidateSmallNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 7819363}
   ValidateLargeAlmostAscii    2.816 GiB/sec   2.818 GiB/sec     0.055    
{'run_name': 'ValidateLargeAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 21173}
   ValidateSmallAlmostAscii    2.733 GiB/sec   2.732 GiB/sec    -0.012 
{'run_name': 'ValidateSmallAlmostAscii', 'repetitions': 0, 'repetition_index': 
0, 'threads': 1, 'iterations': 13878062}
      ValidateLargeNonAscii    1.410 GiB/sec   1.410 GiB/sec    -0.041       
{'run_name': 'ValidateLargeNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 10588}
       ValidateTinyNonAscii 1001.808 MiB/sec 999.129 MiB/sec    -0.267     
{'run_name': 'ValidateTinyNonAscii', 'repetitions': 0, 'repetition_index': 0, 
'threads': 1, 'iterations': 66846809}
   ```


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to