[GitHub] activemq-artemis issue #1752: ARTEMIS-1586 Reduce GC pressure due to String ...

franz1981 Sat, 06 Jan 2018 10:11:09 -0800

Github user franz1981 commented on the issue:

    https://github.com/apache/activemq-artemis/pull/1752
  
    @michaelandrepearce @clebertsuconic 
    As promised I've provided a benchmark that can be run with ease directly 
from the IDE:
    https://github.com/franz1981/activemq-artemis/tree/jmh_interner_benchmarks
    
    The benchmark is this one:
    
https://github.com/franz1981/activemq-artemis/blob/3e0b4b8152bed30ba747704a653d0c034ebe19d5/tests/performance-tests/src/test/java/org/apache/activemq/artemis/tests/performance/jmh/pool/SimpleStringInternerBenchmark.java
    
    Some of results of my box:
    ```
    
    Benchmark                                                                   
           Mode  Cnt         Score         Error   Units
    SimpleStringInternerBenchmark.artemisIntern                                 
          thrpt   10  15509306.132 Â±  568180.609   ops/s
    SimpleStringInternerBenchmark.artemisIntern:Â·gc.alloc.rate                 
           thrpt   10        â 10â»â´                MB/sec
    SimpleStringInternerBenchmark.artemisIntern:Â·gc.alloc.rate.norm            
           thrpt   10        â 10â»âµ                  B/op
    SimpleStringInternerBenchmark.artemisIntern:Â·gc.count                      
           thrpt   10           â 0                counts
    SimpleStringInternerBenchmark.artemisIntern3Threads                         
          thrpt   10  44734165.507 Â± 1868110.790   ops/s
    SimpleStringInternerBenchmark.artemisIntern3Threads:Â·gc.alloc.rate         
           thrpt   10         0.006 Â±       0.016  MB/sec
    SimpleStringInternerBenchmark.artemisIntern3Threads:Â·gc.alloc.rate.norm    
           thrpt   10        â 10â»â´                  B/op
    SimpleStringInternerBenchmark.artemisIntern3Threads:Â·gc.count              
           thrpt   10           â 0                counts
    SimpleStringInternerBenchmark.guavaInterner                                 
          thrpt   10   6231479.494 Â±  313670.700   ops/s
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.alloc.rate                 
           thrpt   10       443.572 Â±      22.292  MB/sec
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.alloc.rate.norm            
           thrpt   10       112.000 Â±       0.001    B/op
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.churn.PS_Eden_Space        
           thrpt   10       445.183 Â±      80.501  MB/sec
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.churn.PS_Eden_Space.norm   
           thrpt   10       112.375 Â±      18.859    B/op
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.churn.PS_Survivor_Space    
           thrpt   10         0.073 Â±       0.076  MB/sec
    
SimpleStringInternerBenchmark.guavaInterner:Â·gc.churn.PS_Survivor_Space.norm   
       thrpt   10         0.019 Â±       0.020    B/op
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.count                      
           thrpt   10        44.000                counts
    SimpleStringInternerBenchmark.guavaInterner:Â·gc.time                       
           thrpt   10        56.000                    ms
    SimpleStringInternerBenchmark.guavaInterner3Threads                         
          thrpt   10  18200947.459 Â±  933389.842   ops/s
    SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.alloc.rate         
           thrpt   10      1295.337 Â±      66.617  MB/sec
    SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.alloc.rate.norm    
           thrpt   10       112.000 Â±       0.001    B/op
    
SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.churn.PS_Eden_Space    
       thrpt   10      1323.335 Â±     234.954  MB/sec
    
SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.churn.PS_Eden_Space.norm
      thrpt   10       114.500 Â±      20.365    B/op
    
SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.churn.PS_Survivor_Space
       thrpt   10         0.081 Â±       0.041  MB/sec
    
SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.churn.PS_Survivor_Space.norm
  thrpt   10         0.007 Â±       0.003    B/op
    SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.count              
           thrpt   10        27.000                counts
    SimpleStringInternerBenchmark.guavaInterner3Threads:Â·gc.time               
           thrpt   10        32.000                    ms
    ```
    Consider that It tests the case of temporal typed UUID-like SimpleString 
interning/pooling, hence a pretty intensive case for the interner I've 
implemented because it need to compute hashCode and equals of long strings (~ 
72 bytes).
    
    Some explanation:
    - score is the throughput in ops/sec
    - `artemisIntern` is the one using `SimpleString.Interner`
    - 'guavaInterner` is the one using the Guava Interner with weak References 
(the strong one is not faster TBH, probably a little slower)
    - the `3Threads` ones are testing 3 threads calling the interner 
concurrently
    
    The results are pretty clear: ~400 MB/sec of allocation rate vs 0 and a 
much higher (~ x2,5) throughput (although most of the time is spent into 
hashCode and equals computations).
    I hope to have shown better why I've designed the interner in the way I've 
done.

---

[GitHub] activemq-artemis issue #1752: ARTEMIS-1586 Reduce GC pressure due to String ...

Reply via email to