[ 
https://issues.apache.org/jira/browse/IO-468?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Neidhart updated IO-468:
-------------------------------
    Attachment: micro-benchmark.zip

I took the time to rewrite my benchmark using JMH.

The results are somewhat similar to the earlier ones, but more accurate (Score is in ns/op):

{noformat}
Benchmark                         (length)  Mode  Cnt       Score       Error  Units
IOUtilsBenchmark.copyFixedArray        100  avgt   20     255.966 ±     1.580  ns/op
IOUtilsBenchmark.copyFixedArray       1000  avgt   20     349.467 ±     3.376  ns/op
IOUtilsBenchmark.copyFixedArray      10000  avgt   20    1735.641 ±    47.352  ns/op
IOUtilsBenchmark.copyFixedArray     100000  avgt   20   20991.194 ±   253.866  ns/op
IOUtilsBenchmark.copyFixedArray    1000000  avgt   20  253168.783 ± 35859.304  ns/op
IOUtilsBenchmark.copyNormal            100  avgt   20    2039.469 ±    60.206  ns/op
IOUtilsBenchmark.copyNormal           1000  avgt   20    2210.150 ±    69.705  ns/op
IOUtilsBenchmark.copyNormal          10000  avgt   20    3467.281 ±    76.302  ns/op
IOUtilsBenchmark.copyNormal         100000  avgt   20   23033.581 ±   323.091  ns/op
IOUtilsBenchmark.copyNormal        1000000  avgt   20  251249.970 ± 23086.488  ns/op
IOUtilsBenchmark.copyThreadLocal       100  avgt   20     263.559 ±     0.985  ns/op
IOUtilsBenchmark.copyThreadLocal      1000  avgt   20     359.917 ±     4.831  ns/op
IOUtilsBenchmark.copyThreadLocal     10000  avgt   20    1795.665 ±    43.034  ns/op
IOUtilsBenchmark.copyThreadLocal    100000  avgt   20   20777.176 ±   206.086  ns/op
IOUtilsBenchmark.copyThreadLocal   1000000  avgt   20  251769.416 ± 19318.802  ns/op

{noformat}

To run the benchmark, do the following on the command line:

{noformat}
  mvn clean install
  java -jar target/benchmarks.jar
{noformat}
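
For readers who do not want to unpack the archive, the three variants compared in the table can be sketched roughly as follows. This is only an assumption about their shape, not the actual code in micro-benchmark.zip: the class name CopyStrategies, the helper copy and the 4096-byte buffer size are made up for illustration, and the JMH harness is left out.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

// Hypothetical sketch of the three copy strategies; not the benchmark's real code.
final class CopyStrategies {

    // Assumed buffer size for illustration.
    static final int BUFFER_SIZE = 4096;

    // One shared buffer: no allocation per call, but NOT safe for concurrent use.
    private static final byte[] FIXED = new byte[BUFFER_SIZE];

    // One buffer per thread, allocated lazily on first use by that thread.
    private static final ThreadLocal<byte[]> LOCAL =
            ThreadLocal.withInitial(() -> new byte[BUFFER_SIZE]);

    private CopyStrategies() {
    }

    // Common copy loop; returns the number of bytes copied.
    static long copy(InputStream in, OutputStream out, byte[] buffer) throws IOException {
        long total = 0;
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
            total += n;
        }
        return total;
    }

    // Allocates (and zeroes) a fresh buffer on every call.
    static long copyNormal(InputStream in, OutputStream out) throws IOException {
        return copy(in, out, new byte[BUFFER_SIZE]);
    }

    // Reuses the single shared buffer.
    static long copyFixedArray(InputStream in, OutputStream out) throws IOException {
        return copy(in, out, FIXED);
    }

    // Reuses the calling thread's buffer.
    static long copyThreadLocal(InputStream in, OutputStream out) throws IOException {
        return copy(in, out, LOCAL.get());
    }
}
```

If the variants look roughly like this, the per-call allocation in copyNormal is a fixed cost, which would be consistent with it being slower for short inputs and indistinguishable from the others at 1000000 bytes.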

> Avoid allocating memory for method internal buffers, use threadlocal memory 
> instead
> -----------------------------------------------------------------------------------
>
>                 Key: IO-468
>                 URL: https://issues.apache.org/jira/browse/IO-468
>             Project: Commons IO
>          Issue Type: Improvement
>          Components: Utilities
>    Affects Versions: 2.4
>         Environment: all environments
>            Reporter: Bernd Hopp
>            Priority: Minor
>              Labels: newbie, performance
>             Fix For: 2.5
>
>         Attachments: PerfTest.java, micro-benchmark.zip, 
> monitoring_with_threadlocals.png, monitoring_without_threadlocals.png, 
> performancetest.ods, performancetest_weakreference.ods
>
>   Original Estimate: 12h
>  Remaining Estimate: 12h
>
> In a lot of places, we allocate new buffers dynamically via new byte[]. This
> is a performance drawback, since many of these allocations could be avoided if
> we used thread-local buffers that can be reused. For example, consider
> the following code from IOUtils.java, line 2177:
> return copyLarge(input, output, inputOffset, length, new byte[DEFAULT_BUFFER_SIZE]);
> This code allocates new memory for every copy operation. That memory is not used
> outside of the method and could easily and safely be reused, as long as it is
> thread-local. So instead of allocating new memory, a new utility class could
> provide a thread-local byte array like this:
> byte[] buffer = ThreadLocalByteArray.ofSize(DEFAULT_BUFFER_SIZE);
> return copyLarge(input, output, inputOffset, length, buffer);
> I have not measured the performance benefits yet, but I would expect them to
> be significant, especially when the streams themselves are not the performance
> bottleneck.
> Git PR is at https://github.com/apache/commons-io/pull/6/files
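
A utility class along the lines described in the issue might look like the sketch below. It is an illustration only and not the code from the pull request: the behaviour of ofSize (returning a buffer that is at least the requested size and growing it on demand) is an assumption, since the description does not specify it.

```java
// Hypothetical sketch of the utility named in the issue; the real PR may differ.
final class ThreadLocalByteArray {

    // Holds at most one buffer per thread; empty until the thread first asks for one.
    private static final ThreadLocal<byte[]> BUFFER = new ThreadLocal<>();

    private ThreadLocalByteArray() {
    }

    // Returns the calling thread's buffer, with length >= size (assumed semantics).
    // The contents are whatever the previous use left behind; they are not zeroed.
    static byte[] ofSize(int size) {
        if (size < 0) {
            throw new IllegalArgumentException("size must be non-negative: " + size);
        }
        byte[] buffer = BUFFER.get();
        if (buffer == null || buffer.length < size) {
            // First request on this thread, or the cached buffer is too small: replace it.
            buffer = new byte[size];
            BUFFER.set(buffer);
        }
        return buffer;
    }
}
```

With this shape, two nested callers on the same thread would receive the same array, so a buffer obtained this way should not be held across a call that may itself request one. The cached array also stays reachable for as long as the thread lives.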



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
