Re: proposal to optimise the performance of the Jar utility

Xueming Shen Wed, 13 Apr 2011 23:14:47 -0700

 On 4/13/2011 4:55 PM, [email protected] wrote:

Hi Mike,
apart from compression 0 it doesn't make a lot of difference as far asI see it. As I posted earlier, converting the file name to lower caseseems (according the the NB profiler) to take more time than thecompression

File.hashCode() is slow on Windows (yes, we have to go down toString.toLowerCase() to be "correct" and yes, its slow), we have

the comment in Win32FileSystem.hashCode() just for this:-)

    public int hashCode(File f) {
        /* Could make this more efficient: String.hashCodeIgnoreCase */

return f.getPath().toLowerCase(Locale.ENGLISH).hashCode() ^1234321;

I tried to use ArrayList for the "entries" (instead of LinkedHashSet),as showed below(yes, it's total fine to use List for "creating", need a little morework for "updating", but

let's put "updating" aside for now)

    // All files need to be added/updated.
    //Set<File> entries = new LinkedHashSet<File>();
    List<File> entries = new ArrayList<File>();

and run the cf0 on rtjar, which has the rt.jar extracted (time measuredby System.currentTimeMillis()


[List]
$ ../jdk1.7.0/bin/java Main cf0 foo.jar rtjar
expand=406, create=10703
expand=406, create=10797
expand=407, create=11046

[Set]
$ ../jdk1.7.0/bin/java Main cf0 foo.jar rtjar
expand=469, create=10703
expand=469, create=10688
expand=469, create=10656

The "List" version is indeed about 15% faster than the "Set" version, in"expanding", but"expanding" is only about 5% of the over all "creating" time (I've nottried to measureyet, but I will not be surprised to find that we probably spend moretime on i/o than deflating)

So it is nice to have, but does not appear to bring us some thing"significant".


-Sherman

Bearing in mind that I have tweaked my system so that I am accessingfrom cache only, so the elapsed time is pretty much the single coretime, here are the results
the result I have re-run today (hence the delay)
in csv format
---
,cf,cf0,cf1,cf2,cf3,cf4,cf5,cf6,cf7,cf8,cf9
1.6.0_24 (1.6 vm),10255,9751,,,,,,,,,
java 1.7 release candinate,10498,9663,,,,,,,,,
orig ,10481,9707,,,,,,,,,
buffer CRC32 data,,7398,9490,9641,9565,10038,10142,10366,10310,10536,10440
---
this run shows a 25% improvement in cf0 times (9703 ms vs 7398 wthenthe tweaks are applied)cf1 (compressed level 1) takes 9490 and cf9 takes 10440, so not a hugedifference
Regards

Mike



------------------------------------------------------------------------
*From:* Mike Duigou <[email protected]>
*To:* [email protected]
*Cc:* Xueming Shen <[email protected]>; core-libs-dev Libs<[email protected]>
*Sent:* Wednesday, 13 April, 2011 23:00:26
*Subject:* Re: proposal to optimise the performance of the Jar utility
Mike, can you share the results of performance testing at variouscompression levels? Is there much difference between the levels or anapparent "sweet spot"?
For low hanging fruit for jdk 7 it might be worth considering raisingthe default compression level from 5 to 6 (the zlib default). Raisingthe level from 5 to 6 entails (by today's standards) very modestincreases in the amount of memory and effort used (16Kb additionalbuffer space for compression). In general zlib reflects size choicesthat are almost 20 years old and it may be of no measurable benefit tobe using the lower compression levels.
Mike (also)
On Apr 12 2011, at 17:48 , [email protected]<mailto:[email protected]> wrote:
> Hi Sherman,
> I have had a quick look at the current code to see what 'low hangingfruit'> there is. I appreciate that parallelizing the command in itsentirity may not be
> feasible for the first release
>
> The tests that I have run are jarring the content of the 1.7 rt.jarwith varying> compression levels. Each jar is run as an child process 6 times andthe average
> of the last 5 is taken. Tests are pretty much CPU bound on a single core
>
> 1. The performance of the cf0 (create file with no compression) pathcan be
> improved for the general case if the file is buffered.
> In my test env (windows 7 64bit) this equates to a 17% time performance
> improvement in my tests. In the existing code the file is readtwice, once to> calc the CRC and once to write the file to the Jar file. This changewould
> buffer a single small file at a time (size < 100K)
>
> 2. It is also a trivial fix to allow different compression levels,rather than
> stored and default
>
> After that it is harder to gain performance improvements withoutstructural
> change or more discussion
>
> 3. The largest saving after that is in the management of the'entries' Set, as> the hashcode of the File is expensive (this may not apply to otherfilesystems).
> the management of this map seems to account for more cpu than Deflator!
> I cannot see the reason for this data being collected. I am probablymissing the
> obvious ...
>
> 4. After that there is just the parallelisation of the jar utilityand the
> underlying stream
>
> What is the best way to proceed
>
> regards
>
> Mike
>
>
>
> ________________________________
> From: Xueming Shen <[email protected]<mailto:[email protected]>>> To: [email protected]<mailto:[email protected]>
> Sent: Wednesday, 6 April, 2011 19:04:25
> Subject: Re: proposal to optimise the performance of the Jar utility
>
> Hi Mike,
>
> We are in the final stage of the JDK7 development, work like this is
> unlikely to get in the
> last minute. I have filed a CR/RFE to trace this issue, we can use this
> CR as the start
> point for the discussion and target for some jdk7 update releases orJDK8.
>
> 7034403: proposal to optimise the performance of the Jar utility
>
> Thanks,
> Sherman
>
>
> On 04/05/2011 04:42 PM, [email protected]<mailto:[email protected]> wrote:
>> Hi,
>> Not sure if this is too late for Java 7 but I have made someoptimisations for
>> a
>> client to improve the performance of the jar utility in theirenvironment, and
>> would like to promite then into the main code base
>>
>> The optimisations that I have performed are
>>
>> 1. Allowing the Jar utility to have other compression levels(currently it
>> allows default (5) only)
>> 2. Multi-threading, and pipelining the the file information and access
>> 3. multi-threading the compression and file writing
>>
>> A little background
>> A part of the development process of where I work they regularlyJar the
>> content
>> of the working projects as part of the distribution to remotesystems. This is
>> a
>> large and complex code base of 6 million LOC and growing. The Jarfile ends up>> compressed to approx 100Mb, Uncompressed the jar size is approx245mb, about
>> 4-5
>> times the size of rt.jar.
>>
>> I was looking at ways to improve the performance as this activityoccurs
>> several
>> times a day for dozens of developers
>>
>> In essence when compressing a new jar file the jar utility issingle threaded
>> and staged. Forgive me if this is an oversimplification
>>
>> first it works out all of the files that are specified, bufferingthe file
>> names, (IO bound)
>> then it iterates through the files, and for each file, it load thefile>> information, and then the file content sending it to aJarOutputStream, (CPU
>> bound or IO bound depending on the IO speed)
>>
>> The JarOutputStream has a compression of 0 (just store) or 5 (thedefault),
> and
>> the jar writing is single threaded by the design of the JarOutputStream
>>
>> The process of creation of a Jar took about 20 seconds in windowswith the
> help
>> of an SSD, and considerable longer without one, and was CPU boundto one CPU
>> core
>>
>> ----
>> The changes that I made were
>> 1. Allow deferent compression levels (for us a compression level of 1
> increases
>> the file size of the Jar to 110 Mb but reduces the CPU load incompression to
>> approx 30% of what it was (rough estimate)
>> 2. pipelining the file access
>> 2.1 one thread is started for each file root (-C on the Jarcommand line),
>> which scans for files and places the file information into a blocking
>> queue(Q1),
>> which I set to abretrary size of 200 items
>> 2.2 one thread pool of 10 threads reads the file informationfrom the queue>> (Q1) and buffers the file content to a specified size (again Ispecified an>> arbetrary size limit of 25K for a file, and places the bufferedcontent into a
>> queue(q2) (again arbetrary size of 10 items
>> 2.3 one thread takes the filecontent from Q2 and compresses itor checksums>> it and adds it the the JarOutputStream. This process is singlethreaded due
> to
>> the design of the JarOutputStream
>>
>> some other minor performance gain occurred by increasing the bufferon the
>> output stream to reduce the IO load
>>
>> The end result is that the process takes about approx 5 seconds inthe same
>> configuration
>>
>> The above is in use in production configuration for a few months now
>>
>> As a home project I have completed some enhancements to theJarOutputStream,
>> and
>> produced a JarWriter that allows multiple threads to work concurrently
>> deflating
>> or calculating checksums, which seems to test OK for the test casesthat Ihave>> generated,and successfully loads my quad core home dev machine onall cores.>> Each thread allocates a buffer, and the thread compresses a filesinto the>> buffer, only blocking other threads whenthe buffer is written tothe output>> (which is after the compression is complete, unless the file is toolarge to
>> compress
>>
>> This JarWriter is not API compatable with the JarOutputStream, itis not a
>> stream. It allows the programmer to write a record based of the file
>> information
>> and an input stream, and is threadsafe. It is not a drop inreplacement for
>> JarOutputStream
>> I am not an expert in the ZIp file format, but much of the code from
>> ZipOutputStream is unchanged, just restructured
>> ---
>> I did think that there is some scope for improvement, that I havenot looked
> at
>> a. thresholding of file size for compression (very small files dontcompress
>> well
>> b. some file types dont compress well (e.g. png, jpeg) as they havebeen
>> compressed already)
>> c. using NIO to parallelise the loading of the file information orcontent>> d. some pre-charging of the deflator dictionary (e.g. a class filecontains
> the
>> strings of the class name and packages), but this would make the format
>> incompatable with zip, and require changes to the JVM to be useful,and is a>> long way from my comform zone, or skill set. This would reduce thefile size
>>
>> --
>> What is the view of the readers. Is this something, or at leastsome parts of>> this that could be incorperated into Java 7 or is this too late onthe dev
>> cycle
>>
>> regards
>>
>> Mike

Re: proposal to optimise the performance of the Jar utility

Reply via email to