Re: RFR(L) 8237354: Add option to jcmd to write a gzipped heap dump

David Holmes Tue, 11 Feb 2020 14:51:27 -0800

On 11/02/2020 10:55 pm, Yasumasa Suenaga wrote:

Hi Ralf,
On 2020/02/11 0:33, Schmelter, Ralf wrote:
Hi Yasumasa,
   You can use `DCmdArgument<jlong>` for the -gz option.
That is what I originally tried. But then you always have to supply a compression level (just specifying -gz doesn't work). Since I would expect most users never to care about the compression level, I switched to a string option, which can handle this pattern.
I think you can modify DCmdArgument<jlong>::parse_value() to allow the operation without an argument. Or you can add a new impl function for integer types which can handle a default value.
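A minimal sketch of the "optional value with a default" idea, written in standalone C++ rather than against the real DCmd framework. The function name `parse_gz_level`, the default level of 1 and the accepted range 1..9 are assumptions made for illustration only; they are not taken from the patch under review.

```cpp
#include <cassert>
#include <cerrno>
#include <cstdlib>
#include <optional>
#include <string>

// Hypothetical default used when -gz is given without a level (assumption).
constexpr long kDefaultGzLevel = 1;

// Parses the text that follows "-gz". An empty string means the flag was
// given without a value, in which case the default level is returned.
// Returns std::nullopt for malformed input or a level outside 1..9
// (the range is an assumption for this sketch).
std::optional<long> parse_gz_level(const std::string& value) {
    if (value.empty()) {
        return kDefaultGzLevel;
    }
    errno = 0;
    char* end = nullptr;
    const long level = std::strtol(value.c_str(), &end, 10);
    assert(end != nullptr);  // strtol always sets end when given a non-null pointer
    if (errno != 0 || end == value.c_str() || *end != '\0') {
        return std::nullopt;  // not a number, or trailing garbage such as "3x"
    }
    if (level < 1 || level > 9) {
        return std::nullopt;
    }
    return level;
}
```

With this shape, a bare `-gz` and an explicit `-gz=6` both go through the same integer-typed path, and only invalid input is rejected.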
_nr_of_threads, _id_to_write, _current in CompressionBackend should be declared `volatile` at least.
I don't think that is needed. Apart from the initialization, they are only changed under lock protection.
I am concerned about compiler optimization.
They are class members and they are used in a `while(true)` loop.
Of course the problem would not appear with every C++ compiler, but I guess it is safer if `volatile` is added.

As long as the variable is only accessed under the lock, volatile is not needed. If a compiler hoisted accesses outside of locked regions, then all MT code could be broken.
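To illustrate the point, here is a small sketch in standard C++ (using `std::mutex`, not HotSpot's own lock classes). The class `BlockCounter` and the function `drain` are hypothetical names invented for this example. The counter is a plain `int` that is read and written only while the lock is held, and worker threads poll it from a `while (true)` loop.

```cpp
#include <cassert>
#include <mutex>
#include <thread>
#include <vector>

// Hypothetical stand-in for a backend that hands out ids to worker threads.
class BlockCounter {
public:
    explicit BlockCounter(int limit) : _limit(limit) { assert(limit >= 0); }

    // Returns the next id, or -1 once all ids have been handed out.
    // _current is only touched while _lock is held.
    int next() {
        std::lock_guard<std::mutex> guard(_lock);
        return (_current < _limit) ? _current++ : -1;
    }

private:
    std::mutex _lock;
    int _current = 0;  // plain int, deliberately not volatile
    const int _limit;
};

// Starts nr_of_threads workers that loop until the ids run out and
// returns how many ids were handed out in total.
int drain(int limit, int nr_of_threads) {
    BlockCounter counter(limit);
    std::vector<int> claimed(nr_of_threads, 0);  // one slot per worker
    std::vector<std::thread> workers;
    for (int t = 0; t < nr_of_threads; t++) {
        workers.emplace_back([&counter, &claimed, t] {
            while (true) {
                if (counter.next() < 0) {
                    break;
                }
                claimed[t]++;
            }
        });
    }
    for (auto& w : workers) {
        w.join();
    }
    int total = 0;
    for (int c : claimed) {
        total += c;
    }
    return total;
}
```

Acquiring and releasing the mutex already orders the accesses to `_current`, so each id is handed out exactly once without any `volatile` qualifier.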


David
-----

BTW, how much does the processing time differ between single-threaded and multi-threaded?
I've benchmarked an example, which creates a ~31 GB uncompressed hprof file, with a VM which doesn't use any background threads. Here are the sizes of the created files, the compression levels and the time spent:
Uncompressed, 31.6 G, 71 sec
gzipped level 1, 7.57 G, 463 sec (x6.5)
gzipped level 3, 7.10 G, 609 sec (x8.6)
gzipped level 6, 6.49 G, 1415 sec (x19.9)
So even the fastest gzip compression makes writing the dump at least 5 times as slow.
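The slowdown factors in the list above are simply each run's time divided by the uncompressed time. A short sketch of that arithmetic, using only the sizes and times quoted in this mail (the `Run` struct and the helper names are invented for the example):

```cpp
#include <cassert>

// One benchmark run: label, resulting file size in GB, wall time in seconds.
struct Run {
    const char* label;
    double size_gb;
    double seconds;
};

// Figures copied from the benchmark results quoted above.
const Run kRuns[] = {
    {"uncompressed",    31.6,   71.0},
    {"gzipped level 1", 7.57,  463.0},
    {"gzipped level 3", 7.10,  609.0},
    {"gzipped level 6", 6.49, 1415.0},
};

// How many times slower a run is than the baseline run.
double slowdown(const Run& run, const Run& baseline) {
    assert(baseline.seconds > 0.0);
    return run.seconds / baseline.seconds;
}

// How many times smaller the run's file is than the baseline file.
double size_reduction(const Run& run, const Run& baseline) {
    assert(run.size_gb > 0.0);
    return baseline.size_gb / run.size_gb;
}
```

For example, `slowdown(kRuns[1], kRuns[0])` is 463 / 71, which rounds to the x6.5 shown for level 1.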
Also I want to know what number ParallelGCThreads is set to.
ParallelGCThreads seems to affect the number of threads used for GZip compression.
Originally, I tried to use the WorkGang (CollectedHeap::get_safepoint_workers()) of the GC to do the work. But this wouldn't work because Shenandoah could not iterate the heap from a worker thread. So I've opted to start the needed threads separately for the duration of the heap dump. I've used ParallelGCThreads as the maximum number of threads, since this is what would be used for a GC too. So it should not clog up the machine more than a GC. Maybe it would be even better to additionally limit the threads by the compression level.
Thanks!

Yasumasa (ysuenaga)
Best regards,
Ralf Schmelter

-----Original Message-----
From: Yasumasa Suenaga <[email protected]>
Sent: Samstag, 8. Februar 2020 14:46
To: Schmelter, Ralf <[email protected]>; OpenJDK Serviceability <[email protected]>
Cc: [email protected]
Subject: Re: RFR(L) 8237354: Add option to jcmd to write a gzipped heap dump
Hi Ralf,


- diagnosticCommand.cpp
   You can use `DCmdArgument<jlong>` for the -gz option.
If you want to use a smaller type (e.g. int, unsigned char), I guess you need to modify the GenDCmdArgument class.
- heapDumper.cpp
_nr_of_threads, _id_to_write, _current in CompressionBackend should be declared `volatile` at least.
   (Other values need to be checked)
BTW, how much does the processing time differ between single-threaded and multi-threaded?
Also I want to know what number ParallelGCThreads is set to.
ParallelGCThreads seems to affect the number of threads used for GZip compression.


Thanks,

Yasumasa
