Hi all,

fwiw, here's the current branch with these changes so you can have a look:

https://github.com/tschatzl/jdk/tree/submit/8342382-card-table-instead-of-dcq

I do not think there will be any significant changes in the design (assuming you and other reviewers also like it :P). Aarch64 and x86(-64) are fine imo (for months now...), all but S390 need more testing.

As for the S390 port, I admit I gave up getting a cross-compilation and emulator set up for S390, and its (macro-)assembly seemed so much different that the other ports, and pushing it through GHA repeatedly until it at least compiles takes too much time too.

There is a big FIXME in its generate_post_barrier_fast_path() in g1BarrierSetAssembler_s390.cpp file; other than that method implementation, only cleanup/removal of unnecessary code like for the other ports should be required.

All other (very few) FIXMEs are about removing debug log messages or code that can be removed when all ports are done.

The reason for the delay for posting the PR is that the JEP text needs some more refactoring until we can submit and post the official PR (and there is some time until the JDK 25 fork anyway).

The implementation CR also already contains some more information about the implemetation (https://bugs.openjdk.org/browse/JDK-8342382), which will also be updated a little before asking for reviews.

Hth,
  Thomas

On 22.10.24 10:26, Thomas Schatzl wrote:
Hi all,

  we in the GC team have been experimenting with new post-write barriers for G1 to reduce the throughput difference between G1 and Parallel GC.

After quite a few attempts we think we found a balanced solution between overall complexity and impact on throughput and latency.

The JEP draft in [1] summarizes the new write barrier and other relevant changes; feel free to also chime in in the a public discussion thread [2] if there are any questions.

Summarizing, throughput is always better or the same as before, sometimes getting very close to Parallel GC. There are no changes to latency with pause times being a bit better at the cost of some native memory.

This email is mostly an advance notice that unfortunately the new write barrier is incompatible with the old one, so effort is required to make G1 work again on all platforms. Hence we would like to ask you to help us with them.

We intend to post a PR containing the implementation in the next (one or two) weeks (for this CR [3]); our goal would be to merge sometime for (early?) JDK 25 together with all barrier changes completed.

At Oracle we will as usual fully support the x64_64 and aarch64 platforms.

There are (seemingly) working implementations for riscv, aarch32/arm, and x86, which would need further testing/checking, but completely missing are barriers for PPC and S390 (I tried my luck with those, but gave up a after a bit ...).

Thanks in advance,
   Thomas

[1] https://bugs.openjdk.org/browse/JDK-8342382
[2] https://mail.openjdk.org/pipermail/hotspot-gc-dev/2024-October/049944.html
[3] https://bugs.openjdk.org/browse/JDK-8342382

Reply via email to