On Thu, 13 Mar 2025 15:21:33 GMT, Per Minborg <[email protected]> wrote:
>> As we advance, converting older JDK code to use the relatively new FFM API
>> requires system calls that can provide `errno` and the likes to explicitly
>> allocate a `MemorySegment` to capture potential error states. This can lead
>> to negative performance implications if not designed carefully and also
>> introduces unnecessary code complexity.
>>
>> Hence, this PR proposes adding a JDK internal method handle adapter that can
>> be used to handle system calls with `errno`, `GetLastError`, and
>> `WSAGetLastError`.
>>
>> It relies on an efficient carrier-thread-local cache of memory regions to
>> allide allocations.
>>
>>
>> Here are some benchmarks that ran on a platform thread and virtual threads
>> respectively (M1 Mac):
>>
>>
>> Benchmark Mode Cnt Score
>> Error Units
>> CaptureStateUtilBench.OfVirtual.adaptedSysCallFail avgt 30 24.330
>> ? 0.820 ns/op
>> CaptureStateUtilBench.OfVirtual.adaptedSysCallSuccess avgt 30 8.257
>> ? 0.117 ns/op
>> CaptureStateUtilBench.OfVirtual.explicitAllocationFail avgt 30 41.415
>> ? 1.013 ns/op
>> CaptureStateUtilBench.OfVirtual.explicitAllocationSuccess avgt 30 21.720
>> ? 0.463 ns/op
>> CaptureStateUtilBench.OfVirtual.tlAllocationFail avgt 30 23.636
>> ? 0.182 ns/op
>> CaptureStateUtilBench.OfVirtual.tlAllocationSuccess avgt 30 8.234
>> ? 0.156 ns/op
>> CaptureStateUtilBench.adaptedSysCallFail avgt 30 23.918
>> ? 0.487 ns/op
>> CaptureStateUtilBench.adaptedSysCallSuccess avgt 30 4.946
>> ? 0.089 ns/op
>> CaptureStateUtilBench.explicitAllocationFail avgt 30 42.280
>> ? 1.128 ns/op
>> CaptureStateUtilBench.explicitAllocationSuccess avgt 30 21.809
>> ? 0.413 ns/op
>> CaptureStateUtilBench.tlAllocationFail avgt 30 24.422
>> ? 0.673 ns/op
>> CaptureStateUtilBench.tlAllocationSuccess avgt 30 5.182
>> ? 0.152 ns/op
>>
>>
>> Adapted system call:
>>
>> return (int) ADAPTED_HANDLE.invoke(0, 0); // Uses a MH-internal pool
>> ```
>> Explicit allocation:
>>
>> try (var arena = Arena.ofConfined()) {
>> return (int) HANDLE.invoke(arena.allocate(4), 0, 0);
>> }
>> ```
>> Thread Local allocation:
>>
>> try (var arena = POOLS.take()) {
>> return (int) HANDLE.invoke(arena.allocate(4), 0, 0); // Uses a
>> manually specified pool
>> }
>> ```
>> The adapted system call exhibits a ~4x performance improvement ove...
>
> Per Minborg has updated the pull request with a new target base due to a
> merge or a rebase. The pull request now contains 34 commits:
>
> - Merge master
> - Add test for woven allocation
> - Merge branch 'master' into errno-util3
> - Use lazy initialization of method handles
> - Clean up visibility
> - Merge branch 'master' into errno-util3
> - Add @ForceInline annotations
> - Add out of order test for VTs
> - Allow memory reuse for several arenas
> - Remove file
> - ... and 24 more: https://git.openjdk.org/jdk/compare/4e51a8c9...6d782a84
src/java.base/share/classes/jdk/internal/foreign/CaptureStateUtil.java line 65:
> 63: // returnType in {int.class | long.class}
> 64: // stateName can be anything non-null but should be in
> {"GetLastError" | "WSAGetLastError"} | "errno")}
> 65: private record BasicKey(Class<?> returnType, String stateName) {
Please at least add an `hashCode` - this avoids a hefty overhead from code
generation in ObjectMethods.bootstrap, unavoidable whenever we put entries into
a hash map.
-------------
PR Review Comment: https://git.openjdk.org/jdk/pull/23765#discussion_r2056408940