Re: How important is jspawnhelper crash detection?

Roger Riggs Tue, 15 Jul 2025 08:05:49 -0700

Hi Thomas,

Simpler is better on both sides of the protocol.

The version check will have happened before this part of the protocol sothere's no confusion about matching expectations.

I agree that removing it is preferred.


Roger


On 7/15/25 10:44 AM, Thomas Stüfe wrote:

Hi,
I am currently working on removing (eventually) the vfork mode. Beforewe can do this, we need a bit better error diagnostics. I do this bygently improving the error handling throughout the code, so that wecan generate IOExceptions based on more exact knowledge.
While working at this, I re-examined the "send aliveness ping fromjspawnhelper to parent" logic again I introduced back in 2019 as aworkaround for an obscure glibc bug with posix_spawn (seehttps://bugs.openjdk.org/browse/JDK-8223777). I found that it was notneeded anymore since the glibc was fixed, so I started removing thatworkaround (https://bugs.openjdk.org/browse/JDK-8362257).
But then it occurred to me that an obscure part of the jspawnhelperdiagnostics introduced withhttps://bugs.openjdk.org/browse/JDK-8226242 piggy-backs on thealiveness ping for its"detect-abnormal-process-termination-before-exec" capabilities. Workslike this:
A jspawnhelper starts
B jspawnhelper enters childProcess() and sends alivenes ping
C jspawnhelper does a bunch of other things
D jspawnhelper exec's
In the parent, we count abnormal child terminations that occur beforethe aliveness ping (B) as "spawn error" and print the signal number. Without the aliveness ping we could not tell apart "jspawnhelper endsabnormally due to a signal" from "jspawnhelper exec()'s the userprogram successfully, user program runs and ends abnormally due tosignal".
However, the question is how important or even useful this part really is:
- for externally sent abnormal termination signals (SIGTERM etc), fromthe caller's point of view it probably does not matter when it comes :before or after exec().- it only matters for synchronous termination signals (crashes) weourselves cause; but here it only covers crashes in a rather smallpiece of code, before the liveness ping (B). Basically, just the firstpart of jspawnhelper main(). Any crashes after the liveness ping arestill unrecognised by ProcessBuilder.start, and are instead handled bythe caller calling Process.waitFor().
There are two ways to deal with this:
We could do without the crash protection in point (A), which wouldallow us to remove the liveness ping. I would very much prefer that.It would simplify the jspawnhelper protocol and make it more robust.Because we now don't have any communication in case no error happens -there would be only a single bit of information sent back via failpipe, and only in case of an error. Fail pipe would stay quiet in caseof successful exec(). Abnormal child process termination in the firstpart of jspawnhelper would be covered by the same path that detectsabnormal child process termination in user programs - Process.waitFor().
If we determine we really need this crash detection, we could at leastimprove it: move the liveness ping to just-before the exec() call, sothat we cover all the area from A-D. Also, do it for all modes (FORK,too), to simplify coding.
Bottomline: remove an obscure and complex small mechanism that onlyhelps a bit with detecting program errors (sigsegv etc) inside thefirst part of jspawnhelper main() ?
Thanks, Thomas

Re: How important is jspawnhelper crash detection?

Reply via email to