Hello Ralf,

Very interesting analysis! I have spent a lot of time trying to get the piping to log files, while keeping out/err separation working with as little performance impact as possible. I was aware of problem 3, which causes some annoying output ordering sometimes, but I had not encountered 1 and 2 before. Checking my latest local Windows build, I do indeed have some truncated files, not as many as you, but still 79 (less virus scanning). That would certainly make incremental builds unreliable.

I tried your suggestion for problem 1 and it works for me too. We should definitely get that fix in. I have filed JDK-8207001. Do you mind if I use your patch for it?

I agree on the only solution for the 3rd problem, but I would not want to go there. Delaying output can have lots of weird consequences, if you for example interrupt a build with ctrl-c.

Another option would be to revert to our previous solution, where we used this script: http://hg.openjdk.java.net/jdk8u/jdk8u/file/090b27d25e19/common/bin/logger.sh The problem with that was performance as it created more extra shell instances for each compile command.

The only solution to fix problem 2 would be to stop separating out/err. I wonder how big of a problem that actually is though?

/Erik


On 2018-07-10 05:34, Schmelter, Ralf wrote:
Hi,

while building openjdk on Windows, I've stumbled over a problem which leads to 
truncated *.obj.log files, which in turn leads to empty dependency files (which 
are generated from the contents of the *.obj.log files on Windows). In the end, 
the incremental build will not be reliable.

The reason seems to be this code from the ExecuteWithLog macro in 
common/MakeBase.gmk:
( $(strip $2) > >($(TEE) $(strip $1).log) 2> >($(TEE) $(strip $1).log >&2) || \
     ...

This executes the command given in $2 and tries to write stdout/stderr to the 
file given by $1 and to 'normal' stdout/stderr via process substitution. But it 
has a few problems.

The first problem occurs when the stdout-tee truncates the file, then it writes 
some of the input it got into the file and only after this the stderr-tee has 
run far enough to truncate the same file, which leads to the loss of at least a 
part of stdout.

Another problem appears when the command writes to stdout and stderr. Since 
each tee has its own file position, they would overwrite one another.

The third problem arises because when the command $2 has ended, the bash shell 
will not wait for the process substitutions (the two tee commands) to end and 
instead will process the next command. Here we try to copy the log file in case 
of an error. But the tee commands might not have written the whole input they 
have got, so not the whole content of the file might get copied.


In my case it seems to be the first problem which I see most. When doing a 
fresh build on my Windows machine (which has quite an aggressive virus scanner 
active), I get something like the following:
find . -name "*.obj.log" -size 0 | wc -l
424

find . -name "*.obj.log" -not -size 0 | wc -l
1603

Which means about 20% of the *.obj.log files are truncated.

When I replace the code above with:
   ( $(strip $2) > >($(TEE) $(strip $1).log) 2>&1 || \
     ...

I see no more truncated *.obj.log files. The cost is that stderr is redirected to 
stdout (since the >&2 of the second tee is missing). The last problem can be 
avoided by using something like:
   ($(RM) -f $(strip $1).log && $(strip $2) > >($(TEE) -a $(strip $1).log) 2> >($(TEE) 
-a $(strip $1).log >&2) || \
     ...

This also leads to no truncated files, but the relative order of stdout/stderr 
might be wrong.

If the third problem should be solved too, the only solution I've found was to redierect 
via >  $(strip $1).log  2>&1 to the file and then later use 'cat' to write the 
output to stdout (losing the distinction between stdout and stderr).

Best regards,
Ralf

Reply via email to