[fpc-devel] Deeper problem with Internal Error 200309201

J. Gareth Moreton Sat, 13 Jul 2019 05:13:19 -0700

Hi everyone,

So my patch to fix #32913<https://bugs.freepascal.org/view.php?id=32913> was unfortunatelyrejected because it relied on a slightly hacky feature in the form of anew state variable that, currently, is only used to patch thisparticular issue, and as Florian has hinted in the issue notes, a deeperfix needs to be investigated.


What I can put together so far is that if you have a construct like this...

try
except
  try
    [b]
  finally
    [c]
  end;
end;

...and [b] is not empty ([c] doesn't matter), then the compiler willtrigger Internal Error 200309201. What happened is that during thesimplify stage of ttryexceptnode, if the try block is empty (representedby the 'left' node), the compiler would replace the entire constructwith a 'nothing' node. Logically a sound choice, because it'simpossible for an exception to be raised in this instance. The problemcomes that the except section contains a try..finally block which setsup a stack unwind block. It's a little hard to follow at this point,but what it culminates in is that there's a non-null exception filter(try..finally and try..except get bundled together for the stack unwind)that the compiler is not expecting later on, which causes a sanity checkto fail and trigger the internal error.

One approach is to simply not transform the except section into anothing node, but this adds significant overhead to the compiledbinaries and incur a performance penalty by having an unnecessary stackunwind (such try..except blocks may occur in the case of inlinedfunctions yielding no code, for example).

My first attempt at fixing this was to move the 'simplify' code into'pass_1' for ttryexceptnode, and if it detected an empty try section, tosimply not run "pass_1" on anything in the except section, so no code isever put into the exception filter - I argued this logic to myselfbecause, as already explained, this exception handler would never beexecuted. This worked, but caused a regression in tests/tbf/tb0089.ppand tests/tbf/tb0090.pp - these tests had an empty try section in theirtry..except blocks, because all they had to test was to attempt to use'goto' from the except section to jump to a label outside of the block. This particular syntactic check was only performed in "pass_1", so bynot running it for those nodes, the syntax check was skipped and thetests erroneously compiled successfully.

Since this approach resulted in a regression, it wasn't acceptable andso I had to find some other means to prevent code entering the exceptionfilter without completely overhauling the node parser. This led to whatis how the "current_nodes_dead" flag, set if the try section is emptyand designed with future expansion in mind (if you know nodes areunreachable, you can skip doing any kind of code generation or evendelete them entirely once you do the syntactic checking). Afterwriting the necessary code in the ttryfinallynode class, I got the issuefixed at long last.

It goes without saying that I was upset that the patch was rejected,especially as this particular issue has been on my books for literally ayear now, and I had no other viable solution to offer since everythingelse I had tried resulted in an error elsewhere. However, now thatFlorian's actually responded and hinted at a willingness to look for abigger solution, I'm able to look at it more objectively. Having a flagthat's only used to fix one feature is a little dangerous, even with aproposal for future use, because chances are it will be forgotten aboutand people will just introduce other flags when and where they need.

I would like some discussion on this with the administrative teambecause this does require some careful design if we're going to do thisproperly. Building on Florian's suggestion, I would like to proposesplitting "pass_1" into "pass_syntax" and a pass that transforms thenodes (I would name it "pass_transform", but this doesn't sound'important' enough, given it is effectively compiling the nodes). "pass_syntax" would be a public method declared in tnode as thefollowing "function pass_syntax: Boolean; virtual;" - if it returnsTrue, then everything was fine; if not, it returns False and this can beused to set the error flag. By being defined this way, this is noability for the node to transform itself, but it can set private fieldsbased on the syntax it finds (by setting private fields, the "firstpass" won't have to repeat some of the work needed to determine how totransform the nodes).

There are two ways this can then be implemented... one way is to havethe syntax pass be completely separate from the now badly-named "firstpass" and just do syntax checking on all the nodes. If any syntaxerrors are found, then the error flag can be set and "firstpass" willskip these nodes already. After this syntax pass is done, then thecompiler can move onto the "first pass". The drawback to this is thatit will likely increase compilation time quite noticably, since thecompiler will be running an additional traversal through the entire nodetree.

The second way is to have the syntax pass as the first step of"firstpass", so an extra traversal is not required. However, this willbe harder to develop and maintain because care has to be taken that allnodes are syntax-checked but not transformed if they would result indead or unexpected code (as with internal error 200309201). Given thiswould require a flag for a 'dead node', we may end up in a similarsituation as with the "hacky" patch. However, one thought that occurredto me is that we can make a new flag for tnodeflag named something like'nf_dead', since this doesn't add any new fields, and if this flag isset, then "pass_1" and "simplify" are not called, and it cascades theflag to the child nodes. This would be the 32nd flag for tnodeflag, soit reaches the upper limit for a small set - something to keep in mindif it gets expanded later.

On the surface, the first implementation suggestion feels like thecleaner option, but I don't know how much of a penalty it will add tothe compiler. Personally I would like to attempt a design for thesecond implementation to keep the number of passes down to a minimum andthe compiler reasonably fast (also my motivation behind the x86_64peephole optimizer overhaul). But that's just my opinion.

On an additional note, I do wonder if it's possible to merge "pass_1"and "simplify", since they are treated pretty much identically in the"firstpass" routine - if either of them return something other than nil,it is considered to be a node transformation, running this block of code(there are two copies, one for each call):


p.free;
p := hp;
firstpass(p);

(hp contains the return value of whatever method was just called) Thismay be impractical because "simplify" is called elsewhere for some nodes.

At least that's what I've thought about so far. Where do we go fromhere? I sense for this task, I should write up a PDF design spec like Idid with the optimizer overhaul, but this is something that needs a fairbit of discussion. And implementation will be a fairly mammoth taskbecause it would require introducing the new method to every single nodetype.

Thank you for your time, and I hope we can find the best solution tothis issue.


Gareth aka. Kit



---
This email has been checked for viruses by Avast antivirus software.
https://www.avast.com/antivirus

_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

[fpc-devel] Deeper problem with Internal Error 200309201

Reply via email to