On 11/10/2012 11:23 AM, David Held wrote:
On 11/9/2012 11:38 PM, Walter Bright wrote:
[...]
I'll often use printf because although the debugger knows types, it rarely
shows the values in a form I particularly need to track down a problem, which
tends to be different every time. And besides, throwing in a few printfs is
fast and easy, whereas setting break points and stepping through a program is
an awfully tedious process. Or maybe I never learned to use a debugger
properly, which is possible since I've been forced to debug systems where no
debugger whatsoever was available - I've even debugged programs using an
oscilloscope, making clicks on a speaker, blinking an LED, whatever is
available.
You're making my point for me, Walter! I have seen some people whiz through
the debugger like they live in it, but I would say that level of familiarity
tends to be the exception, rather than the rule. And, it always makes me a
little uncomfortable when I see it (why would someone *need* to be that
proficient with the debugger...?). Firing up the debugger, for many people,
is a relatively expensive process, because it isn't something that good
programmers should be doing very often (unless you subscribe to the school
which says that you should always step through new code in the
debugger...consider this an alternative to writing unit tests).
I think we are agreeing on that.
BTW, we are also specifically talking about debugging DMD source code. I would
expect someone capable of understanding compiler guts to be capable of using a
debugger to generate a stack trace, so for this specific case I don't see any
need to cater to compiler devs who are incapable/afraid/ignorant of using a
debugger.
Note that getting a call stack for a seg fault does not suffer from these
problems. I just:
    gdb --args dmd foo.d
and whammo, I got my stack trace, complete with files and line numbers.
There are two issues here. 1) Bugs which don't manifest as a segfault. 2)
Bugs in which a segfault is the manifestation, but the root cause is far away
(i.e.: not even in the call stack). I will say more on this below.
True, but I find seg faults from deep bugs to be only 10% of the seg fault bugs,
the rest are shallow ones. (10% figure is made up on the spot.)
[...]
Especially when there may be hundreds of instances running, while only a few
actually experience a problem, logging usually turns out to be the better
choice. Then consider that logging is also more useful for bug reporting, as
well as visualizing the code flow even in non-error cases.
Sure, but that doesn't apply to dmd. What's best practice for one kind of
program isn't for another.
There are many times when a command-line program offers logging of some sort
which has helped me identify a problem (often a configuration error on my
part). Some obvious examples are command shell scripts (which, by default,
simply tell you everything they are doing...both annoying and useful) and
makefiles (large build systems with hundreds of makefiles almost always
require a verbose mode to help debug a badly written makefile).
Also, note that when I am debugging a service, I am usually using it in a
style which is equivalent to dmd. That is, I get a repro case, I send it in
to a standalone instance, I look at the response and the logs. This is really no
different from invoking dmd on a repro case. Even in this scenario, logs are
incredibly useful because they tell me the approximate location where
something went wrong. Sometimes, this is enough to go look in the source and
spot the error, and other times, I have to attach a debugger. But even when I
have to go to the debugger, the logs let me skip 90% of the single-stepping I
might otherwise have to do (because they tell me where things *probably worked
correctly*).
Sure, I just don't see value in adding in code for generating logs, for dmd,
unless it is in the service of looking for a particular problem.
[...]
I've tried that (see the LOG macros in template.c). It doesn't work very
well, because the logging data tends to be far too voluminous. I like to
tailor it to each specific problem. It's faster for me, and works.
The problem is not that a logging system doesn't work very well, but that a
logging system without a configuration system is not first-class, and *that*
is what doesn't work very well. If you had something like log4j available,
you would be able to tailor the output to something manageable. An
all-or-nothing log is definitely too much data when you turn it on.
I am not seeing it as a problem that I tailor the printfs as required, or why
that would be harder than tailoring via log4j.
On 11/9/2012 11:44 PM, Walter Bright wrote:
[...]
There is some async code in there. If I suspect a problem with it, I've left
in the single thread logic, and switch to that in order to make it
deterministic.
But that doesn't tell you what the problem is. It just lets you escape to
something functional by giving up on the parallelism. Logs at least tell you
the running state in the parallel case, which is often enough to guess at what
is wrong. Trying to find a synchronization bug in parallel code is pretty
darned difficult in a debugger (for what I hope are obvious reasons).
Yes, that's true, but it enables me to quickly determine if it is a bug in the
async logic or not. BTW, my experience in adding logs to debug async code is
that they make it work :-).
[...]
Actually, very very few bugs manifest themselves as seg faults. I mentioned
before that I regard the emphasis on NULL pointers to be wildly excessive.
I would like to define a metric, which I call "bug depth". Suppose that
incorrect program behavior is noticed, and bad behavior is associated with
some symbol, S. Now, it could be that there is a problem with the immediate
computation of S, whatever that might be (I mean, like in the same lexical
scope). Or, it could be that S is merely a victim of a bad computation
somewhere else (i.e.: the computation of S received a bad input from some
other computation). Let us call the bad input S'. Now, it again may be the
case that S' is a first-order bad actor, or that it is the victim of a bug
earlier in the computation, say, from S''. Let us call the root cause symbol
R. Now, there is some trail of dependencies from R to S which explain the
manifestation of the bug. And let us call the number of references which must
be followed from S to R the "bug depth".
Now that we have this metric, we can talk about "shallow" bugs and "deep"
bugs. When a segfault is caused by code immediately surrounding the bad
symbol, we can say that the bug causing the segfault is "shallow". And when
it is caused by a problem, say, 5 function calls away, in non-trivial
functions, it is probably fair to say that the bug is "deep". In my
experience, shallow bugs are usually simple mistakes. A programmer failed to
check a boundary condition due to laziness, they used the wrong operator, they
transposed some symbols, they re-used a variable they shouldn't have, etc.
And you know they are simple mistakes when you can show the offending code to
any programmer (including ones who don't know the context), and they can spot
the bug. These kinds of bugs are easy to identify and fix.
The real problem is when you look at the code where something is failing, and
there is no obvious explanation for the failure. Ok, maybe being able to see
the state a few frames up the stack will expose the root cause. When this
happens, happy day! It's not the shallowest bug, but the stack is the next
easiest context in which to look for root causes. The worst kinds of bugs
happen when *everyone thinks they did the right thing*, and what really
happened is that two coders disagreed on some program invariant. This is the
kind of bug which tends to take the longest to figure out, because most of the
code and program state looks the way everyone expects it to look. And when
you finally discover the problem, it isn't a 1-line fix, because an entire
module has been written with this bad assumption, or the code does something
fairly complicated that can't be changed easily.
There are several ways to defend against these types of bugs, all of which
have a cost. There's the formal route, where you specify all valid inputs and
outputs for each function (as documentation). There's the testing route, where
you write unit tests for each function. And there's the contract-based route,
where you define invariants checked at runtime. In fact, all 3 are valuable,
but the return on investment for each one depends on the scale of the program.
Although I think good documentation is essential for a multi-coder project, I
would probably do that last. In fact, the technique which is the cheapest but
most effective is to simply assert all your invariants inside your functions.
Yes, this includes things you think are silly, like checking for NULL pointers.
I object to the following pattern:

    assert(p != NULL);
    *p = ...;

i.e. dereferencing p shortly after checking it for null. This, to me, is utterly
pointless in dmd. If, however,

    assert(p != NULL);
    s->field = p;

and no dereference is done, then that has merit.
But it also includes things which are less silly, like checking for empty
strings, empty containers, and other input assumptions which occur. It's
essentially an argument for contract-based programming. D has this feature in
the language. It is ironic that it is virtually absent from the compiler
itself. There are probably more assert(0) in the code than any other assert.
DMD has a fair number of open bugs left, and if I had to guess, the easy ones
have already been cherry-picked. That means the remainders are far more
likely to be deep bugs rather than shallow ones. And the only way I know how
to attack deep bugs (both proactively and reactively) is to start making
assumptions explicit (via assertions, exceptions, documentation), and give the
people debugging a visualization of what is happening in the program via
logs/debug output. Oftentimes, a log file will show patterns that give you a
fuzzy, imprecise sense of what is happening that is still useful, because when
a bug shows up, it disrupts the pattern in some obvious way. This is what I
mean by "visualizing the flow". It's being able to step back from the
bark-staring which is single-stepping, and trying to look at a stand of trees
in the forest.
The bulk of the bugs result from an incomplete understanding of how the various
semantics of the language interact with each other. For example, behavior X of
the language is accounted for in this section of code, but not that section.
What has helped with this is what Andrei calls "lowering" - rewriting complex D
code into a simpler equivalent, and then letting the simpler compiler routines
deal with it. What has also helped is refactoring. For example, the code to walk
the expression trees tends to be duplicated a lot. By switching to a single
walker coupled with an "apply" function, a number of latent bugs were fixed. I'd
like to see more of this (see src/apply.c).
Dang I wish I could use ranges in the D source code :-)
_______________________________________________
dmd-internals mailing list
[email protected]
http://lists.puremagic.com/mailman/listinfo/dmd-internals