----- Original Message ----- From: "Julian Leviston" <[email protected]>
To: "Fundamentals of New Computing" <[email protected]>
Sent: Wednesday, October 13, 2010 9:13 PM
Subject: Re: [fonc] Spec-Driven & Self-Testing Code



On 11/10/2010, at 12:24 PM, BGB wrote:

<--
Does anyone know about a language (possibly something in smalltalk) that involves spec-driven development?

A language that stipulates a spec (in code - ie a test) must be written before any code can be written, and that can also do self-checks (sort of like a checksum) before the code is executed?

Julian.
-->

possibly relevant questions:
what would be the merit of imposing such a restriction?
how would one enforce such a restriction?
how could this be done without compromising the ability to use the language to get much useful done?
...

IMO, any such restrictions are better left under the control of "the development process" rather than trying to enforce them at the language level.


<--
The merit of imposing that users write specs first assumes that programmers have a plan before they start writing code, and then encourages them to become aware of their plan and requirements, explicitly laying them out in a series of tests of behaviour. (See http://behaviour-driven.org/ possibly for a good explanation of where I'm coming from here).
-->

quick skim of some of it...

ok, there seems to be a conflict here...

writing specs/plans first, and testing quickly/early, are very different approaches to software design.


in a typical spec- or plan-based process, a person will try to work out everything in advance, and delay coding until after the spec is relatively complete. this is what is usually termed the "waterfall method" or "BDUF" ("Big Design Up Front").

waterfall: first spec; then code; then test and debug; then deploy.


in a test-driven setup, the tests and code are usually written early and alongside each other, and planning is minimal (often just sketching out basic ideas for what the thing should do).

in this latter form, specs are usually written *after* the code, either to document the code, or to help identify which directions to go next, ...

this would seem to be more the style advocated by the posted link.
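as a concrete sketch of the test-driven style described above, here is a minimal example using Python's unittest module (the function and test names are purely illustrative, not from the original post); in a test-first workflow the TestCase would be written before the function body exists:

```python
import unittest

# hypothetical function under test; in a test-first style, the
# TestCase below would be written before this body is filled in
def word_count(text):
    return len(text.split())

class TestWordCount(unittest.TestCase):
    # the tests encode the intended behavior alongside (or before) the code
    def test_counts_whitespace_separated_words(self):
        self.assertEqual(word_count("spec driven development"), 3)

    def test_empty_string_has_no_words(self):
        self.assertEqual(word_count(""), 0)

result = unittest.TextTestRunner(verbosity=0).run(
    unittest.defaultTestLoader.loadTestsFromTestCase(TestWordCount))
```

the tests double as a lightweight executable spec of the intended behavior, which is roughly what the behaviour-driven literature advocates.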


it seems like the posted material jumps between advocating several different methodologies.

admittedly, I tend to combine both strategies, usually writing specs for individual components or features, but using an incremental process and regular testing for large scale features.


<--
I'm not advocating enforcement, rather encouragement. As most of us know, you cannot happily enforce things on people, let alone programmers - for example Java's nightmarish type system hinders more than helps (in my humble personal opinion, only). One *can* encourage certain things through the FORM of the language/IDE/workspace/etc., though (for example, smalltalk's class browser encourages a programmer to think in terms of classes)... unfortunately composition might be a better practice most of the time and it *may* be that having a class browser as a default tool encouraged the "over-use" of inheritance that we have seen as quite a dominant practice/idea in the OOP world.
-->

enforcement also usually only works when there is something well-defined to enforce, or when enforcement can take the form of withholding some specific feature.


<--
What I *am* advocating, though, is a checksum that is written by testing suites at various levels that would mean that code wouldn't run unless it had at least the most basic testing suite run against it. I'm advocating that behaviour-tests should be a mandatory part of development (any development process - be it writing a song, programming a program or building a house). The best of us as developers in any realm do these kinds of requirement-conformance "tests" when building whatever it is we're building ANYWAY, so I see it as simply a formalisation of a hitherto unexpressed best-practice.
-->

IMO, "checksum" is also a bad choice of a terms to use in reference to unit tests or similar. typically, what a checksum does and what a unit test do are very different, and it would create unneeded confusion to use one term to describe the other.

the problem with the proposed idea is that, until both the code and the tests are written, one is almost inevitably going to write a test which always passes, just so that the code can be run and, in turn, tested and debugged.
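such an always-passing placeholder might look like the following sketch (in Python's unittest, used here purely for illustration); it satisfies a "tests must exist before code runs" rule while asserting nothing about the real code:

```python
import unittest

class TestNotYetWritten(unittest.TestCase):
    # a placeholder written only to satisfy a mandatory-test rule;
    # it asserts nothing about the actual code under development
    def test_placeholder(self):
        self.assertTrue(True)

result = unittest.TextTestRunner(verbosity=0).run(
    unittest.defaultTestLoader.loadTestsFromTestCase(TestNotYetWritten))
print(result.wasSuccessful())  # prints True: the gate is trivially passed
```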

traditionally, code will be run regardless of whether it is known to work, and the main point of unit testing is then to raise red flags that something may need to be looked at or fixed, or to track new functionality which "should" work but doesn't as of yet. if adding a new feature may make a test fail, and a failing test breaks the build, then people will delay adding the test until after the feature is generally known to work, partly defeating the purpose of unit testing.
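some frameworks already support tracking not-yet-working functionality without breaking the build; for example, Python's unittest has an expectedFailure decorator (shown here as an illustration, not something from the original discussion):

```python
import unittest

class TestFutureFeature(unittest.TestCase):
    # tracks functionality which "should" work but doesn't yet;
    # the failure is reported as expected and does not break the build
    @unittest.expectedFailure
    def test_not_yet_implemented(self):
        self.fail("feature still in progress")

result = unittest.TextTestRunner(verbosity=0).run(
    unittest.defaultTestLoader.loadTestsFromTestCase(TestFutureFeature))
print(result.wasSuccessful())  # prints True: an expected failure is not fatal
```

this lets the test be committed alongside the unfinished feature, rather than delayed until the feature works.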


<--
Code should simply be able to test itself, to perform minimal self-diagnostics in the same way that advanced photocopiers or computers can do this. If we bake this in at a micro-granular level, then the reliability and stability of systems will skyrocket. Not only this, but code essentially self-documents then. If we are as programmers, encouraged to document before *and* after we write code, we will end up with a neat description of what we were trying to do and not just the output.
-->

usually I just write tests as little special-purpose frontends which check that a piece of code works as expected. I have tried using these for my C compiler as well, but sadly I am then left to realize that my compiler is partly broken: some tests fail for reasons not yet identified (seriously, in some cases I comb through the ASM output and still can't find the cause of the error), and I have neither the time nor the energy to really beat all the bugs out of it. my main codegen was written with, sadly, very little testing, and is very internally complex, so it is largely taken on faith that a lot of this stuff actually works...


sadly, some pieces of code, particularly compilers, are somewhat difficult to test on a fine-grained basis, and so have to be tested at a coarser scale: whether fragments of code compile and behave as expected. this leaves a lot of need for manual intervention in testing and debugging, such as reviewing the output of various internal processes and trying to track down specific errors (possibly in a large mess of other data), making debugging a relatively labor-intensive process.
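the coarse-grained "does this fragment compile and behave as expected" style of test can be sketched roughly as follows; here Python's built-in compile()/exec() stand in for the compiler under test, and all names are illustrative:

```python
# sketch of a coarse-grained compiler test: compile a source fragment,
# run it, and compare observable behavior against an expected result.
# Python's built-in compile()/exec() stand in for the compiler under test.

def check_fragment(source, entry, args, expected):
    code = compile(source, "<fragment>", "exec")  # does it compile?
    env = {}
    exec(code, env)                               # does it run?
    return env[entry](*args) == expected          # does it behave as expected?

fragment = """
def add(a, b):
    return a + b
"""
print(check_fragment(fragment, "add", (2, 3), 5))  # prints True
```

when such a check fails, it says only that *something* between parsing and codegen went wrong, which is why isolating the actual cause still requires manual digging through intermediate output.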

this is, sadly, an ongoing battle in my case.



<--
As a community, I'm trying to get us to shift from being the kid in maths class who says "12" when the teacher says "What is 3 times 4"? The working out is at least as important as the answer... as we all know in later mathematics, more weight is attributed to the process than to the outcome... if there are 12 steps in a process to build an answer, and only the 12th is incorrect, then that is 11 out of 12 as a score, as opposed to if there is only an incorrect answer and no working out, that is 0 out of 12 as a score... or 12 out of 12 if it is correct.

Do you see my point here?
-->

hmm... not all of us are good with maths...


<--
Of course, just as it's possible to write structured code inside Smalltalk, or any Object-Oriented language, it would be possible to "get around" this encouragement of behavioural-driven-testing built into a language. But that's beside the point. I'm not trying to get to a state where things are IMPOSSIBLE to break. I'm trying to get to a state where they're encouraged to work.

If I didn't want to write tests, I could simply write tests that evaluated in true. Hands dusted and done. Mind you, the stakeholders (maybe me) would probably get frustrated as the project gets larger and more regression errors creep in with each suite of changes imposed on the system.
-->


even OO is not a requirement:
even though OO is popular, not everything needs to be OO or written in an OO language, and OO versus not-OO is really a side issue to the matter of unit tests...



sadly, if failing tests prevented the project from building or running, then tests which always (or almost always) evaluate to true would essentially be the only reasonable outcome in most cases...

granted, testing could have separate "warning" and "error" conditions: a warning condition is a test which fails but the code can still be run (only something minor has broken, or correct behavior remains a future goal), while an error condition means "something has just gone horribly wrong here", with the assumption that the test itself can be used to help debug the problem.
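a minimal sketch of such a two-severity harness might look like this (all names are illustrative, not from any real framework); only error-level failures block the run:

```python
# minimal sketch of a test harness with separate "warning" and "error"
# severities: warnings are reported but only errors block the run
WARNING, ERROR = "warning", "error"

def run_tests(tests):
    blocked = False
    for name, fn, severity in tests:
        try:
            fn()
        except AssertionError as e:
            print(f"{severity.upper()}: {name}: {e}")
            if severity == ERROR:
                blocked = True  # something has gone horribly wrong
    return not blocked  # True means the code may still be run

def minor_cosmetic_check():
    assert False, "output formatting is off (correct behavior is a future goal)"

def core_sanity_check():
    assert 1 + 1 == 2

ok = run_tests([
    ("cosmetic", minor_cosmetic_check, WARNING),
    ("sanity", core_sanity_check, ERROR),
])
print(ok)  # prints True: a warning-level failure alone does not block the run
```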


often, it is still necessary to run even known-broken code in order to better isolate the problem. preventing an official release with broken code is an altogether different issue...


or such...


_______________________________________________
fonc mailing list
[email protected]
http://vpri.org/mailman/listinfo/fonc
