On Feb 6, 2009, at 11:01 AM, Eric Kow wrote:

As Petr points out, this approach has its disadvantages:
1. Slow: it takes us one whole year to get rid of anything, as major darcs releases are every 6 months.
2. Time-consuming: it causes us to split our time between maintaining the old stuff and working on the new stuff.
3. Potentially bad engineering: we're increasing the number of possible code paths (yuck! conditional compilation!), thereby reducing the amount of time that each path is explored.

So despite my claims, it's not even completely clear that the so-called "conservative" sunset procedure is the sort of responsible engineering practice that it aspires to be.

These are pretty strong criticisms. It could be that the sunset approach would make new releases buggier rather than more stable. I have some experience with this sort of approach, and while I don't know whether my experience generalizes to the darcs codebase, I can say that the sunset approach didn't work out well for me.

Third, we want to make sure that we never break darcs, because sometimes Life Just Happens: deadlines pile up at work, buses hit people, hackers get girlfriends, babies are born.

This is a good consideration to keep in mind (and, by the way, the same thing happens in proprietary software development in a company -- priorities change all the time). I think the right approach to this is a policy of "trunk is always good". You never break trunk at time T intending to fix it again at time T+n. Instead, you do whatever work you need in a branch, and commit it to trunk only once it is strictly better than the current trunk. A corollary is that if a patch lands in trunk and is then discovered to contain a regression, that patch is rolled back.

Obviously this is pretty much impossible without test-driven development. If you don't have thorough tests, then how do you know if you're breaking things with your patches?

You have some good questions about test-driven development:

1. The kinds of things the sunset procedure aims to catch are integration errors (unexpected interaction between different parts of darcs), and also real-world errors (e.g. HTTP not working behind proxies) that seem tricky to capture in laboratory conditions. I don't mean to say "we shouldn't do automated testing because it can't cover everything". Of course we should do more automated testing. But how should we catch the real-world errors?

My experiences with Twisted, and with Brian Warner on Tahoe, have taught me that such issues are a lot more programmable and reproducible than I had thought. Things that I used to consider obviously "manual", like "Write to the AIX user and ask him to misconfigure his network in that same way again and try again with this new build", are to these guys "automatable", like "Run a buildslave on AIX, figure out exactly which parts of our source code can be affected by network misconfiguration, and test how that code handles that effect".
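
For example -- and this is a minimal sketch with made-up names, not actual darcs code -- if the network-facing action is passed in as a parameter, a test can substitute one that fails the way a misconfigured network would, and exercise the error handling without any real network at all:

    {-# LANGUAGE ScopedTypeVariables #-}
    import Control.Exception (IOException, throwIO, try)

    -- Hypothetical code under test: retry a fetch once, then turn the
    -- failure into a readable error instead of crashing.
    fetchWithRetry :: IO String -> IO (Either String String)
    fetchWithRetry fetch = do
      first <- try fetch
      case first of
        Right body -> return (Right body)
        Left (_ :: IOException) -> do
          second <- try fetch
          case second of
            Right body -> return (Right body)
            Left (e :: IOException) ->
              return (Left ("fetch failed twice: " ++ show e))

    -- The "misconfigured network" is just an IO action that always throws.
    main :: IO ()
    main = do
      result <- fetchWithRetry (throwIO (userError "no route to host"))
      case result of
        Left msg -> putStrLn ("handled as expected: " ++ msg)
        Right _  -> error "test failed: expected the fetch to fail"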

This is not to deny your point -- certainly integration and the "real world" are always full of surprises, some things can't be automated with reasonable effort, and you will always want manual testing after all the automated testing is done. But what I've learned is that automated testing can address 90% of the cases that I formerly thought required manual testing.

2. I'm not sure how to go about testing IO-intensive stuff (I guess our functional tests, i.e. the shell scripts, are a good example)

In Twisted and in Tahoe, I've seen two complementary approaches taken. One is the lower-level, "unit test" sort of approach -- figure out what functions will be called with what sort of inputs in response to the I/O, and thoroughly test those functions under those inputs. Haskell should be *great* at this, right? The whole *point* of side-effect-free programming is that you don't have to worry about things *other* than the arguments affecting the computation.
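
To make this concrete, here's a minimal sketch of a property-based test with QuickCheck; "normalize" is a made-up stand-in for some pure function inside darcs, not actual darcs code:

    import Test.QuickCheck

    -- Hypothetical pure function under test: canonicalize path
    -- separators so Windows-style paths compare equal to Unix ones.
    normalize :: String -> String
    normalize = map (\c -> if c == '\\' then '/' else c)

    -- Because normalize is side-effect-free, its argument is the whole
    -- story, so we can state laws about it and let QuickCheck hunt for
    -- counterexamples.
    prop_idempotent :: String -> Bool
    prop_idempotent s = normalize (normalize s) == normalize s

    prop_noBackslash :: String -> Bool
    prop_noBackslash s = '\\' `notElem` normalize s

    main :: IO ()
    main = do
      quickCheck prop_idempotent
      quickCheck prop_noBackslash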

The other is a more holistic "functional test" approach -- simulate the circumstances that the code under test is required to handle. If you want to test that the code handles a user who mashes down the "n" key, then launch a subprocess, exec darcs in that subprocess, send a thousand "n" chars on its stdin, and examine how it behaves. If Haskell is not already good at this sort of thing, you can always write your functional tests (as currently) in bash (ugh), Perl (ugh), Python (yay!) or something, but Haskell will probably get good at it, because Haskell is growing up, and this is something a modern, well-rounded practical language needs to do well.
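
For what it's worth, here's a rough sketch of such a test in Haskell using System.Process; the "darcs revert" command is just illustrative, and a real test would first set up a scratch repository with some changes to answer "n" about:

    import Control.Exception (evaluate)
    import System.Exit (ExitCode (..))
    import System.IO (hClose, hGetContents, hPutStr)
    import System.Process (runInteractiveProcess, waitForProcess)

    main :: IO ()
    main = do
      (inH, outH, _errH, pid) <-
          runInteractiveProcess "darcs" ["revert"] Nothing Nothing
      -- Simulate a user holding down the "n" key at every prompt.
      hPutStr inH (replicate 1000 'n')
      hClose inH
      out <- hGetContents outH
      -- Force the output before waiting, so a full pipe can't deadlock us.
      _ <- evaluate (length out)
      code <- waitForProcess pid
      case code of
        ExitSuccess   -> putStrLn "ok: darcs coped with a thousand n's"
        ExitFailure n -> error ("darcs exited with failure code " ++ show n)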

3. It seems that for a heavy reliance on testing to work, we are going to need to have much much wider test coverage. How do we break out of this chicken and egg? Do we put everything on hold and launch a massive darcs testing initiative?

What the Twisted folks did when switching from their previous practices to the Ultimate Quality Development System was simply to mandate that any new patches had to fully satisfy the new requirements. This works well, because if the current code contains bugs, then at least they are old bugs, and in practice it causes less havoc to keep old bugs than it would to replace them with new bugs. The result of Twisted's practice has been a near-monotonic improvement in code quality -- the rate at which new bugs are introduced by patches is now much lower than the rate at which old bugs are fixed by patches.

Regards,

Zooko
---
Tahoe, the Least-Authority Filesystem -- http://allmydata.org
store your data: $10/month -- http://allmydata.com/?tracking=zsig