Re: destructors and finalization [hijacked from Re: Where are the software engineers of tomorrow?]

Christopher Smith Mon, 14 Jan 2008 22:58:54 -0800

James G. Sack (jim) wrote:

Christopher Smith wrote:

Andrew Lentvorski wrote:

..
Unfortunately, constructors and destructors are excruciatingly
difficult to get right in the presence of exceptions.  The fact that
you have to pull out about 3 different books to get this right
says that something is broken.

Or perhaps that there is something that can be learned.... ;-)


Most of the people who can actually explain clearly to me the
differences between destructors and finalization will observe that the
former is far more useful than the latter, and they both make it
difficult to accurately characterize the behaviour of a system
(finalizers manage to do so even without adding exceptions in to the mix
;-).


Can someone explain clearly the differences?
Seriously.
A quick glance at some googlehits and w'pedia doesn't give me a feel for
why this is a significant question, although I gather it's (maybe?) got
something to do with deterministic concerns (perhaps correctness?).

So, the differences are subtle, but as it turns out, significant. A finalizer is run when an object is determined to be unreachable, whereas a destructor is run when an object is being destroyed. This is important because technically an object isn't destroyed either before or after a finalizer is run. In fact, it is possible for the code in a finalizer to cause an object to become reachable again, sort of like "reincarnating the object", except that the object never died in the first place... As with garbage collection, finalizers usually aren't guaranteed to run at any particular time or even at all, whereas the destructor is guaranteed to run at exactly the time the object is destroyed. Also, in multithreaded systems it isn't unusual for finalizers to run in a dedicated thread/thread pool that is otherwise unrelated to the objects being finalized.

Destructors are even more useful in C++ because you can have objects whose lifetime is scoped to a particular block. That means you are guaranteed that a destructor will get invoked when you drop out of scope. In languages that have exceptions but don't have destructors, you usually see something like a "finally" keyword that allows programmers to define a block that is guaranteed to execute when you leave scope. While it accomplishes much the same thing, it doesn't allow this cleanup work to be encapsulated in the object (you have to write it out each time). So, for example, in Java a common idiom is:


Connection conn = null;
try {
   conn = ....;
   /* do some stuff that may cause exceptions */
} finally {
   try {
       if (null != conn) {
           conn.close(); //give it the ol' college try
       }
   } catch (SomeExceptions e) {
       log.error(e, "Error while closing connection");
   }
   conn = null; //not necessary, but you see people do this anyway
}

You get dizzy with all those blocks? Follow all the program flow? Now imagine if you had three different resources that needed to be cleaned up before you got out of Dodge (not too uncommon really). Lots of nesting of blocks. It's awesome (NOT!).


Now, in C++ the idiom is more like this:

{
   Connection conn(.....);
   /* do some stuff that may cause exceptions */
}

The "Connection" object's destructor cleans up the connection as best it can when conn drops out of scope. All the logic for how you tear down a connection is nicely encapsulated in Connection. The one significant drawback is that if an exception is thrown and the stack starts to unwind, the program will terminate if conn's destructor gets invoked as part of the unwind *and* it throws an exception. The general rule for this in C++ is "don't throw exceptions from a destructor", although it turns out there are some very rare cases where that is the right thing to do. Anyway, Connection's destructor will probably look something like this:


Connection::~Connection() {
   try {
       close();
   } catch (...) {
       log.error("Error when closing connection......");
   }
}
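
To make the "three different resources" case concrete, here is a minimal sketch of the scoped-cleanup idiom. Everything in it is made up for illustration: "Resource" is a hypothetical stand-in for Connection that just records what it does in a log, and "use_three_resources" and "run_demo" are names I invented. It only shows that three scoped objects get cleaned up in reverse order during stack unwinding, with no nested blocks.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for Connection: records open/close events in a log.
class Resource {
public:
    Resource(std::string name, std::vector<std::string>& events)
        : name_(std::move(name)), events_(events) {
        events_.push_back("open " + name_);
    }
    // Same rule as above: never let an exception escape a destructor.
    ~Resource() {
        try {
            events_.push_back("close " + name_);
        } catch (...) {
            // A real class would log the failure; the sketch just swallows it.
        }
    }
    Resource(const Resource&) = delete;
    Resource& operator=(const Resource&) = delete;

private:
    std::string name_;
    std::vector<std::string>& events_;
};

// Acquires three resources and then fails. No finally blocks, no nesting.
void use_three_resources(std::vector<std::string>& events) {
    Resource a("a", events);
    Resource b("b", events);
    Resource c("c", events);
    throw std::runtime_error("something went wrong");
}

// Runs the failing function and returns the recorded sequence of events.
std::vector<std::string> run_demo() {
    std::vector<std::string> events;
    try {
        use_three_resources(events);
    } catch (const std::runtime_error& e) {
        // By the time the handler runs, all three destructors have already run.
        assert(events.size() == 6);
        events.push_back(std::string("caught: ") + e.what());
    }
    return events;
}
```

Calling run_demo() should yield the three "open" entries, then the "close" entries in the opposite order (c, b, a), and only then the "caught" entry, since unwinding finishes before the handler body executes.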

There are some more fun cases involving exceptions and cleaning up resources (from my point of view, particularly with finalizers), but you get the basic idea.


Oh, and one last fun area for both of them: inheritance.

What if your parent class has a cleanup method, and so do you? Well, with finalizers, you generally have to remember to explicitly invoke the parent class's finalizer or it gets ignored. There isn't much a parent class can do to prevent this beyond making the finalize method non-overridable ("final" in Java parlance) and then having its finalizer do whatever cleanup it wants to and then invoke some overridable "finalize hook" method that subclasses can get involved with. CLOS has much prettier ways of doing this than most languages. With destructors, the parent destructors will get invoked after you've finished your work, and there's not much you can do to stop it beyond crashing. With C++'s static binding, there's one more fun thing: if you are going to have someone derive from your class, you almost certainly need to make your destructor virtual. Here's why. Imagine class A and class B, with B being a subclass of A:


class A {
public:
   ~A(); //destructor for A
};

class B : public A {
public:
   ~B(); //destructor for B
};

{
   A* foo = ....; //one should use auto_ptr's, but I don't want to confuse
   B* bar = ....; //folks who aren't super familiar with C++
   delete (bar);
   delete (foo);
}

Now, what's going to happen when you invoke "delete(bar)"? Well, the compiler sees "bar" is a pointer to an object of type B, so it is going to invoke B::~B() on *bar, and then in turn invoke A::~A() on *bar, before finally cleaning up the memory for bar. That all seems well and good. What about what happens when you invoke "delete(foo)"? Well, the compiler sees "foo" is a pointer to an object of type A, so it is going to invoke A::~A() on *foo, and then it'll clean up the memory. Sound good? Well, it is.... if foo really is a pointer to an instance of A. However, it is entirely possible it could be a pointer to an instance of B. In that case, whatever extra cleanup work that would happen in B::~B() is never going to happen. This can result in a resource leak (ick!). (Strictly speaking, the C++ standard makes this case undefined behaviour, so a leak is the optimistic outcome.) The solution is to make A::~A() virtual. Then the calls to both A::~A() and B::~B() become virtual function calls, which are guaranteed to catch whatever overrides might be in derived classes, at the expense of an additional jump instruction (and having to have a vtable for your objects, and not being able to inline the destructor quite so easily....).
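
Here is a small sketch of the fixed version, assuming the same A and B as above but with A's destructor declared virtual. The global "g_events" log and the "delete_through_base" function are just scaffolding I made up so the order of destructor calls can be observed; they aren't part of any real API.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Illustrative global log of destructor calls (not something you'd ship).
static std::vector<std::string> g_events;

class A {
public:
    // Virtual, so deleting through an A* dispatches to the most derived destructor.
    virtual ~A() { g_events.push_back("~A"); }
};

class B : public A {
public:
    // Runs first; A::~A() is invoked automatically afterwards.
    ~B() override { g_events.push_back("~B"); }
};

// Deletes a B through a pointer to A and returns the destructors that ran.
std::vector<std::string> delete_through_base() {
    g_events.clear();
    A* foo = new B();  // static type A*, dynamic type B
    delete foo;        // with a virtual destructor, both ~B and ~A run
    assert(g_events.size() == 2);
    return g_events;
}
```

With the virtual destructor in place, delete_through_base() should record "~B" followed by "~A". I deliberately left out a runnable non-virtual variant, since deleting a B through an A* without a virtual destructor isn't something you can rely on to behave in any particular way.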

Anyway, in practice, finalizers prove not to be that useful for most cases, and also a source of much additional complexity (though thankfully in the common case most of the complexity is in the hands of whoever has to write the memory manager) and destructors prove to be quite useful, particularly for managing non-memory related resources, but are also a source of much additional complexity.

My observation has been that, in general, cleaning up properly is one of those things that people naturally tend to think of as being trivial and unimportant, but programmers quickly learn is actually where a lot of the work and complexity comes from. Even in languages without destructors, finalizers, or exceptions, it's not like these problems go away, just that you deal with resource cleanup on a case by case basis (which allows for a lot of the cases to be quite simple), rather than in a generalized fashion. People who work primarily in languages without such constructs seem to often think they save themselves a lot of pain and suffering by just not having language features for resource cleanup, but my observation is they just aren't quite as aware of how much more painful it is going to be to deal with the issue.


--Chris

--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg
