Re: [fpc-devel] Boehm garbage collector for freepascal

Hans-Peter Diettrich Tue, 16 Nov 2010 11:14:02 -0800

Thaddy schrieb:

The rants about finalizers versus destructors in the context of GC are amany.Let me make it clear that these discussions almost invariably presumethe context of the C(++) language.The main thing being that C++ allocates objects/classes from the stackby default. In Freepascal (in Delphi mode or objfpc mode)objects/classes are guaranteed to be allocated from the heap.

I doubt that this is a reasonable argument, since stack objects do notdeserve any GC at all.

In the context of stack allocated objects there is (are many!) aconflict between finalizers and destructors which makes them mutuallyexclusive..In de context of heap allocating languages like Freepascal or Delphithese conflicts do not exist, not even in a serialized context.


Except for Object, the deprecated C-style object implementation.

Those arguments are therefore invalid. Even local objects are from theheap in Freepascal, as soon as the object goes out of scope it is markedfor deletion by the GC

That's not a general rule, unless there exist language features thatcontrol (disallow) e.g. passing of according references to subroutines,or storing them in other data structures (lists...) of extended lifetime.

For a good understanding why this is so you have to understand how adestructor in object pascal relates to the finalzer as used in the BoehmGC.The finalizer is called - and only called - if the memory (object) isalready marked for deletion. This means the memory is technicallyalready unreachable.It is perfectly legal to call a destructor on such an object even in aasynchroneous context..

Maybe, but what about owned objects and other references, still residingin an unreachable object? And how to prevent multiple destructor calls?

IMO destructors and finalizers are mutually exclusive, I remember anote like "Why a garbage collector never should call an destructor",that at least applies to mark-sweep GC.
This is *only* true for stack allocated objects like in C++ butdefinitely not for heap allocated objects like in freepascal and Delphi.Strongly put: the fact that Freepascal allocates from the heap makes itextremely suitable for the GC.

Distributed object *references*, in the stack, the objects themselves,and in global variables, are the biggest problem in GC. The distinctionbetween stack and heap objects in contrast is not a problem, since theGC knows about all heap objects.

If you read the documentation for the Boehm collector you can deductthat. (Also in the context of the Java discussions on the same subject.)
It should be clear that a destructor, that destroys further (owned)objects, will confuse an mark-sweep garbage collector, since it caninvalidate the marks. Consequently all allocated memory areas/objectsshould be flagged as either managed or unmanaged. Then FreeMem candecide, inside the memory manager, whether the memory block should bereleased immediately (if unmanaged), or should be marked for laterdeletion (if managed). Dunno about the concrete Boehm implementation...
This is not the case: When the finalizer is called the memory (alwaysallocated from the heap) is already outside of the scope of the normalprogram flow and can safely be released by the GC.


Finalization of an object can occur at any time, not only during GC.

Mark/sweep will work safely.


Safety by the conservative approach (if in doubt, let it survive).

The Boehm GC is smart enough to "see" the live pointers that nest insidethe main object.


At a high cost, as long as it has to guess what *are* pointers.

Regarding refcounted strings: the way it is implemented here doesn'tcarry any prize for beauty, but it seems to work alright.
That's a different GC model, not mark-sweep. Eventually the un/managedflag has to be extended, into managed-by-refcount andmanaged-by-mark-sweep.
Mark sweep will - empirically, granted - work after f.e. the refcount is0, not before. The inner workings of the string mechanism are simply notchanged. That's mainly why I consider my solution not pretty, btw.
And, frankly, I am not sure, as I wrote before.

I don't see any need why refcounted objects should be subject to anothergarbage collection. When a mark-sweep GC tries to deal with e.g. dynamicstrings or arrays, the result can only be *false* references to otherobjects. That's why both managers should work independently.

For a heap based language there is no conflict between a finalizer and adestructor. It is perfectly acceptable to call a destructor in thefinalizer, because it doesn't touch the stack at all. Eliminating thestandard " objections" .

Your stack argument is meaningless, for several reasons. Stack objectsnever are recognized and handled by mark-sweep GC, their "destructors"don't deallocate their memory at all. When FreeInstance does nothing inthe Pascal Class model - as it *must* do in a GC environment - there isno more difference between the destruction of stack and heap objects.

Nonetheless I'm still in doubt, whether (typical) destructor code isapplicable as a finalizer, since it may affect other objects by callingtheir destructors. We all know about problems with multiple calls to thedestructor of an object, when the destructor does not finalize (nil) allreferences to the destroyed objects; this is common practice, because adestroyed object goes away immediately after the destructor has beencalled, and some people argue that the use of FreeAndNil in andestructor indicates a bad design! And now you are going to call thedestructor of an object during finalization, without knowing whetherthat destructor already has been called! :-(

The real finalization problems reside elsewhere, see the Boehmdocumentation about the finalization semantics, how to avoid loops etc.- all that is related to heap objects as well.


DoDi

_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
http://lists.freepascal.org/mailman/listinfo/fpc-devel

Re: [fpc-devel] Boehm garbage collector for freepascal

Reply via email to