Re: tutorial on using Cleaner-based finalization

Peter Levart Sun, 07 May 2017 12:42:51 -0700

Hi Rick,

On 05/07/2017 08:59 PM, Rick Hillegas wrote:

Thanks for investigating that sample case, Peter, and for providing a candidate solution. It is much appreciated. Your expert experience confirms my amateur hunch that this migration is a non-trivial re-factorization which merits its own mini-project. I have logged a new technical debt issue to track that effort: https://issues.apache.org/jira/browse/DERBY-6932
If I correctly understand what has been said so far, then I think that we should be able to get away with a single Cleaner instance for each component, essentially, one for each jar file which we build today. After we convert Derby into Jigsaw modules, that might translate into one Cleaner instance for each module. I don't see any advantage to the extra complexity of a separate Cleaner instance for each class which currently implements finalize().
Does that sound reasonable to you?

Each Cleaner instance means a separate thread which processes cleanup actions. If you create an instance in some common module and expose it to other modules, the same instance can be used in other modules that depend on it.
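
A minimal sketch of that arrangement, assuming a hypothetical holder class named SharedCleaner that lives in the common module (the name and the register() helper are illustrative, not part of Derby or of the patch discussed in this thread):

```java
import java.lang.ref.Cleaner;

// Hypothetical holder class for a common module; the name is illustrative.
final class SharedCleaner {
    // One Cleaner instance, hence one background cleanup thread,
    // shared by every caller of register().
    private static final Cleaner CLEANER = Cleaner.create();

    private SharedCleaner() {
    }

    // Registers a cleanup action for 'tracked'. The action must not
    // reference 'tracked', directly or indirectly, or the object can
    // never become unreachable.
    static Cleaner.Cleanable register(Object tracked, Runnable action) {
        return CLEANER.register(tracked, action);
    }
}
```

Other modules would then call SharedCleaner.register(...) instead of creating their own Cleaner, so the number of cleanup threads stays at one.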


Regards, Peter

Thanks,
-Rick


On 5/6/17 2:19 PM, Peter Levart wrote:
Thinking of this for some more time...
Although this is a nice exercise in converting finalize() to Cleaner API, I strongly suspect that ClientConnection.finalize() is doing unnecessary things. What it does is it:
- prints some trace message to logWriter
- closes logWriter (whatever it is)
- closes raw socket input and output streams
- closes the socket
- notifies listeners
I believe all this is unnecessary (apart from notifying listeners, perhaps?) as those objects already have their own cleanup mechanism when they are left behind. I believe there would be no resource leaks if ClientConnection.finalize() was simply removed. Before doing the conversion from finalize() to Cleaner API one should always ask oneself whether the finalize() method is actually needed. All 3rd party code (with notable exceptions which use JNI) is based on Java SE APIs, and these APIs already take care of resources held by objects that are left behind. There's usually no need to do the same in the higher layers of 3rd party code.
What do you think?

Regards, Peter


On 05/06/2017 10:01 PM, Peter Levart wrote:
Hi Rick and others,

On 05/04/2017 06:48 PM, Lance Andersen wrote:
Here are a few examples I believe:
http://svn.apache.org/repos/asf/db/derby/code/trunk/java/client/org/apache/derby/client/am/ClientConnection.java
http://svn.apache.org/repos/asf/db/derby/code/trunk/java/client/org/apache/derby/client/ClientPooledConnection.java
http://svn.apache.org/repos/asf/db/derby/code/trunk/java/engine/org/apache/derby/impl/jdbc/EmbedPreparedStatement.java
I took a bite at the 1st one (ClientConnection). Here's the result:

http://cr.openjdk.java.net/~plevart/misc/Cleaner/derby/ClientConnection_finalize2cleaner.patch
I haven't tested this, but I believe it should work. It was quite a challenge, because of the way current ClientConnection code is structured. I tried to make the patch not incompatibly change public API of ClientConnection and related classes and I almost succeeded. The problematic part was the protected boolean ClientConnection.closed_ flag. If there is any subclass of ClientConnection (apart from NetConnection which is derby code) that modifies this field, you are out of luck as changing (not only reading) this field directly may have an undesirable consequence (or it may not, since the only thing that changing this field to false would do is redundantly force performing the cleanup action. If the cleanup is idempotent, then all is OK).
A further complication with ClientConnection is that it maintains a split state - some of it resides in ClientConnection and subclasses (such as NetConnection) and some of it in an embedded object of the Agent class and subclasses (such as NetAgent). Both parts - some of the state from the connection object and some from the agent - are needed to perform the cleanup that is currently executed from the connection's finalize() method. When using Cleaner API, we have to capture this state from both places (or more, since each class has a hierarchy) and then arrange for the cleanup action to use this state. Captured state cannot reference the tracked object (ClientConnection in this case) either directly or indirectly, since then it will never be GCed. When the cleanup action is run, the tracked object is already unreachable - this is the main difference from finalize(), where there is a phase in the object's life-cycle where it is still reachable, albeit guaranteed only from the thread executing the finalize() method. We cannot capture the Agent object either, since it maintains a reference back to the ClientConnection object. All this is further complicated by the fact that captured state is mutable and we have to arrange for it to be mutated in both places. If the mutable state is captured by reference and the instance containing it never changes during the lifetime of the tracked container object, then it is easy - we just capture the object after the tracked container instance is constructed. If the captured state includes mutable fields directly in the tracked container object, then we must arrange for them to be synchronously mutated in two places. Such fields are:
- ClientConnection.open_ (replicated in ClientConnection.CleanupAction.open)
- Agent.logWriter_ (replicated in Agent.CleanupAction.logWriter)
- NetAgent.rawSocketInputStream_ (replicated in NetAgent.NetCleanupAction.rawSocketInputStream)
- NetAgent.rawSocketOutputStream_ (replicated in NetAgent.NetCleanupAction.rawSocketOutputStream)
Fortunately all of this state is encapsulated, with the protected field ClientConnection.open_ being an exception.
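
The replication of mutable fields described above can be sketched roughly as follows. This is not the code from the patch: TrackedConnection, its CleanupAction, and all method names are hypothetical and only show the pattern, assuming a single stream and an open flag as the captured state.

```java
import java.io.Closeable;
import java.io.IOException;
import java.lang.ref.Cleaner;

// Hypothetical class; names are illustrative and not taken from the patch.
final class TrackedConnection {
    private static final Cleaner CLEANER = Cleaner.create();

    // Static nested class, so it carries no hidden reference to the
    // outer TrackedConnection instance.
    static final class CleanupAction implements Runnable {
        // Replicas of the owner's mutable fields.
        private volatile boolean open = true;
        private volatile Closeable stream;

        @Override
        public void run() {
            if (!open) {
                return; // owner was closed explicitly; nothing left to do
            }
            open = false;
            Closeable s = stream;
            if (s != null) {
                try {
                    s.close();
                } catch (IOException ignored) {
                    // best effort during cleanup
                }
            }
        }
    }

    private final CleanupAction action = new CleanupAction();
    private boolean open = true;
    private Closeable stream;

    TrackedConnection() {
        // The action is registered against 'this' but never references it.
        CLEANER.register(this, action);
    }

    // Each mutator updates the field and its replica together.
    void setStream(Closeable s) {
        this.stream = s;
        action.stream = s;
    }

    void markClosed() {
        this.open = false;
        action.open = false;
    }

    boolean isOpen() {
        return open;
    }

    // Exposed for illustration so the action can be run by hand.
    CleanupAction cleanupAction() {
        return action;
    }
}
```

Keeping every write in a mutator that touches both copies is what makes an unencapsulated field (such as a protected flag written by subclasses) the awkward case.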
Note that Cleaner API also allows for the cleanup action to be triggered explicitly, which then de-registers it. This is one of its advantages over finalize(), where you cannot deregister an object when it is already explicitly closed, for example. finalize() will always be called even if closed explicitly. If you create lots of finalizable objects (such as connections, statements, etc...) and promptly close() them, they still wait for finalization and use resources (heap, CPU when GC searches for them, ReferenceHandler enqueues them, and finally the finalize() method which is executed after the fact). Explicit triggering and de-registration of the cleanup action is performed in ClientConnection.closeResourcesX() (called from public close() and closeResources()) after the connection has already been marked as closed. The cleanup action will be a no-op at this point, but it will also be de-registered. This is important so as not to bother GC with reference processing when it is not needed any more. In situations where cleanup action logic is the same as explicit closing logic (in the case of ClientConnection it is not), the close() method could just invoke cleanable.clean() and delegate the meat of processing to the cleanup action.
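
For the simpler situation mentioned last, where explicit closing and cleanup share the same logic, a minimal sketch could look like this. ManagedResource, Release, and the RELEASES counter are hypothetical names used only for illustration; the counter stands in for whatever real release work would be done.

```java
import java.lang.ref.Cleaner;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical resource whose close() and cleanup logic are identical.
final class ManagedResource implements AutoCloseable {
    private static final Cleaner CLEANER = Cleaner.create();

    // Counts how often the release logic ran; for demonstration only.
    static final AtomicInteger RELEASES = new AtomicInteger();

    // Static nested class: no reference back to ManagedResource.
    private static final class Release implements Runnable {
        @Override
        public void run() {
            RELEASES.incrementAndGet();
        }
    }

    private final Cleaner.Cleanable cleanable =
            CLEANER.register(this, new Release());

    @Override
    public void close() {
        // clean() runs the action at most once and de-registers it, so
        // the Cleaner has nothing left to do for this object afterwards.
        cleanable.clean();
    }
}
```

Calling close() twice runs the release logic only once, and an instance that is never closed still gets released by the Cleaner thread some time after it becomes unreachable.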
Hope this non-trivial example helps illustrate what is needed when converting finalize() to Cleaner API.
Regards, Peter
