[jruby-dev] A choice of two serialization/persistence strategies

Alan McKean Sun, 15 Jul 2007 17:15:21 -0700

I currently have two strategies implemented for serialization and persistence. Both require storing the runtime in a thread-local variable 'currentRuntime'. They fetch the runtime from there when they need it via a Ruby.getCurrentRuntime() call. I would like to hear your critiques and comments.
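For readers who want to picture the thread-local lookup, here is a minimal sketch. ThreadBoundRuntime is a hypothetical stand-in for the Ruby runtime class, and setCurrentRuntime() is an assumed helper that is not mentioned above; only the 'currentRuntime' variable and the getCurrentRuntime() name come from the description.

```java
// Hypothetical stand-in for the Ruby runtime class; not the JRuby source.
final class ThreadBoundRuntime {
    // One runtime per thread, mirroring the 'currentRuntime' thread-local.
    private static final ThreadLocal<ThreadBoundRuntime> currentRuntime = new ThreadLocal<>();

    // Assumed helper: binds a runtime to the calling thread.
    static void setCurrentRuntime(ThreadBoundRuntime runtime) {
        currentRuntime.set(runtime);
    }

    // Returns the runtime bound to the calling thread, failing loudly if none is bound.
    static ThreadBoundRuntime getCurrentRuntime() {
        ThreadBoundRuntime runtime = currentRuntime.get();
        if (runtime == null) {
            throw new IllegalStateException("no runtime bound to this thread");
        }
        return runtime;
    }
}
```

The consequence is that deserialization has to happen on a thread that already has a runtime bound to it.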


First (least intrusive) strategy:

Decouple an instance from the runtime by making RubyObject.metaClass transient, and

Add a RubyObject.metaClassName field

When an object is serialized/persisted, it goes alone. When it is deserialized, it uses the metaClassName to fetch its metaclass from the runtime. Voila! It's ready to go.
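A rough sketch of the first strategy, using plain Java serialization. NameRuntime, NameClass, NameObject and NameStreams are hypothetical stand-ins rather than the real JRuby classes, and fetchClass() is an assumed lookup method; only the transient metaClass field, the metaClassName field and the lazy getMetaClass() follow the description.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the runtime: a thread-local holder plus a class table.
final class NameRuntime {
    private static final ThreadLocal<NameRuntime> CURRENT = new ThreadLocal<>();
    private final Map<String, NameClass> classes = new HashMap<>();

    static NameRuntime getCurrentRuntime() { return CURRENT.get(); }
    static void setCurrentRuntime(NameRuntime runtime) { CURRENT.set(runtime); }

    NameClass defineClass(String name) {
        NameClass c = new NameClass(name);
        classes.put(name, c);
        return c;
    }

    // Assumed lookup by name; the real runtime API may differ.
    NameClass fetchClass(String name) { return classes.get(name); }
}

// Stand-in for RubyClass. Deliberately not Serializable: it stays with the runtime.
final class NameClass {
    final String name;
    NameClass(String name) { this.name = name; }
}

// Stand-in for RubyObject.
class NameObject implements Serializable {
    private static final long serialVersionUID = 1L;

    private transient NameClass metaClass; // dropped when the object is serialized
    private final String metaClassName;    // travels with the object

    NameObject(NameClass metaClass) {
        this.metaClass = metaClass;
        this.metaClassName = metaClass.name;
    }

    // Lazily re-attaches the metaclass after deserialization.
    NameClass getMetaClass() {
        if (metaClass == null && metaClassName != null) {
            metaClass = NameRuntime.getCurrentRuntime().fetchClass(metaClassName);
        }
        return metaClass;
    }
}

// Helper that serializes a value to bytes and reads it back.
final class NameStreams {
    static <T extends Serializable> T copy(T value) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(value);
            }
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                @SuppressWarnings("unchecked")
                T result = (T) in.readObject();
                return result;
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

After a round trip through NameStreams.copy(), the copy's first getMetaClass() call resolves to the same class instance the runtime already holds, so no metaclass data is ever written to the stream.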


Second, more intrusive strategy:

Leave the object coupled to its metaClass by leaving the metaClass variable non-transient.

Decouple the metaclass from the runtime by making the RubyClass/RubyModule marshal, allocator, methods, superclass, and parent fields transient.

Add a new transient field in RubyModule called 'loadedByRuntime' with a default value of true.

Leave only the RubyModule.classId field serializable.
Serialize/persist the object and its metaclass.

When the object is deserialized/reloaded, it brings along a duplicate, decoupled instance of the metaclass with it. On the first method invocation, RubyObject.callMethod checks the 'loadedByRuntime' boolean and swaps the object's reference to its metaclass with a reference to the original. The deserialized metaclass is then garbage collected and the object is hooked up to its original metaclass and is ready to go again.
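A rough sketch of the second strategy, again with plain Java serialization. SwapRuntime, SwapModule, SwapObject and SwapStreams are hypothetical stand-ins, the method table is reduced to a map of strings, and lookup() is an assumed method. The sketch relies on the fact that Java does not run field initializers when it deserializes a Serializable class, so the transient 'loadedByRuntime' flag comes back as false on the duplicate.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.Serializable;
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for the runtime: a thread-local holder plus a module table.
final class SwapRuntime {
    private static final ThreadLocal<SwapRuntime> CURRENT = new ThreadLocal<>();
    private final Map<String, SwapModule> modules = new HashMap<>();

    static SwapRuntime getCurrentRuntime() { return CURRENT.get(); }
    static void setCurrentRuntime(SwapRuntime runtime) { CURRENT.set(runtime); }

    SwapModule define(String classId) {
        SwapModule m = new SwapModule(classId);
        modules.put(classId, m);
        return m;
    }

    // Assumed lookup by classId; the real runtime API may differ.
    SwapModule lookup(String classId) { return modules.get(classId); }
}

// Stand-in for RubyModule/RubyClass: only classId is written to the stream.
class SwapModule implements Serializable {
    private static final long serialVersionUID = 1L;

    final String classId;
    // Stands in for the transient marshal/allocator/methods/superclass/parent fields.
    transient Map<String, String> methods = new HashMap<>();
    // True for modules created by the runtime; false (the JVM default) after deserialization.
    transient boolean loadedByRuntime = true;

    SwapModule(String classId) { this.classId = classId; }
}

// Stand-in for RubyObject: the metaClass reference is NOT transient.
class SwapObject implements Serializable {
    private static final long serialVersionUID = 1L;

    private SwapModule metaClass;

    SwapObject(SwapModule metaClass) { this.metaClass = metaClass; }

    SwapModule getMetaClass() { return metaClass; }

    // Checks the flag on every call and swaps in the runtime's original metaclass if needed.
    String callMethod(String name) {
        if (!metaClass.loadedByRuntime) {
            metaClass = SwapRuntime.getCurrentRuntime().lookup(metaClass.classId);
        }
        return metaClass.methods.get(name);
    }
}

// Helper that serializes a value to bytes and reads it back.
final class SwapStreams {
    static <T extends Serializable> T copy(T value) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                out.writeObject(value);
            }
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                @SuppressWarnings("unchecked")
                T result = (T) in.readObject();
                return result;
            }
        } catch (IOException | ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }
}
```

In this sketch the duplicate module becomes unreachable as soon as callMethod() replaces the reference, which is what lets it be garbage collected.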


Pros and Cons

The first strategy is simple but requires an additional instance field 'metaClassName' in RubyObject. Code is only touched in a couple of places:
1) the constructor, which, when it sets the metaClass variable, also sets the metaClassName.
2) the getMetaClass() accessor in RubyObject. It lazily initializes when the metaClass is null and the metaClassName isn't.

It trades space in every object for the simplicity of the solution.

The second strategy only requires an additional field in RubyModule: the transient 'loadedByRuntime' flag. However, getting rid of the duplicate metaclass requires switching the reference at runtime, and the check for that is done in callMethod() on every method invocation. This happens on every object, not just the persistent ones. Code is touched in several places, but is mostly in RubyModule and RubyClass.
1) The accessors in RubyModule cause a swap of all of the references to the allocator, marshal, methods, etc. on the first invocation of a method in the deserialized object. The persisted metaclass is now ready to go.
2) RubyObject.callMethod calls a 'MetaClassSwapper' with tasks (kinda like Runnables) that are configured at launch. One task switches the metaclasses and would be needed by serialization to get rid of the duplicate metaclass.
3) The second task is one that we would post to the swapper. It switches what we call the POM (persistent object memory) Ids. Once the POM Ids are switched, the persisted metaclass will not be reloaded along with the persistent object.

I prefer the first solution. It's simpler. Whaddya think? I will be happy to post a diff showing what I have done.
