Re: [jruby-dev] Serialization and Persistence

Charles Oliver Nutter Tue, 03 Jul 2007 00:33:35 -0700

Alan McKean wrote:

Since the lack of Java serialization of JRuby objects stops us dead inour tracks when trying to hook up our persistence engine, I aminterested in either getting someone on this end to work on it orjumping in myself. In either case, I need some background on the JRubyruntime architecture and some guidance on particular issues. The issuesare about how to detach an object from its runtime elements and how torestore them when the object gets reloaded into memory:
1) When we first tried saving a JRuby object to our database, we saw itdrag along a gaggle of runtime objects. Given that it might be loadedinto a different VM when it is brought in from the database,is reconnecting the object to a particular runtime important? If so, isthere a way of determining which of the available runtimes would be bestto connect it to?
2) Detaching an object from its 'runtime' variable and making the'metaclass' variable transient lets us store the object in our databasewithout dragging much else along. But we need to reconnect things whenthe object is reloaded into memory. Is there a canonical name for themetaclass that we could store in the database along with the instance?If not, what information is available for reconnecting. iWe persist typeinformation in our Java product by storing the fully-qualified name ofthe class with the object, then lazily loading and initializing theconnection (using the name) when we reload the object to memory. Willthis work in JRuby?
If someone has thought through a strategy for deserializing a JRubyobject and restoring its connections to its runtime, I would love tohear about it.

I've been looking into this a bit tonight. This email represents merambling.

Marking metaclass and finalizer as transient are no-brainers. I'm goingto go ahead and commit that.

I'm going to see what would be needed to remove getRuntime everywhereit's needed. It would be a big job...be back in a few minutes...


...ok I'm back. I think it's doable. Here's more rambling thoughts.

The runtime connection is used for a few things:

1. to construct other objects

This is mostly a self-fulfilling prophecy. Objects require a runtimewhen they're created, so all objects need runtime available to createobjects. If we break that chain, a number of places that depend onruntime disappear.


2. to locate classes in order to construct objects

This is a little harder to eliminate. In order to construct a Ruby"String" object, you need to have access to the "String" metaclass. Thatmeans having access to the place where the "String" metaclass is stored,currently in the runtime. Again, this is largely self-fulfilling; youneed access to a metaclass to construct an object, so you need to locatethe metaclass, and since the metaclasses are currently rooted in theruntime, you need the runtime. But the runtime dependency is largelyperipheral to the use case.


3. to access runtime-global and thread-local data at execution time

This is probably the hardest to eliminate. Every thread Ruby codecreates or encounters is associated with a ThreadContext, which containsextra thread-local state needed for executing Ruby code. Every externalthread that touches a given runtime is "adopted" and given aThreadContext and a Ruby "Thread" avatar to represent it. So a givenJava thread may have many ruby "Thread" and "ThreadContext" associatedwith it, one per runtime it has touched. This allows us to share threadsacross runtimes, rather than having a given thread bound to a givenruntime execlusively, as in many other JVM languages. But it alsorequires that we locate the runtime, and therefore the ThreadContext, ina different way. Therefore, we have the runtime dependency.

This essentially sums up all the major reasons why we have so manydependencies in code on access to a runtime object. And ultimately,requiring access to a runtime object obliterates the possibility ofthird-party manipulation and transport of Ruby objects.

So to summarize, the three actual reasons we depend on runtime beingpresent are as follows:


1. to access and maintain types associated with a specific ruby worldspace
2. to access and maintain state associated with a specific ruby worldspace

3. to provide execution state and primitives for code running in aspecific ruby worldspace

Now let's rewrite the list by substituting in a different concept forour top-level ruby worldspace:


1. to access and maintain types associated with a specific ClassLoader
2. to access and maintain static associated with a specific ClassLoader

3. to provide execution state and primitives for code running in aspecific ClassLoader


So let's examine how we'd solve these issues.

First off, IRubyObject.getRuntime(). Let's assume that the classloaderthat loads the Ruby class is our chosen, ultimate worldspace:


public Ruby getRuntime() {
  JRubyClassLoader cl = (JRubyClassLoader)Ruby.class.getClassLoader();
  cl.getRuntime();
}

Everything else largely falls out of this. Starting up a new instance ofJRuby largely becomes the act of constructing the top-level classloaderin which it will live and telling it to "go".


JRubyClassLoader cl = new JRubyClassLoader(..., properties);
cl.evalScript("puts 'hello'", "(eval)");

Everything lives underneath the classloader, and since all classes haveaccess to that classloader, all code can retrieve the runtime associatedwith it.

Would something like this work? The view from inside the classloaderseems pretty reasonable...we already have this root context andpartitioning as part of Java's classloader support, and it seems fairlynatural to use it. But I'm not well-enough versed in Java serializationto know if this will solve our deserialization issues. It may requireyou to have more control over the object stream...but of course if youhave control over the object stream, you could also just have it ask aspecific runtime to unmarshal objects, avoiding the issue completely.


Thoughts? More ideas?

- Charlie


---------------------------------------------------------------------
To unsubscribe from this list please visit:

   http://xircles.codehaus.org/manage_email

Re: [jruby-dev] Serialization and Persistence

Reply via email to