Distributed ExecutorService

Peter Firmstone Sun, 06 Dec 2009 16:34:04 -0800

I've had a few thoughts about the whole "move the code to the data"concept (or "Move the code to the Service node") for some time,considering it a low priority, I have kept quiet about it, untilrecently when the topic came up during a recent email discussion.

Current Practise for River applications is to move code and data aroundtogether in the form of marshalled objects. Two particular groups ofObjects are of interest, those that are process or code intensive wheremethods process and create returned results and data intensive objectswhere there is little to be done in the way of processing, where minorcopy / transformations are performed on existing state.

I think that the River platform addresses these Object groups quiteeffectively when the processing is known at compile time or when theservice requirements are clear. However there are Occasions when itwould be less network intensive or simpler to submit the distributedequivalent of a ScheduledTask or Runnable to consume an existing dataintensive service at the origin of that service and make the desiredresult available via a temporary service or some other mechanism orprotocol. In cases where particular class files and libraries requiredto perform processing are available at the service node, but unavailableat the client due to a legacy java environment, no ability to loadremote class files, or a constrained memory environment that cannotprovide enough memory space for the processing required. The result ofthe uploaded runnable class file can be transformed into a locallyavailable or compatible class file.

The Runnable uploaded code might be uploaded to the service node, by theclient or a third party mediator. Any suggestions for what themechanism should be would also be useful. I'm thinking that a signedOSGi bundle containing a set of permissions would be a good model tostart from, considering that OSGi already has many of the Securitymechanisms that would make such a thing possible.

In essence the DistributedScheduledTask is a remote piece of client codethat is executed in the service node. I'm wondering just what should aDistributedExecutorService provide, if anyone else has had thoughtssimilar to mine.

For instance, a Reporting Node in a cluster might send out the sameDistributedScheduledTask to all available services of a particular typeto perform some intensive data processing or filtering remotely at eachnode and retrieve the results from each after processing. The ReportingNode might have changing reporting requirements similar to performingqueries for instance.


Cheers,

Peter.

Distributed ExecutorService

Reply via email to