Thanks Pete, good to hear from you again, I must admit it's been too
long. We last spoke when I was refactoring a class dependency tool to
use ASM instead of the JDK's tools.jar. You once asked: how do you find
a dependency for calls to Class.forName?
The reasons you've stated are also why I've chosen to support a very
narrow use case; as you may have already noted, the serialized
connection is between two identical bundles, in separate JVMs, with
compatible package imports.
There's no intent to support transferring any classes outside of the
Service API, and that includes overriding classes. What you describe
about data hiding reminds me of Jini's Entry classes, which have
public fields.
I'll be the first to admit there are significant issues with the design
of Java Serialization's extralinguistic API. Ironically though, the
wire protocol is reasonably well thought out with regard to evolution.
As an exercise to fix security issues, I have reimplemented Java
serialization with input validation using a public API. It has a
backward compatible serial form, but only supports a subset of Java
serialization; it doesn't support circular object graphs, for example,
as these would compromise security. It performs input validation, sets
resource limits and expects periodic stream resets to avoid DoS and
gadget attacks. The problem is that there's a lot of existing software
that utilises Java Serialization, and that's going to need support for
some time.
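For a flavour of the kind of input validation and resource limits I
mean, the JDK's own java.io.ObjectInputFilter (Java 9+) covers similar
ground; a minimal sketch (the whitelisted package and the limits below
are made up, and this is not my implementation):

    import java.io.ByteArrayInputStream;
    import java.io.ObjectInputFilter;
    import java.io.ObjectInputStream;

    public class FilteredDeserializer {
        public static Object read(byte[] data) throws Exception {
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(data))) {
                // Whitelist the Service API packages, reject everything else,
                // and cap graph depth, reference count, array sizes and bytes.
                ObjectInputFilter filter = ObjectInputFilter.Config.createFilter(
                    "com.example.serviceapi.*;java.lang.*;java.util.*;!*;"
                    + "maxdepth=20;maxrefs=1000;maxarray=10000;maxbytes=1048576");
                in.setObjectInputFilter(filter);
                return in.readObject();
            }
        }
    }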
Things like Serialization and Remote Method Invocation are damaged by
attempts to implement too much functionality, when a more rigid subset
would avoid a number of issues. But I guess no one was thinking of
modularity and versioning when they created these frameworks either.
James Gosling once said something about why generics weren't included
in Java from the outset: at the time they didn't know how to do it
properly, and it's better to leave something out until you do.
JBoss has a nice web page, with some graphics, illustrating the major
issues you've mentioned with implementation hiding, Serialization and
modular frameworks:
https://developer.jboss.org/wiki/ModularSerialization
It's worth noting the Service API of the smart proxy bundle doesn't
need to be Serializable; instead, serialization is relegated to a
communication means between two identical bundles in different JVMs.
It's also important to recognise that it doesn't need to be the
communication mechanism either. These bundles have an identical class
namespace, although there may be variances in package import versions.
Yes, we are also looking at moving away from Java serialization.
Also, because this is a service interface, at some point down the
track serialization can be replaced without impacting the public API.
So yes, the underlying protocols can be stripped back to data and
message passing if that's more satisfactory.
So yes, Java serialization is an existing part of our application, and
it has its warts.
But we've also had a number of users over the years who have requested
support for OSGi.
This is not a greenfield project; I'm hoping I'm not going to be told
that no, the chasm is too wide, you can't cross over to OSGi, rewrite
or start again, there are just too many LOC.
So you have raised some important questions. Some of our users have
had a lot of success with Maven (recognising there are pros and cons
with module based versioning and transitive dependencies), where
versioning at the module level allows codebase annotations to be
utilised in remote invocation, avoiding class visibility issues by
mapping module ClassLoaders directly to a URI based identity. However,
with OSGi there's a mismatch between different JVMs in how bundles and
package imports will end up being resolved and wired, so we can't rely
on codebase annotations for OSGi.
JVMs using OSGi frameworks are quite likely to have different
dependency graphs (wires) between bundles and their package imports.
While I don't expect to solve the world's problems or boil the ocean,
I'm looking for the most workable compromise: one that doesn't promise
the world and makes it easier to explain what users can and can't
expect to do.
I'm relatively pragmatic. To me it seems logical that a subset where
two identical bundles (which should have resolved similar package
import versions) communicate would be a good place to start.
Hence my post on this list, as I realise many of you have already spent
a lot of time bumping into these issues.
Cheers,
Peter.
On 20/02/2017 6:38 PM, Peter Kriens wrote:
After working in this area for too many years I've come to the
conclusion that objects cannot really be transferred to other systems
in a reliable way, only self-typed data can. JPA, RMI, and many other
systems promise the programmer heaven: that they can use their
objects locally and remotely, transparently. The consequence of this
dream is a huge amount of complexity that far outweighs any gains in
programmer friendliness. Few things have caused so much trauma in the
software world as ORM. (Persistence is communication with a future
process.)
The reason objects are so complex to use in communications is that it
is in direct violation of the OO goal of hiding your data. However,
once you expose the internal data on the wire you have effectively
made it public, but too many people believe they can still have the
advantages of abstract data types. OSGi is a bitch in this case
because it tells you that you're trying to do something wrong by
refusing to cooperate. In this case, it balks at you because you
create an invisible dependency between the sender and the receiver.
Though this is a good thing, too often the receivers of this message
blame the messenger.
You can handle this dependency, but what you'll find is that it is a
hugely complex task that introduces a lot of frailty into the overall
system. Having tried this several times I can assure you that any
gains in programmer friendliness are dwarfed by the complexity of
creating this facade.
The best solution I found is to give up on data hiding. The fact that
your object is on the wire means that that wire format is public. I
therefore use Data Transfer Objects, in my case objects with public
fields. On both sides I have my own objects that provide behaviour for
this data with methods and classes, but this data record is at the
core of my code. Since this data is public, because it goes over the
wire, it is better to wrap your code around that 'standardized public'
object than to try to hide your internal object data.
If you look at the OSGi specifications of the past 5 years you will
notice that all applicable APIs have been designed to be useful with
Distributed OSGi. Calls do not pass objects; they pass DTOs back and
forth. They do not rely on the receiver and sender having exactly the
same type and version. In this model it is easy to replace an endpoint
using another language, which is a really good sign.
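A minimal sketch of such a DTO (the type and fields here are invented),
in the style of the public-field subclasses of org.osgi.dto.DTO used
throughout the newer specifications:

    import org.osgi.dto.DTO;

    // A pure data record: public fields, no behaviour, no hidden state.
    // The wire format is exactly what you see.
    public class OrderDTO extends DTO {
        public long id;
        public String customer;
        public long createdMillis;
        public String[] lineItems;
    }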
For Java developers this is often an unpleasant message, and quite
often OSGi gets the blame. However, the fact that OSGi gives you these
problems means that you're trying to do something that has hidden
dependencies.
Distributed computing has 7 well-known fallacies [1], but I strongly
believe that there is an eighth: 'One can communicate objects over a
network'.
Now your question. Yes, you could run a resolve and load the proper
bundles, but you would introduce a huge number of error cases and a
large amount of complexity, and you won't solve the fundamental
problem.
Kind regards,
Peter Kriens
[1]: https://en.wikipedia.org/wiki/Fallacies_of_distributed_computing
On 20 Feb 2017, at 05:13, Peter <j...@zeus.net.au> wrote:
Hello,
I'm currently working on converting an existing application to OSGi.
This application has a network service architecture based on Java
interfaces. I've broken the application into modules, using a Maven
build, which uses bnd and bndtools to create bundle manifests. Some
of these modules are ServiceLoader providers, so I've used
annotations to ensure these are loaded into the OSGi service registry
using the Service Loader Mediator.
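(For reference, the Service Loader Mediator wiring (chapter 133) boils
down to manifest headers along these lines; the provider type below is
just a placeholder:

    Require-Capability: osgi.extender;
     filter:="(osgi.extender=osgi.serviceloader.registrar)"
    Provide-Capability: osgi.serviceloader;
     osgi.serviceloader="com.example.spi.Codec"

with consumer bundles requiring the osgi.serviceloader.processor
extender instead.)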
The main issue that I face is that this is a networked application
with its own remote invocation protocols (which currently utilise Java
Serialization, but not Java RMI). As you'll appreciate, class
visibility is a little different in OSGi. :)
The services mentioned above are remote services. These remote
services have a proxy which implements the service interface, and
these services are discovered and installed at the client. There are
two types of proxies: one, called a smart proxy, requires a codebase
from which to retrieve a jar or jar files that are downloaded and
installed at the client (traditionally during deserialization); the
other type of proxy is called a dynamic proxy (it's basically just an
instance of java.lang.reflect.Proxy), which is dynamically generated
at the client.
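For anyone unfamiliar with the second flavour, a minimal sketch of how
such a dynamic proxy is generated (the interface and endpoint types
here are made up; the real transport is of course more involved):

    import java.lang.reflect.InvocationHandler;
    import java.lang.reflect.Proxy;

    public final class ProxyFactory {

        // Hypothetical transport abstraction, only here to make the sketch
        // self-contained; it stands in for whatever sends the call remotely.
        public interface RemoteEndpoint {
            Object invoke(String method, Class<?>[] parameterTypes, Object[] args)
                    throws Exception;
        }

        // serviceType is the remote service interface from the Service API.
        public static <T> T create(Class<T> serviceType, RemoteEndpoint endpoint) {
            InvocationHandler handler = (proxy, method, args) ->
                endpoint.invoke(method.getName(), method.getParameterTypes(), args);
            return serviceType.cast(Proxy.newProxyInstance(
                serviceType.getClassLoader(),
                new Class<?>[] { serviceType },
                handler));
        }
    }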
The Service implementation is broken up into three components:
1. The service api
2. The smart proxy (resolved and provisioned into the client JVM).
3. The server
The server bundle imports packages from the smart proxy bundle, while
the smart proxy imports packages from the service api as well as
exporting its own packages, as required by the server bundle.
The server that provides the remote service has three bundles loaded:
server-impl, smart-proxy & service-api.
The client only has the service api bundle installed at deployment;
the smart proxy is resolved and provisioned before the service is
made available via the local OSGi service registry, where the client
will learn of its existence using ServiceTracker.
At first glance only the smart proxy bundle needs to be provisioned
at the client. However, for cases where a dynamic proxy is required to
implement interfaces from different packages, where class visibility
issues may exist, it may be beneficial to utilise and provision a
proxy bundle that imports all of these interfaces. One might do that
by taking advantage of Java's interface multiple inheritance: create a
bundle that contains one interface (annotated with @ProviderType)
which extends all the required interfaces, and which the bundle
doesn't export, so we ensure that the dynamic proxy has a proper
bundle manifest with all package imports and version ranges correctly
defined.
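Concretely, something like this (the package names are examples only):

    import org.osgi.annotation.versioning.ProviderType;

    // Private (non-exported) bridge interface. The bundle containing it
    // imports com.example.search.api and com.example.admin.api with proper
    // version ranges, so a java.lang.reflect.Proxy implementing it sees all
    // parent interfaces through one well-defined bundle wiring.
    @ProviderType
    interface ServiceBridge
            extends com.example.search.api.Search, com.example.admin.api.Admin {
    }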
The inbuilt remote invocation protocol has server and client
endpoints; the protocol is extensible and has a number of
implementations (for example https, http, tls, kerberos, tcp). Each
endpoint is assigned a ClassLoader when it's created.
For classes installed at the client, these are typically installed in
a URLClassLoader, usually with the application loader as parent. In an
OSGi environment, however, the smart proxy bundle will be installed at
the client and its ClassLoader utilised by the client endpoint; the
smart proxy bundle will also be installed at the server and its
ClassLoader utilised by the server endpoint. In this case the
visibility of the bundles at each endpoint will be used to resolve
serializable classes. Private smart proxy serializable classes will be
resolvable at each end, but only public classes from imported packages
will be deserializable. Since the client interacts using the Service
API, all serializable classes in the Service API packages will need to
be exported, public, and imported by both the client and the smart
proxy.
Once a bundle has been provisioned, its ClassLoader will be given to
the client endpoint and the marshalled state of the proxy unmarshalled
into it. At this point the service that the proxy provides would be
registered with the OSGi service registry for the client to discover
and consume. The smart proxy communicates with its server via an
internal dynamic proxy (java.lang.reflect.Proxy), which is used to
invoke methods on the server.
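The registration step itself is plain OSGi; roughly something like
this (the service property shown is illustrative):

    import java.util.Hashtable;
    import org.osgi.framework.BundleContext;
    import org.osgi.framework.ServiceRegistration;

    public class ProxyRegistrar {
        // api is the service interface from the Service API bundle and proxy
        // is the unmarshalled smart proxy that implements it.
        public <T> ServiceRegistration<T> register(BundleContext ctx,
                                                   Class<T> api, T proxy) {
            Hashtable<String, Object> props = new Hashtable<>();
            props.put("service.imported", Boolean.TRUE); // mark as a remote service
            return ctx.registerService(api, proxy, props);
        }
    }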
While the existing protocol uses Java serialization, it doesn't use
Java serialization's own method of resolving classes. Java
Serialization walks the stack and finds the first non-system
ClassLoader (looking for the application ClassLoader). The existing
class resolution method isn't suitable for OSGi; however, the
mechanism is extensible, so it can be replaced with something suitable.
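For comparison, the standard extension point in Java serialization
itself looks like this; a minimal sketch (not our protocol code) that
resolves classes against a supplied bundle ClassLoader instead of
walking the stack:

    import java.io.IOException;
    import java.io.InputStream;
    import java.io.ObjectInputStream;
    import java.io.ObjectStreamClass;

    // Resolves serialized classes against the ClassLoader handed to the
    // endpoint (e.g. the provisioned smart proxy bundle's loader).
    public class BundleObjectInputStream extends ObjectInputStream {
        private final ClassLoader bundleLoader;

        public BundleObjectInputStream(InputStream in, ClassLoader bundleLoader)
                throws IOException {
            super(in);
            this.bundleLoader = bundleLoader;
        }

        @Override
        protected Class<?> resolveClass(ObjectStreamClass desc)
                throws IOException, ClassNotFoundException {
            return Class.forName(desc.getName(), false, bundleLoader);
        }
    }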
Does anyone have any advice or experience utilising the OSGi
Enterprise Resolver Service Specification (chapter 136) and the OSGi
Enterprise Repository Service Specification (chapter 132) to resolve
and provision a bundle for the smart proxy at the client?
The intent here is that the bundle manifests at each endpoint will be
used to determine class visibility, so the resolution and provisioning
process will be of critical importance.
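In case it helps frame the question, this is roughly the shape I have
in mind, using the Repository service to locate the smart proxy bundle
by identity before installing it (a rough sketch only: the symbolic
name and filter are placeholders, the cast to RepositoryContent
assumes the repository exposes content that way, and the Resolver
service isn't wired in here):

    import java.util.Collection;
    import java.util.Collections;
    import java.util.Map;
    import org.osgi.framework.BundleContext;
    import org.osgi.resource.Capability;
    import org.osgi.resource.Requirement;
    import org.osgi.resource.Resource;
    import org.osgi.service.repository.Repository;
    import org.osgi.service.repository.RepositoryContent;

    public class SmartProxyProvisioner {
        public void provision(BundleContext ctx, Repository repo) throws Exception {
            // Requirement on the smart proxy bundle's identity.
            Requirement req = new Requirement() {
                public String getNamespace() { return "osgi.identity"; }
                public Map<String, String> getDirectives() {
                    return Collections.singletonMap("filter",
                        "(osgi.identity=com.example.smart.proxy)");
                }
                public Map<String, Object> getAttributes() {
                    return Collections.emptyMap();
                }
                public Resource getResource() { return null; }
            };
            Map<Requirement, Collection<Capability>> found =
                repo.findProviders(Collections.singleton(req));
            for (Capability cap : found.get(req)) {
                RepositoryContent content = (RepositoryContent) cap.getResource();
                ctx.installBundle("smart-proxy", content.getContent());
                return; // real code would also resolve and start the bundle
            }
        }
    }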
For anyone curious, the application is a fork of Apache River / Jini
and I'm experimenting with support for OSGi. I'm also a committer
and PMC member of Apache River. This isn't the old Jini we all know
and love, however; there are some additional features that allow
provisioning to occur using a feature called delayed unmarshalling,
so we can avoid the need for codebase annotations and URLClassLoaders.
The work in progress can be found here, for anyone who's curious:
https://github.com/pfirmstone/JGDMS/tree/Maven_build/modularize/JGDMS
Regards,
Peter.
_______________________________________________
OSGi Developer Mail List
osgi-dev@mail.osgi.org
https://mail.osgi.org/mailman/listinfo/osgi-dev