Thanks Gregg,
You're spot on.
The memory explosion caused by multiple class loaders, one for each
remote location, is largely eliminated by your solution of preferring
local classes. This works because all marshalled instances are
unmarshalled in the one classloader and utilise the same bytecode,
which also improves compatibility by preventing isolation of otherwise
compatible objects through classloader-tree type differences.
I think utilising the OSGi framework combined with Codebase services,
eliminating the coupling between codebase URLs and classloaders, could
achieve a similar (though not quite as large) reduction in downloaded
code. All classes from a package can use the latest compatible bundle,
sharing the same classloader, bytecode, and any other packages that are
depended upon, significantly reducing RMIClassLoader explosion and
duplicate bytecode. Once a compatible bundle has been downloaded, it
can be utilised for all instances of that class; I believe we should
prefer the latest compatible bundles.
Package version metadata (specified in each bundle) can be stored in
MarshalledObject instance metadata; the OSGi versioning scheme
specifies compatibility across bundle upgrades.
In reality, MarshalledObjects are a compromise to reduce downloading
remote code: they duplicate information held within the binary
marshalled form of an object (such as implemented interfaces). Keeping
duplicated information to a minimum is important; whenever we duplicate
data, we risk duplication errors or increased size. At some point it
becomes more efficient to just unmarshall the object rather than
increase stored metadata. This is best done at the client: internet
servers just won't have the resources to perform queries, while the
security implications of executing foreign code are significant.
I think the full-load issues in Reggie can be fixed, so it doesn't fail
under load, levelling out instead. Although I think you're right:
Reggie isn't suited for the internet. Perhaps some type of global
indexing service could crawl the list of available services through
DNS-SD and store their marshalled proxy instances, performing the
functions that Reggie currently performs. Perhaps an interface inserted
into the hierarchy implementing a subset of Reggie's current methods
for some compatibility with Reggie? Another interface might implement
other methods that assist in filtering the results, or return a
bytestream, where proxies are unmarshalled one at a time at the client
and inspected, then dropped until a suitable match is found; the
remaining bytestream can be discarded. Garbage collection would clean
up unwanted proxies during bytestream inspection, keeping memory usage
to a minimum.
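The one-at-a-time inspection idea might look something like this sketch, where plain Java serialisation stands in for Jini marshalling and the matching predicate is just an instanceof test (both are illustrative assumptions; a real implementation would use MarshalledObject and sandboxed unmarshalling):

```java
import java.io.*;

// Illustrative sketch: a client reads candidate proxies from a bytestream
// one at a time, keeps the first suitable match, and discards the rest.
// Non-matching candidates become garbage immediately, so memory stays low.
public class StreamInspect {

    /** Reads objects from the stream until one matches the wanted type, then stops. */
    static Object firstMatch(InputStream in, Class<?> wanted) throws Exception {
        try (ObjectInputStream ois = new ObjectInputStream(in)) {
            while (true) {
                Object candidate;
                try {
                    candidate = ois.readObject();
                } catch (EOFException eof) {
                    return null; // stream exhausted, no match found
                }
                if (wanted.isInstance(candidate)) {
                    return candidate; // remaining bytes are simply discarded
                }
            }
        }
    }

    public static void main(String[] args) throws Exception {
        // Build a stream of mixed "proxies".
        ByteArrayOutputStream buf = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(buf)) {
            oos.writeObject(Integer.valueOf(7));
            oos.writeObject("the-service-proxy");
            oos.writeObject(Integer.valueOf(9));
        }
        Object match = firstMatch(new ByteArrayInputStream(buf.toByteArray()), String.class);
        System.out.println(match); // prints "the-service-proxy"
    }
}
```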
Looking at this service type definition, Daniel is using DNS-SD to
locate a Jini service directly:
Multiple identical matches are returned with an index integer appended.
DNS SRV (RFC 2782) Service Types at
http://www.dns-sd.org/ServiceTypes.html contains this service entry:

    jini  Jini Service Discovery
          Daniel Steinberg <daniel at oreilly.com>
          Protocol description: Convention giving a deterministic
          programmatic mapping between Jini service interface names and
          subtypes of the DNS-SD service meta-type "_jini._tcp". For
          example, a client wishing to discover objects that implement
          the "com.oreilly.ExampleService" interface would browse for
          the DNS-SD service subtype
          "ExampleService.oreilly.com._sub._jini._tcp".
          (Note: Using Apple's Bonjour programming API, service subtypes
          like this are expressed as a comma-separated list following
          the main type, e.g. "_jini._tcp,ExampleService.oreilly.com".
          This allows an object that implements several interfaces to
          specify all of those interfaces in a list when it registers
          its service. When browsing for services, at most a single
          subtype is allowed.)
          Defined TXT keys: None
Some observations:
1. Re-discovering the correct identical service would be almost
impossible with DNS-SD
2. Reggie is not designed for a world wide network.
3. Interfaces utilised by services must not change over time; they
can be extended when change is required.
4. Filtering is limited to qualified names.
5. Additional Types can be registered for one service instance.
6. We can crawl the DNS-SD list downloading marshalled proxies,
forming the basis of a new lookup service implementation.
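Observation 3 is the usual Java evolution idiom: rather than altering a published service interface, a new interface extends it, so clients compiled against the old type keep working. A tiny illustration (all names below are hypothetical):

```java
import java.rmi.Remote;
import java.rmi.RemoteException;

// Hypothetical service interfaces illustrating extension rather than change.
// Clients compiled against PrinterService keep working unmodified.
interface PrinterService extends Remote {
    void print(String document) throws RemoteException;
}

// A later revision adds capability by extending, never altering, the original,
// so a DuplexPrinterService proxy still matches lookups for PrinterService.
interface DuplexPrinterService extends PrinterService {
    void printDuplex(String document) throws RemoteException;
}
```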
Some thoughts:
1. You would be aware of which services you would want to make
global, perhaps a configuration option?
2. We probably want to restrict service proxies to simple reflective
proxies with Secure JERI for now; security is much simpler.
3. Smart proxies, hmm... proxy verification needs more thought.
4. Interfaces for services should be in separate bundles from
implementations, to prevent interface duplication in local JVM
ClassLoaders when implementations change in an incompatible manner
(versions over time), allowing a service to remain an abstraction.
The OSGi framework could locally upgrade an interface bundle when a
new service utilises a new interface, or an interface extending
existing interfaces, allowing old and new service implementations
to be utilised as the same type within a local JVM.
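As a sketch of that bundle split (the bundle and package names here are made up for illustration), an API-only bundle exports just the service interfaces, while the implementation bundle imports them within a version range and exports nothing:

```
api bundle MANIFEST.MF (exports only the service interfaces):

  Bundle-SymbolicName: com.example.printer.api
  Bundle-Version: 1.0.0
  Export-Package: com.example.printer.api;version="1.0.0"

impl bundle MANIFEST.MF (imports the interfaces, keeps internals private):

  Bundle-SymbolicName: com.example.printer.impl
  Bundle-Version: 1.0.0
  Import-Package: com.example.printer.api;version="[1.0,2.0)"
```

An incompatible new implementation then ships as a new impl bundle, while every client and service in the JVM keeps resolving the same api bundle, and hence the same interface type.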
DNS-SD might be used when we don't care about identity or matching
semantics.
What about when we do care about identity?
Similar ground has been covered before in Project Neuromancer with
XuidDirectory; perhaps this could be a crawler instead of utilising
registration? Don't worry about leasing, just repeat the crawl,
discarding the older data on a cyclic basis? Would it be safe to
unmarshall a downloaded proxy instance to query for its Xuid, provided
it was sandboxed with no permissions and utilised integrity checking?
What do you think Jim?
interface UuidDirectory {
    Lease register(Xuid id, Object o, long leaseLen)
        throws UnknownXuidException, RemoteException;
    Lease[] register(Xuid[] ids, Object[] o, long[] leaseLens)
        throws UnknownXuidException, RemoteException;
    Object lookup(Xuid id, XuidDirectory[] visited)
        throws UnknownXuidException, RemoteException;
}
Not only would we need worldwide-available Codebase services, but
caching Codebase services as well.
eg interfaces:
org.apache.river.global.CodebaseService //code I make available which I
sign.
org.apache.river.global.CachingCodebaseService //code I've dynamically
downloaded, not signed by me ( extends CodebaseService)
A caching codebase service is important to ensure bytecode remains
available over time when other service locations go down.
Unsigned jar files would not be allowed in a Codebase Service.
For objects passed between services where identity is important, local
JVM immutability would be desirable. The same object is duplicated
across nodes and doesn't change, so we don't need to coordinate
transactions etc.
All platform services would rely on local code. Platform code could be
upgraded worldwide on a periodic basis using codebase services and the
OSGi platform. I wonder how Project Jigsaw will pan out; perhaps the
JVM will become dynamically upgradeable also?
Your Thoughts?
Peter.
Gregg Wonderly wrote:
One of the primary issues with the current lookup server design and
the ServiceRegistrar interface in particular is the fact that one can
only receive unmarshalled services. My work on providing marshalled
results, visible in the http://reef.dev.java.net project, allows the
opportunity to find stuff without getting a JVM memory explosion.
However, there is a further issue, and that is in order to "see into"
the marshalled object you need to either resolve it or dive into the
stream of bytes. My further work on the PreferredClassLoader
mechanism for establishing "never preferred" classes helps to make it
possible to do resolution of remote objects using locally defined
class instances, so that you can, for example, look at Entry objects.
Also, in my reef work, I investigated adding the names of all classes
that are visible in the type hierarchy of the objects so that you
could ask "instanceof" kinds of questions without unmarshalling.
There are just all kinds of issues related to this that come into
play. Performing a Jini lookup, on the internet today, would be like
asking your web browser to open a tab for every page on the net, and
then waiting for that to finish so that you could click through the
tabs to find what you are looking for.
Clearly, lookup needs to be a completely different concept to exist in
a large world such as is visible "on the internet."
Gregg Wonderly
Peter Firmstone wrote:
Anyone got any opinions about Lookup Service Discovery?
How could lookup service discovery be extended to encompass the
internet? Could we utilise DNS to return locations of Lookup Services?
For world wide lookup services, our current lookup service might
return a massive array with too many service matches. Queries present
the opportunity to reduce the size of returned results, however
security issues from code execution on the lookup service present
problems.
If we did allow queries on a Lookup Service, could we do so with a
restricted set of available Types utilising only trusted signed
bytecodes? If bytecode becomes divorced from the origin of a
Marshalled Object, and instead obtained from a trusted codebase
service, then perhaps we could have a system of vetting source code
submitted for the purpose of becoming trusted authorised query
types? Any query utilising untrusted bytecode might return an
UntrustedByteCodeException?
Perhaps we could make service match results available as a
bytestream, clients that couldn't handle large amounts of data could
inspect the bytestream, continually discarding what isn't required?
Check out this link on DNS service discovery:
http://files.dns-sd.org/draft-cheshire-dnsext-dns-sd.txt
Cheers,
Peter.