I have a lot of comments inline but want to summarize overall by
saying I think we should address all of your concerns by
incrementally improving core2. As you said below, you are not arguing
for a rewrite, and I think incremental improvement would be the best
way to accommodate the wide variety of things the community is
interested in working on (not everyone wants to work on the "baby
steps", as valuable as they are).
Having worked with you for over a year, I'm absolutely sure you can
make significant contributions to improving what we have in core2.
How about it?
Jim
On Jul 7, 2006, at 10:17 AM, Jean-Sebastien Delfino wrote:
More comments inline.
Jim Marino wrote:
Comments inline
On Jul 6, 2006, at 6:17 PM, Jean-Sebastien Delfino wrote:
Jeremy,
I won't comment on your attacks at the bottom of this email. I
was hoping for a more constructive technical discussion. I added
my answers and comments on the specific technical issues inline.
Jeremy Boynes wrote:
On Jul 5, 2006, at 12:43 PM, Jean-Sebastien Delfino wrote:
My proposal is not to merge M1 and the core2 sandbox. I am
proposing to start a fresh new code stream and build the
runtime through baby steps. We may be able to reuse some pieces
of existing code, but what is more important is to engage our
community in this exercise and integrate the new ideas that will
emerge from it.
I don't believe the two issues are necessarily coupled. Quite a
few members of the community are engaged on the sandbox code
already, and we could work with you to improve that rather than
having to throw everything out and start over with all new ideas.
Here's an example where I'm struggling with both M1 and the
core2 sandbox and thinking that we can do better if we start
with a new fresh stream: our (recursive) assembly metadata model.
- M1 does not implement the recursive composition model and
would require significant changes to support it. Core2 is an
attempt to implement it, but I'm not sure it's quite right, and I
also think it can be simplified.
It would really help if you could come up with concrete areas
where it is not right or where it could be simplified - for
example, end user scenarios that are not supported.
- M1 used Lists to represent relationships; Core2 uses Maps. I
think M1 was better since it allowed us to keep the ordering of
the relationships.
There's nothing I remember in the assembly spec where order
matters. On the other hand there are many areas where things are
keyed by a name which has to be unique. This seems like a
natural mapping (sorry) to a Map. In M1 I started to move toward
simple map structures but you replaced it with what seemed like
a fairly complicated specialized List implementation that sent
notifications that updated a Map anyway. Given the desire for
simplification, are there any end-user scenarios that require
ordering to be preserved and that can't be supported with a
basic HashMap or LinkedHashMap?
As an administrator I'll want my administration tool to display
components in the order I declared them in the SCDL.
SCDL isn't the only form assembly can be serialized to/from. Also,
if I were an admin, I'd probably want to sort the components
according to some useful criteria, not how they are listed in the
SCDL, as most admins will never look at XML. One could always use
LinkedHashMap though.
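For example, a minimal sketch (hypothetical names; "Component" just
stands in for whatever the real model class is):

    import java.util.LinkedHashMap;
    import java.util.Map;

    // Minimal sketch: keyed access plus preserved declaration order.
    class OrderedAssemblySketch {
        static class Component {}

        void load() {
            Map<String, Component> components =
                new LinkedHashMap<String, Component>();
            components.put("catalog", new Component()); // first in the SCDL
            components.put("store", new Component());   // second in the SCDL
            for (Component c : components.values()) {
                // iterates in insertion order: catalog, then store
            }
        }
    }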
Maybe SCDL isn't the only form, but that is beside the point: we
need to support SCDL, don't we? As soon as you put assembly elements
in a document that a user/developer can edit, the order is relevant.
I disagree with your statement about administrators. They often
look at and work with XML configuration files. If you want to
support other sort criteria in addition that's fine, but admin,
config and editing tools need to at least support the order from
the XML document.
Actually that is generally not the case regarding XML in a
datacenter. In my experience, most admins in datacenters change
things through admin consoles or scripts so that there is an audit
history (in U.S. financial services institutions it is obligatory).
Of course admins in less bureaucratic environments may tweak XML
configuration files (I should have worded my previous response
better), but my point was very few admins crack deployment archives
and mess with application artifacts. That is extremely bad practice
and something we should never encourage. This was a fundamental
design principle we had in the spec group (i.e. deployment units
should not be cracked) and hence we included an override mechanism in
the Assembly Specification. For dynamic wiring, assemblers (a
different role than admins) would use some kind of tool; having them
edit an XML file would not be the best approach to this problem.
So, I just don't see the value in this use-case, although it would be
trivial for us to implement, even though it promotes an anti-pattern
and unnecessarily complicates what we currently have.
I'll also want a configuration or admin tool that loads/saves
modified SCDL to write things in the order they were in
initially, not in a random order. As an application developer I'd
like an SCA debugging tool to show me my components in a
list in the right order as well. Also, if I want to implement the
model defined by the XML schemas in the spec using any of the
DataBinding technologies out there, I'll end up with Lists, not
Maps.
We have been using StAX just fine for this and it accommodates a
number of databinding solutions for extensions. Are you proposing
we revisit the decision, made back before the M1 release, to go
with StAX loading? If so, for what reasons? BTW, not all
databinding solutions will have problems - XStream will work just
fine with what we have. Also, are you sure about XMLBeans and JAXB
or are you just speaking about a current implementation of SDO?
Not quite correct: the decision we made back before M1 was to go
with StAX loading, write the loaders by hand for now, and see how
the SDO team could generate this code after M1. Independent of
that, I don't want to tie us to any specific data binding, so we
had better pick representations for model relationships that are
commonly used by most databindings to represent XSD <element...
maxOccurs="unbounded"/>, i.e. Lists, not Maps.
JAXB, Castor, XStream, and JiBX (through a shipped extension)
support Maps, and probably a number of other databinding solutions do
as well. But let's set that aside since there is a more important
issue...
The decision as I recall was twofold. First, go with a runtime model
that was natural for Java developers and supported Java idioms, not
the constraints of a particular databinding solution. In my book, if
a databinding solution cannot accommodate the requirements of the
runtime, it is not the right tool for the job, in this case, loading
of core configuration data (not extension configuration). Jeremy
chose to implement this requirement with StAX. It worked, was simple,
and provided the ability to have extension developers use their
databinding framework of choice to load required configuration
information (which may involve evaluating artifacts other than XML).
This last feature was important, as the runtime must be able to
handle multiple databinding technologies simultaneously.
The second part of the decision was to decouple the runtime from SDO,
not because people don't like SDO, but because doing so promotes
choice, modularity, and simplicity. This is entirely consistent with
SCA, which does not mandate SDO and which I imagine will be used with
a variety of databinding technologies (e.g. JAXB). It also allows
people to come and work on (or extend) the runtime without having to
learn SDO (or any other particular databinding technology).
Also, just to beat a dead horse further (how long have we been having
this debate ;-) ), to me, and probably a lot of other Java
developers, StAX is a pervasive and simple way of dealing with XML -
it's in the javax namespace, pervasive in open source, and will be in
the JDK. Given that we can use SDO, JAXB, etc. to handle extensions,
what's the problem with using what we have? What benefit do we gain
by constraining the runtime model's use of very common (and in my
opinion effective) Java idioms?
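To make that concrete, here is roughly what a hand-written StAX loader
looks like (a simplified sketch only, not the actual loader SPI; the
Component holder and the dispatch comment are stand-ins):

    import javax.xml.stream.XMLStreamConstants;
    import javax.xml.stream.XMLStreamException;
    import javax.xml.stream.XMLStreamReader;

    // Sketch of a hand-written StAX loader for a <component> element.
    // Assumes the reader is positioned on the START_ELEMENT.
    public class ComponentLoaderSketch {
        static class Component {
            String name;
        }

        public Component load(XMLStreamReader reader) throws XMLStreamException {
            Component component = new Component();
            component.name = reader.getAttributeValue(null, "name");
            // A real loader would dispatch each child element
            // (implementation, reference, ...) to the loader
            // registered for its QName; here we just skip them.
            while (reader.next() != XMLStreamConstants.END_ELEMENT) {
                // skip or delegate child content here
            }
            return component;
        }
    }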
Finally, even if we decided to use Maps in some cases to provide
keyed access to some elements of the model, we'd have to do it
differently. For example, a single Map containing all components,
references and services in a composite (according to the spec
they cannot have the same names) instead of the three Maps you
have in Core2.
And this is why LinkedHashMap will not help you here.
Again, this is trivial to implement and LinkedHashMap will do just
fine with many of the databinding solutions available today.
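Something along these lines would do (names illustrative only, not
code from either tree):

    import java.util.LinkedHashMap;
    import java.util.Map;

    // One ordered map for every named part of a composite, since the
    // spec says components, services and references share a namespace.
    // "ModelObject" is an illustrative common supertype.
    class CompositeSketch {
        static class ModelObject {}

        private final Map<String, ModelObject> parts =
            new LinkedHashMap<String, ModelObject>();

        void addPart(String name, ModelObject part) {
            if (parts.containsKey(name)) {
                throw new IllegalArgumentException("duplicate name: " + name);
            }
            parts.put(name, part); // declaration order is preserved
        }
    }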
- Core2 only defines implementation classes for the model. I
think we should have interfaces plus default implementation
classes instead, like we had in M1, to allow for alternate
implementations of the model.
One of the most complex things with the M1 model was all the
interfaces involved, the need to pass factory implementations
around, the number of different factories involved (one per
extension implementation) and the potential issues with code
assuming its implementation of the factory was the one used.
The core2 model uses concrete classes which are really just data
holders - there's no behaviour in them to be abstracted through
the interface. This gives a much simpler programming model for
extensions using the model.
Do you have any scenarios that would require different
implementations of the model? Are they so different that they
might as well just use different classes?
I don't think that having just implementation classes is much
simpler. If you interact with the model SPI, reading interfaces
is simpler IMO and more suitable for inclusion in a specification
document... allowing multiple implementations of these
interfaces. Also, we have to support the whole lifecycle of an SCA
application (development, deploy/install, runtime, admin, etc.)
and I'd like to allow some flexibility for different tools,
running at different times, to use different implementations of
the assembly model interfaces.
Oisin from the STP project said the POJO-based approach would suit
them just fine. I don't see the complexity. On the contrary, all
of the AssemblyFactories we had in M1 led IMO to a massive
antipattern where they were passed throughout the core. I'm happy
to walk through the relevant code if people are interested. All
the factories did was new up a POJO. Not worth the complexity in
my opinion, but I'm happy to compare the work in the sandbox with
your proposal if you'd like to walk us through it.
When the runtime depends on too many factories, that is the
manifestation of bigger coupling problems. The factories for all
the extensions should not be visible at all from the core runtime,
and if we externalize the WSDL and Java interface support and the
Java implementation support out of the core like I'm proposing,
you're not dealing with many factories.
But we will be. Factories will be proliferated through the entire
loader infrastructure unless the loaders new the factories up, and
that seems a bit superfluous given the factories themselves just new
up POJOs. Just new the POJOs up and things are simple. Also,
factories will be proliferated across our testcases, which was also
a cause of needless complexity in M1.
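To spell out the contrast, here are hypothetical minimal versions of
both styles side by side (illustrative names, not the M1 or core2
classes):

    // M1 style: the factory is threaded through every loader and
    // testcase, yet all it ever does is new up a POJO.
    interface AssemblyFactory {
        Component createComponent();
    }

    // core2 style: Component is a plain data holder...
    class Component {}

    class LoaderSketch {
        Component withFactory(AssemblyFactory factory) {
            return factory.createComponent(); // indirection buys nothing
        }

        Component withoutFactory() {
            return new Component(); // ...so just new it up
        }
    }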
I am sure that tooling projects will need to add much to this
model: support for events, change tracking, tracking between XML
elements and model objects to provide proper feedback to
application developers, integration with modeling technologies used
in the tooling world, support for cloning maybe... tons of things.
I spent several years developing tools so I think I know what I'm
talking about here. The first thing I'll ask as a tooling developer
is: please give me interfaces so I can hook what I need into the
implementations.
And the tooling people should go do all of that (if they want to) but
keep it out of the runtime (and vice versa) ;-) ! We are writing code
for a runtime, not a tooling environment. Use change notification,
interfaces, round-tripping support, cloning, whatever when writing a
tool. In other words, tooling people should build the technology that
is right for them and the runtime people likewise. Sharing a discrete
(and relatively small) number of classes really doesn't buy that much
given the divergence in use cases between tooling and runtime. If we
can do it, great, but we should not compromise the runtime or tooling
to do so.
Also, I'm not sure your requirement for interfaces is shared by
everyone on the tooling side. Oisin (Eclipse STP) indicated he would
be fine with the POJO approach.
I'm happy to walk people through the interfaces or answer any
questions on the list,
Great, how about doing a diff between core2 and your proposed
approach and how core2 could be improved to accommodate your issues?
- Overuse of Java Generics breaks flexibility in some cases.
For example, Component<I extends Implementation> will force you
to recreate an instance of Component to swap its implementation
for an implementation of a different type (and lose all the
wires going in/out of the component).
There may be cases where generics are overkill but I don't
think that really requires us to throw out the model. There are
other cases where the use of wildcards would be appropriate; for
example, in the scenario you give here you could just create a
Component<Implementation> to allow different types of
implementation to be used.
Then instead of

    Component<Implementation> {
        Implementation getImplementation();
    }

I think we can just do

    Component {
        Implementation getImplementation();
    }
What we have now in core2 is overkill IMO.
Then do we need to cast to the right impl type?
The core runtime should not have to cast, simply because it should
not depend on any component implementation type (not even the Java
or System implementation types).
A loader will need to cast the above.
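Something like this (hypothetical names, just to show where the cast
would land with the plain, non-generic Component):

    class Implementation {}

    class JavaImplementation extends Implementation {
        Class<?> implementationClass;
    }

    // The proposed non-generic Component data holder.
    class Component {
        Implementation implementation;
        Implementation getImplementation() { return implementation; }
    }

    class JavaLoaderSketch {
        void configure(Component component) {
            // Only the Java-specific loader narrows the type; the core
            // never needs to know about JavaImplementation at all.
            JavaImplementation impl =
                (JavaImplementation) component.getImplementation();
        }
    }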
- Core2 defines ReferenceDefinitions (without bindings) and
BoundReferenceDefinitions (with bindings). IMO there are
Reference types and Reference instances and both can have
bindings.
or Reference.
I'm with you here - we need to refactor the way bindings are
handled for both Service and Reference. One thing the sandbox
model is missing is the ability to associate multiple bindings
with a single Service/Reference.
My main point is not about supporting multiple bindings on a
Service or Reference. I think this is secondary and the
interfaces I put in my sandbox to support a design discussion
don't even have that either. My point is that Services,
References, and their instantiation by Components are at the
foundation of the SCA assembly model... and therefore need to be
modeled correctly. I'm proposing a different design, illustrated
by the interfaces I checked in.
Could you elaborate?
I think it should be very simple:
- Component types have service and reference types
- Components are instances of component types and have services and
references, which are instances of the service and reference types
- Service and reference types can have bindings
- Bindings can be overridden in service and reference instances
This is clear when you look at a Composite. A composite is a
Component Type, has service and reference types (aka composite
services and references) which can have bindings.
A component can be implemented by a Composite; it has services and
references, which can use the (default) bindings from their
respective service and reference types, or specify (override)
bindings.
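In interface form, the relationship I have in mind looks roughly like
this (illustrative only, simpler than the interfaces I checked in):

    import java.util.List;

    interface Binding {}

    // The type declares default bindings...
    interface ServiceType {
        List<Binding> getBindings();
    }

    // ...the instance can override them.
    interface Service {
        ServiceType getServiceType();
        List<Binding> getBindings(); // empty means "use the type's defaults"
    }

    class BindingSketch {
        static List<Binding> effectiveBindings(Service service) {
            List<Binding> overrides = service.getBindings();
            return overrides.isEmpty()
                ? service.getServiceType().getBindings()
                : overrides;
        }
    }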
I would find it extremely useful if you could perhaps compare this
with what we have in core2 and point to how core2 could be improved
to accommodate some of your concerns in this area.
- I think that Remotable should be on Interface and not Service.
I agree Service is wrong and that it should be on
ServiceContract. Thanks for catching it.
- Scope should be defined in the Java component implementation,
separate from the core model.
Scope is not a Java specific concept.
Interaction scope (stateless vs. stateful) can apply to any
ServiceContract.
Container scope is the contract between an implementation and a
ScopeContainer and applies to any implementation type that can
support stateful interactions. This would include JavaScript,
Groovy, C++, ... I think that means that support for state
management (which is what scope is configuring) belongs in the
core with the configuration metadata supplied by the
implementation type.
I don't think that's quite right. First, interaction scopes are
defined on interfaces and not service contracts. Also, they
control whether an interface is conversational or not,
independent of any state management.
Anyway I was talking about a different scope, the implementation
scope defined in the Java C&I spec, which governs the lifecycle
of Java component implementation instances. I think the
definition and implementation of lifecycle management will vary
greatly depending on the component implementation type, for
example Java component implementations and BPEL component
implementations typically deal with this in a very different way.
Well, I don't think that's the case at all and actually there is a
concept of implementation scope in assembly - it just varies by
implementation type, which is entirely consistent with our design.
BPEL is the odd case, and this came up as we wrote the scope
changes into the spec (Ken did a lot of the work here). Across
many implementation types, e.g. Groovy, JavaScript, Ruby, Jython,
etc. (maybe even C++) I see use for the same scopes as in Java. Do
you disagree?
Also, I'm curious why you think the scope containers complicate
the core and need to be moved out? Or are you saying this based on
your reading of the spec? They seem quite simple to me.
I'm saying that scope management is specific to the implementation
type and therefore needs to be made pluggable, i.e. moved out of
core. The Java scope management is just one example of scope
management.
We have it partly pluggable. There are just a few more things to do.
Would you care to help out on this?
I think it makes sense to keep commonly used scopes in core and have
implementation specific ones as plugins, not necessarily tied to an
implementation (outside of BPEL, most scopes are probably applicable
to a wide variety of types).
Therefore, in my view state/lifecycle management should be left
to the component implementation contributions and not belong to
core.
I think this would lead to over-complication, particularly for the
extension developer. Right now, scope containers can be reused. In
particular, how would conversational services be handled? If I
want to use module or session scope containers for my Groovy
script, would I have to write those rather than just reuse what
the core gives me? Also, by reusing, we allow an additional
type of extension in terms of scope. For example, someone could add a
distributed cache scope and have that shared by Groovy, Java,
whatever.
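And the contract is small, which is exactly why reuse works. Roughly
(a sketch, not the actual SPI):

    // Sketch: one scope-container contract shared across implementation
    // types, so a Groovy or Java component reuses the same MODULE or
    // SESSION container instead of reimplementing instance tracking.
    interface ScopeContainerSketch {
        Object getInstance(Object componentId); // create or return per scope
        void onEvent(Object event);             // e.g. session end: destroy instances
    }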
I'll also note two things. Getting scope containers to work
properly with instance tracking is not trivial. I'd hate to push
that on extension developers. Second, this basic design has been
there since before M1. Why wasn't this brought up before since it
is such a significant issue?
- Java and WSDL interfaces should be defined separately from the
core model; we need to support multiple interface definition
languages through plugins, not in the core.
The model supports generic IDL through the use of
ServiceContract. Java and WSDL are two forms of IDL that are
mandated by the specification. This is really just a question of
where those implementations are packaged and again I don't think
this warrants a rewrite.
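Schematically (illustrative classes, not the sandbox code):

    import javax.xml.namespace.QName;

    // The core only sees the abstraction; Java and WSDL are two
    // pluggable forms of IDL layered on top of it.
    abstract class ServiceContract {}

    class JavaServiceContract extends ServiceContract {
        Class<?> interfaceClass; // a java.lang.Class carries the contract
    }

    class WSDLServiceContract extends ServiceContract {
        QName portTypeName; // resolved against a WSDL definition elsewhere
    }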
Packaging issues are important and often hide bigger
dependency/coupling problems. I think we should package the support
for Java and WSDL interfaces separately from the core to prevent any
coupling between the two, and also give people who will have to
support new interface definition languages a better template to
follow.
Individual issues do not warrant a rewrite. What about the sum of
many issues?
None of the issues warrant a rewrite, not even the sum. Most of
your criticisms seem centered around the model which is fairly
decoupled from the bulk of the core2 runtime. Even if we adopted
your changes wholesale, I'd doubt they would change the core2
runtime significantly. Even the scope containers could be moved
out without breaking anything and very little code changes,
although that would be a mistake IMO. I'm sorry but I fail to see
the need for a rewrite.
I am not proposing a rewrite of the whole runtime (see my original
email; it's not a whole rewrite). I'm proposing a staged, baby-step
approach, integrating the good work from M1 and the sandbox, with new
discussions where I think what we have is not right or where new
ideas from the group come up.
Why not just do this starting from core2? If it's not a rewrite, then
it sounds like incremental improvements. We could start from
scenarios and go through doing this. I think this approach would
accommodate those that wish to focus on end-user scenarios and those
that are focused on more technology scenarios such as conversations.
I believe we need to be inclusive of both, and cannot force one
approach to doing scenarios on the entire community. Also, I think
this is the correct *long-term* way to getting more people involved.
We will always have newcomers and we need to ensure the runtime is
modular enough so they can work their way in as deep as they are
interested. Doing a "baby-step" rebuild just for the people that are
part of the community now doesn't really teach us how to continuously
grow the community.
I'm starting with the model SPI because I think that having the
assembly model right is critical for an assembly runtime. Most of
the ideas here have an impact on the architecture of the runtime,
so I thought this was a good starting point, and also a good base
of discussion to help all in our community discuss and understand
better the new recursive composition model.
And here's where I think the crux of the disagreement lies...and it's
the same debate that started over a year ago and I thought we had
gotten past. The *configuration* model should be decoupled from the
rest of the runtime, not determine its architecture. Also, the
configuration model is only one small part of the SPI/API (I think we
need to begin to make this distinction as suggested by Jeremy).
Similarly, the SCA specifications are not blueprints for a runtime
design; they describe a wiring and programming model for service-
based applications. If multiple runtimes with divergent architectures
cannot implement SCA, then it will have failed as a set of
specifications.
Of course, that is not to say we should not have SCA concepts
reflected in the runtime architecture. Along these lines, one of the
key changes we made in core2 was to do this better with the actual
runtime structures. Namely, we reserved the "Component" naming scheme
for runtime artifacts as opposed to the configuration model (they
used to be called "Context"), as those will be dealt with by
developers working on core and extenders much more than the
configuration model will be.
Also, the model is just an in-memory representation of configuration
data, nothing more and nothing less. One of the key culprits in the
M1 architecture was the fact that we did not have this clean
distinction. We did agree to have it, we just did not evolve the code
enough in that direction, and that was one of the key driving factors
for creating core2.
- Implementation should extend ComponentType IMO instead of
pointing to it, and we may even be able to simplify and just
remove Implementation. Also I am not sure why we need to
distinguish between AtomicImplementation and
CompositeImplementation.
One of the problems the assembly spec has is that it is
difficult to do top-down design because you cannot link a
component to a componentType without having an implementation. I
agree this is an area that we (and the spec) need to sort out.
IMO a component is associated with one componentType but may
have multiple implementations so I don't think they are quite
the same thing or that either can be removed.
AtomicImplementation is a marker for implementations that cannot
have children.
In my view a component has a type. The ComponentType is either
abstract (just defining the types of services offered, references
used, and properties that can be configured), or concrete. A POJO
component implementation is a concrete ComponentType.
Perhaps we could walk through your model?
Yes, the interfaces I put under m2-design are there to illustrate
ideas and support a design discussion. I'm working on some UML
diagrams that I think will help too.
Great, perhaps another area we could discuss in the context of
improvements to core2?
- Support for Composite Includes is missing; this is a
significant part of the recursive composition model (half of
it, one of the two ways to nest composites).
It's not really half - it's really just a very small part of the
model, comparable to the <import> element we used to support in
M1. Again, I don't see why we need to rewrite the model to add
this in. Quite the opposite: you've said you've been looking for
a way to engage and this could be it.
I disagree. Includes are a very significant part of the assembly
model (the other part is the ability to use a composite as a
component implementation). Two examples:
- An included composite is the equivalent of a module fragment in
the 0.9 spec. This concept is key to allowing a team to work on
various pieces of an application, split in multiple composites,
included in a composite representing the application.
- When composites (formerly subsystems) get deployed to an SCA
system, they are actually included in that system, rather than
being used as component implementations.
It's not "half" of the recursive model. In fact, most of the time
we spent in the spec group was grappling with other issues related
to recursion.
I don't see an immediate relation between the time spent by the
spec group on a specific item and its importance for application
developers. I am looking at this from an application developer's
point of view and saying that I'll use Includes as much as
composition through (composite) components. Includes will allow a
team to distribute work on an SCA application and also represent a
key concept for system composition. I am starting to look at
scenarios and can actually see the usage of Includes in almost all
of them, but am having trouble finding good use cases for the other
form of composition (nested component implementations). So I stand
by my statement that understanding how includes work is key here.
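To illustrate the difference in semantics, a hypothetical sketch (not
code from either tree):

    import java.util.List;

    // Include merges the included composite's parts into the including
    // composite's own namespace; using a composite as a component
    // implementation instead keeps it a black box behind its services
    // and references.
    class IncludeSketch {
        static void include(Composite target, Composite included) {
            for (Component c : included.getComponents()) {
                target.addComponent(c); // names must stay unique after merging
            }
            // services, references and wires would be merged the same way
        }
    }

    interface Composite {
        List<Component> getComponents();
        void addComponent(Component component);
    }

    interface Component {}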
Yep, and that's why I originally pushed the include mechanism in the
spec group over a year ago (I didn't push the fragment classpath
approach though, so don't blame me for that one). However, to say
that it is "half" of implementing the recursive model leaves the
detail out. In this context Jeremy's metaphor of the paddling duck is
apropos. The runtime should be like a duck in that on the surface it
just merrily moves along the water but under the surface it is
paddling away. The same goes for the runtime: from the app
developer's perspective things just work and they are simple, but
under the covers the runtime is managing all of the complexity. In my
opinion, there is a lot more complexity to recursion than what the
app developer sees. Hopefully in the end, the runtime architecture is
graceful and we don't wind up with an ugly duckling as we did in M1 ;-)
This list is not exhaustive... Another idea would be to
externalize support for Composites in a separate plugin not
part of the core service model (since there may be other ways
to compose services in addition to an SCA composite, with
Spring or other similar programming models). I'd like to know
what people think about that.
Having the composite implementation type in the core does not
preclude that - again, it's just packaging for ease-of-use.
I think it's more significant than packaging. Are you saying that
we could move the code supporting composites out of core2 without
breaking the code in core2?
Why would we do this? We can already support multiple composite
implementation types - have a look at the Spring extension. That
just sounds like unnecessary complication.
Why? To avoid unnecessary and dangerous coupling that will hurt us
when we try to evolve this runtime. How to illustrate that? How
about trying to move the code supporting composites out of core2?
I'm realizing I'm asking the same question again... but I think it's
an important question, still unanswered.
I don't think moving packages around prevents dangerous coupling;
pluralism, vigilance, and good design do. It may, though,
unnecessarily compound complexity. We have a Spring extension. How
about adding more composite types to core2? This way, we can expand
the number of containers, achieve a level of pluralism, watch that we
don't over-couple, and derive a good extension design.
You seem to have the impression that the core is sealed and that
we only support things that are included within it. That is not
the case. The only place we need things bundled with the core is
in the bootstrap phase - specifically, we need them bundled with
the primordial deployer. The actual runtime is created from the
SCDL passed to that primordial deployer, can contain any mix of
components and need not contain any of the infrastructure used
to boot it.
I just checked in, under sandbox/sebastien/m2-design/model.spi, a set
of new interfaces. This is just an initial strawman to trigger
a constructive discussion and ideas on how to best represent
the recursive model. I also need help to define a scenario (not
unit test cases, but an end-to-end sample application) to help
put the recursive composition model in perspective and make
sure we all understand it the same way.
I am troubled that you have chosen to start on your own codebase
at a time when most of us have been trying to have constructive
discussion on this list. Based on the approach you proposed in
your original email I would have hoped that we could have
started with your end-user scenarios and had a chance to explore
how they could be supported by M1, the sandbox, or some other
code before starting another codebase. I'm disappointed that,
having started this very thread nearly a week ago with the
premise of community, your first response on it was to commit a
large chunk of independent code rather than follow up with any
of the other people who have already contributed to the discussion.
I think discussion led to compromise and consensus on the
scenario-driven approach that you proposed. As shown above and
in other recent threads, there's plenty of room for improvements
and/or new features in our current code and a willingness to
discuss them, albeit in terms of technical merit rather than
personal opinion. I hope you can find a way to join in rather
than forge your own path.
--Jeremy
--Jean-Sebastien
--
Jean-Sebastien