Hi Joe,
After some investigation, here is my understanding of problem 1:
there are two deployments because by default, i.e. when no target is
specified, the distribute command executes against all the
configuration stores defined by a Geronimo instance. Note that this
default behavior is also applied by other deployment components, such
as the hot directory scanner or the installation portlet. To some
extent, I believe this default behavior should be changed to deploy
to only one configuration store. I am not convinced that users
distributing an application expect it to be deployed once per
configuration store defined by the targeted Geronimo server, and
having the same configuration multiple times in a single Geronimo
instance does not make much sense.
A potentially better default behavior would be to distribute only to
the first target returned by DeploymentManager.getTargets().
Internally, our implementation of getTargets returns the "default"
configuration store as the first target.
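For illustration, here is a rough JSR-88 sketch of that proposed
default. It is only a sketch: the factory class name, connection URI
and credentials below are placeholders, not necessarily what our
command line tooling actually uses.

    import java.io.File;

    import javax.enterprise.deploy.spi.DeploymentManager;
    import javax.enterprise.deploy.spi.Target;
    import javax.enterprise.deploy.spi.factories.DeploymentFactory;
    import javax.enterprise.deploy.spi.status.ProgressObject;

    public class DistributeToDefaultStore {
        public static void main(String[] args) throws Exception {
            // Placeholder factory class and connection details.
            DeploymentFactory factory = (DeploymentFactory) Class.forName(
                    "org.apache.geronimo.deployment.plugin.factories.DeploymentFactoryImpl").newInstance();
            DeploymentManager manager = factory.getDeploymentManager(
                    "deployer:geronimo:jmx", "system", "manager");
            try {
                // Proposed default: when the user names no target, distribute
                // only to the first target, i.e. the "default" configuration store.
                Target[] targets = manager.getTargets();
                Target[] defaultTarget = new Target[] { targets[0] };

                ProgressObject progress = manager.distribute(
                        defaultTarget, new File("snoop.war"), null);
                while (progress.getDeploymentStatus().isRunning()) {
                    Thread.sleep(100);
                }
                System.out.println(progress.getDeploymentStatus().getMessage());
            } finally {
                manager.release();
            }
        }
    }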
Problem 3) is caused by problem 1).
What do you think?
Thanks,
Gianny
On 13/11/2007, at 7:14 AM, Joe Bohn wrote:
Hi Gianny,
Lots of newbie questions from me. I'm not even going to pretend
that I understand your clustering changes just yet ... so please
bear with me. I just want to point out a few things that I noticed
with a single server instance and get your take on them.
1) Deploying a simple web app. I deployed a simple snoop.war web
app without a plan to a Jetty server image using the command line.
It ended up deploying 2 configurations based upon the output
messages. Based on your description I think this is correct but
from a user perspective it seems confusing and wrong. I hadn't
configured anything for clustering and I was only deploying 1
thing. I expected to see results of just 1 configID for the
deployed item. Perhaps everything would have been fine if I had
used a plan but I don't think we can assume that users will always
use a plan. Here are the messages that were output:
Completed with id default/snoop/1194895785124/war
Completed with id default/snoop/1194895785559/war
Deployed default/snoop/1194895785124/war to
org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/car?
ServiceModule=org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/
car,j2eeType=ConfigurationStore,name=MasterConfigurationStore
@ /snoop
Deployed default/snoop/1194895785559/war to
org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/car?
ServiceModule=org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/
car,j2eeType=ConfigurationStore,name=ClusterStore
@ /snoop
2) Undeploy? What would I undeploy if I wanted to undo what I just
did? Do I need to undeploy each configuration individually? What
do you think about leaving the current deploy capability as is and
adding new commands/functions for deploying into a cluster, so as
not to confuse users in the simpler case without clustering?
3) Web Console. From the web console, instead of the 1 configuration
I initially expected, or the 2 configurations indicated in the
messages at deploy time ... I actually see 3 configurations (2 of
them started and 1 stopped ... now I'm even more confused ;-) ):
- default/snoop/1194895785124/war started
- default/snoop/1194895785559/war started
- default/snoop/1194895785702/war stopped
Again, I'm not sure how the user is supposed to manage/interpret
this. It seems that if we implement these concepts, there are a
number of comparable console and CLI changes that will be necessary
to manage the multiple CARs in a clustered scenario. Is there
any way we can keep the single-server use cases intact until we have
those capabilities?
4) TCK for Jetty is toast. I started to play with the individual
server because when I attempted to run the Jetty TCK tests, everything
was failing with lifeCycleExceptions. I imagine that we need to
rework some of the TCK for this change. We might be able to avoid
that if we can keep the single-server use cases unchanged. If that
isn't possible, will you be looking into the necessary TCK changes?
Thanks,
Joe
Gianny Damour wrote:
Hi,
I have just checked in support for distribution of configurations
to clusters and also management, i.e. start/stop, of such
clustered deployments.
I will try to explain how everything hangs together so that people
can jump in, provide feedback, request enhancements etc.
There is now a secondary configuration store:
org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/car?
ServiceModule=org.apache.geronimo.configs/clustering/2.1-SNAPSHOT/
car,j2eeType=ConfigurationStore,name=MasterConfigurationStore
This store is aware of the cluster members statically configured by
users (more on this later). Its responsibilities are:
* (un)installation of configurations on cluster members; and
* creation of "master" configurations defining GBeans able to
remotely start and stop a given configuration on a specific cluster
member.
Here is what happens when a configuration, e.g. groupId/artifactId/
2.0/car, is distributed to this store:
1. The usual configuration processing is executed. This results
in a backed configuration, i.e. one with its associated GBeans,
ready to be installed by the clustered store.
2. The clustered store uploads the backed configuration to the
registered cluster members, which then install it locally. If the
"remote" installation fails for one of the members, the clustered
store removes the configuration from all the members that have
successfully installed it so far.
3. The clustered store installs the configuration locally.
4. The clustered store creates from scratch a master
configuration, e.g. groupId/artifactId_G_MASTER/2.0/car. This
master configuration is made of GBeans, one for each member, each
able to remotely start or stop the configuration on its member: when
the master configuration starts, its GBeans start and in turn
remotely start the configuration on their members. So that the
master configuration can be started without all the members being
up, these GBeans "fail" silently when a remote start fails. However,
as these GBeans expose startConfiguration and stopConfiguration
managed operations, it is easy to remotely start a configuration
on a given member later via JMX. As expected, when the master
configuration is stopped, its GBeans stop and in turn remotely stop
the configuration on their members.
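To make the JMX part concrete, remotely starting the configuration
on one member later could look roughly like the following. This is
only a sketch: the service URL, credentials and GBean ObjectName are
placeholders, and I am assuming the managed operations take no
arguments since each GBean is bound to one configuration on one
member.

    import java.util.HashMap;
    import java.util.Map;

    import javax.management.MBeanServerConnection;
    import javax.management.ObjectName;
    import javax.management.remote.JMXConnector;
    import javax.management.remote.JMXConnectorFactory;
    import javax.management.remote.JMXServiceURL;

    public class RemoteStartViaJmx {
        public static void main(String[] args) throws Exception {
            // Placeholder JMX URL and credentials of the server hosting
            // the master configuration.
            JMXServiceURL url = new JMXServiceURL(
                    "service:jmx:rmi:///jndi/rmi://localhost:1099/JMXConnector");
            Map<String, Object> env = new HashMap<String, Object>();
            env.put(JMXConnector.CREDENTIALS, new String[] { "system", "manager" });

            JMXConnector connector = JMXConnectorFactory.connect(url, env);
            try {
                MBeanServerConnection mbeans = connector.getMBeanServerConnection();

                // Placeholder ObjectName: use the name of the GBean created for
                // the member you want to target inside the master configuration.
                ObjectName controller = new ObjectName(
                        "geronimo:ServiceModule=groupId/artifactId_G_MASTER/2.0/car,name=Node1");

                // Invoke the managed operation exposed by the GBean; assumed no-arg here.
                mbeans.invoke(controller, "startConfiguration", new Object[0], new String[0]);
            } finally {
                connector.close();
            }
        }
    }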
The clustered store relies on the static configuration of cluster
members. This static configuration MUST be done within
org.apache.geronimo.configs/clustering//car, as nodes must be
registered before the start of any master configuration. Indeed,
master configurations are injected with this static cluster
configuration to retrieve the JMX connection info needed to
connect to cluster members and remotely start/stop configurations.
At step 3 of the above deployment process, I wrote that the
configuration is installed locally, i.e. into the clustered
configuration store. At this stage, this is pretty much useless;
however, I believe that keeping a carbon copy of the configuration
in the master repository may become quite handy. For instance,
within the master configuration, we could add a GBean able to
upload this configuration on demand to a given member. This way,
when you add a new member to an existing clustered deployment, you
simply need to add a new GBean to remotely start/stop the
configuration on the new member and upload the configuration to it
via the utility GBean.
Hope the above is clear enough.
I will comment the org.apache.geronimo.configs/clustering//car
deployment plan, as it contains new GBean declarations that are not
obvious to understand without reading the code.
Following this, I will move to the remote start/stop of Geronimo
instances from a single Geronimo server. This should provide a set
of administration GBeans that admin console people may want to
leverage to improve the remote management of Geronimo instances.
These GBeans will talk to GShell instances and send arbitrary Groovy
scripts for execution within those GShell instances.
Meanwhile, if people are interested in working on the clustering
of Tomcat or OpenEJB via WADI, then please reply, as I am keen and
happy to help. One of those two features will be the next thing I
work on after completing the above management enhancement.
Thanks,
Gianny