Re: [osgi-dev] How to cleanly update/uninstall bundles

Christian Schneider Sun, 15 Feb 2015 10:07:59 -0800

If you do not implement something special for a clean shutdown of in-flight exchanges, then the normal error handling should take effect, as you mentioned. So, for example, a DB transaction should roll back. One issue may be that, e.g., a service call cannot be rolled back.

On the other hand, I think implementing clean shutdown will add a lot of complexity. The special code will only be executed in quite rare cases. These two effects increase the chance of programming errors in the code. So I am with you that in most cases you can just implement normal error handling and live with the fact that in-flight calls might run into errors.

What I have seen on production systems is that they mark a machine to be updated as inactive on a front-end load balancer. So no new requests come in, and after some time you can quite safely update the bundles. This is a quite low-tech solution, but I think exactly for this reason it works so well.

So while I wanted to understand clean shutdown better for the discussion on aries dev, I do not think it should always be done.

Btw., for my current redesign of JPA I have one problem on which I would like to get some feedback / ideas. I am providing a so-called EmSupplier:
https://github.com/cschneider/jpa-experiments/blob/master/jpa-support/src/main/java/net/lr/jpa/impl/EMSupplierImpl.java

This class will be offered as a service per persistence unit and should help to work with JPA. There is a precall method that will create an EM on the thread. Then there is a get() to retrieve the thread-local EM and a postcall that will close the EM again. As discussed, a bundle should have stopped all work when the stop method is done. Here this applies to the case where the PU bundle is stopped. So the EntityManagerFactory will also be deregistered and closed. As the EMSupplier depends on the EMF, it will also have to be closed.
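
The precall / get() / postcall lifecycle described above could look roughly like the following Java sketch. It is not the code from the linked EMSupplierImpl: `FakeEm` and `ThreadLocalEmSupplier` are hypothetical names, and `FakeEm` merely stands in for a JPA EntityManager so that the example compiles without a JPA dependency.

```java
import java.util.function.Supplier;

/** Hypothetical stand-in for an EntityManager; only the open/closed state is modelled. */
final class FakeEm implements AutoCloseable {
    private boolean open = true;

    boolean isOpen() { return open; }

    @Override
    public void close() { open = false; }
}

/** Sketch of a per-thread EM supplier; an assumption, not the real EMSupplierImpl. */
final class ThreadLocalEmSupplier implements Supplier<FakeEm> {
    // Stands in for something like EntityManagerFactory.createEntityManager().
    private final Supplier<FakeEm> factory;
    private final ThreadLocal<FakeEm> current = new ThreadLocal<>();

    ThreadLocalEmSupplier(Supplier<FakeEm> factory) {
        this.factory = factory;
    }

    /** Creates the EM for the calling thread. */
    void preCall() {
        if (current.get() != null) {
            throw new IllegalStateException("preCall() already called on this thread");
        }
        current.set(factory.get());
    }

    /** Returns the EM bound to the calling thread. */
    @Override
    public FakeEm get() {
        FakeEm em = current.get();
        if (em == null) {
            throw new IllegalStateException("no EM on this thread; call preCall() first");
        }
        return em;
    }

    /** Closes and unbinds the EM of the calling thread, if there is one. */
    void postCall() {
        FakeEm em = current.get();
        current.remove();
        if (em != null) {
            em.close();
        }
    }
}
```

A caller would typically wrap its work as `preCall(); try { ... get() ... } finally { postCall(); }` so that the EM is closed even when the work throws.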

Now the problem is that there might still be threads working on their per-thread EMs. The really safe way is to wait until all these threads have closed their EMs. This is what I am doing now. To make it a little more predictable, I added a timeout and close the remaining EMs after the timeout.
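
The wait-with-timeout approach could be sketched as below, under the assumption that the supplier tracks every EM it hands out. `TrackedEm`, `EmTracker` and the millisecond timeout parameter are hypothetical; a real implementation would close actual EntityManagers.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.concurrent.TimeUnit;

/** Hypothetical stand-in for an EntityManager; it only records whether it was closed. */
final class TrackedEm {
    private volatile boolean closed;

    boolean isClosed() { return closed; }

    void close() { closed = true; }
}

/** Sketch: tracks EMs handed out to threads and closes stragglers after a timeout. */
final class EmTracker {
    private final Set<TrackedEm> open = new HashSet<>();
    private boolean closing;

    /** Hands out a new EM, or refuses once shutdown has started. */
    synchronized TrackedEm acquire() {
        if (closing) {
            throw new IllegalStateException("supplier is shutting down");
        }
        TrackedEm em = new TrackedEm();
        open.add(em);
        return em;
    }

    /** Called by the worker thread when it is done with its EM. */
    synchronized void release(TrackedEm em) {
        if (open.remove(em)) {
            em.close();
            notifyAll(); // wake a shutdown() that may be waiting
        }
    }

    /**
     * Waits up to timeoutMillis for all EMs to be released, then closes the rest.
     * Returns the number of EMs that had to be closed forcibly.
     */
    synchronized int shutdown(long timeoutMillis) {
        closing = true;
        long deadline = System.nanoTime() + TimeUnit.MILLISECONDS.toNanos(timeoutMillis);
        try {
            while (!open.isEmpty()) {
                long remaining = deadline - System.nanoTime();
                if (remaining <= 0) {
                    break;
                }
                TimeUnit.NANOSECONDS.timedWait(this, remaining);
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt(); // stop waiting, still clean up below
        }
        int forced = open.size();
        for (TrackedEm em : open) {
            em.close();
        }
        open.clear();
        return forced;
    }
}
```

A thread whose EM was closed forcibly will fail on its next use of that EM, which is the ordinary error-handling path discussed earlier in this mail.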

So the question is: is this a best practice? The clear disadvantage is that stopping a PU bundle could take quite long (depending on the timeout). Would it be better to just let the threads close the EMs asynchronously and ignore the fact that this might go wrong if the bundle is uninstalled in the meantime?


Christian

On 15.02.2015 at 18:38, Peter Kriens wrote:

As always with design, it is about trade-offs. As indicated in my mail, the recovery time can be shortened if you can do a controlled shutdown. I know this was a big issue with mainframes; however, I doubt that with today’s highly distributed systems this is still very relevant. In general, when I have the choice in these circumstances, I would rather focus on reducing startup time instead of trying to manage shutdown more nicely.
I think the complexity of the additional recovery part is also dangerous, especially since you will have a common path and one that only gets executed when the shit really hits the fan. I think that is worth some additional startup time in one of the many machines in the cluster.
That said, every case is special. Just sharing my long experience in seeing overly complicated solutions that looked good close up but provided no real gain when you looked at the overall picture.
Kind regards,

Peter Kriens
On 15 Feb 2015, at 13:18, Graham Charters <chart...@uk.ibm.com> wrote:
Hi Peter,
I think you and I see different customer use cases. As I mentioned at the last OSGi f2f, we have customers whose applications take a significant amount of time to start and they have many instances. Rolling updates can therefore take a long time if a full application restart is necessary, so these customers want to minimise application update time and disruption. These are transactional deployments with failover, so they can be recovered if someone trips over the power cord, but that doesn't mean they want to use this during normal maintenance.
Regards, Graham.

Graham Charters PhD CEng MBCS PhD
STSM, WebSphere OSGi Applications & Liberty Repository Lead Architect, Master Inventor
IBM United Kingdom Limited, MP 146, Hursley Park, Winchester, SO21 2JN, UK
Tel: +44 1962 816527 Email: chart...@uk.ibm.com
Peter Kriens --- Re: [osgi-dev] How to cleanly update/uninstall bundles ---
From: "Peter Kriens" <peter.kri...@aqute.biz>
To: "OSGi Developer Mail List" <osgi-dev@mail.osgi.org>
Date:   Sun, 15 Feb 2015 11:48
Subject:        Re: [osgi-dev] How to cleanly update/uninstall bundles

------------------------------------------------------------------------

I am not sure I agree with your conclusion. :-)
Since it is theoretically impossible to protect against hard failure (power, kernel panic, kill -9, distributed call when the cable is unplugged, etc.), any valuable application must have protection against an unexpected exit at any moment in time. Idempotency, consensus, and transactionality are your friends in these cases. So if you are protected against these bad failures, how bad can an in-flight shutdown be? Best case, you can shorten the recovery time at restart, but this often requires additional complexity that can then also fail. Since the chance that things go wrong in-flight is quite small, I would take the recovery cost in the unlikely event you got caught.
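
As one illustration of the idempotency point, the sketch below remembers the result per request ID, so that a client retrying after an aborted in-flight call does not trigger the operation twice. `IdempotentExecutor` and the request-ID scheme are hypothetical, and the in-memory map is for illustration only: to survive a hard failure the IDs would have to be stored durably, ideally in the same transaction as the work itself.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.function.Supplier;

/** Sketch: runs an operation at most once per request ID and replays the stored result. */
final class IdempotentExecutor {
    // Assumption: in-memory only. A real system would persist this mapping.
    private final Map<String, Object> results = new ConcurrentHashMap<>();

    @SuppressWarnings("unchecked")
    <T> T execute(String requestId, Supplier<T> operation) {
        // If the operation throws, nothing is recorded, so a retry runs it again.
        return (T) results.computeIfAbsent(requestId, id -> operation.get());
    }
}
```

With this in place, a retry of the same request ID returns the stored result instead of repeating the side effect, so the caller can simply resend after any failure.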
Related is my very old opposition to an update or uninstall callback to the bundle. Though it is an awfully attractive idea with lots of good stuff, the party is spoiled because you cannot guarantee such a call under all circumstances.
Bill Joy (Sun co-founder) once told us a story about the development of the Internet, in which he took part. Initially they tried to make every router perfect, but this made the routers incredibly expensive, and there were still failure scenarios that even a perfect router could not handle (power, cable cuts). Then someone proposed to assume the routers were very imperfect and that the end points should correct the problems in the net. This changed a very large number of very hard-to-handle failure scenarios into one problem: how to handle a missing packet. If a router panicked, lost power, had a cable cut, was too busy, was out of memory, or had no clue: discard the packet.
It is a pervasive problem in the enterprise software world that we want to ignore failure because it is so hard. For example, Blueprint has this awful service damping that looks so attractive for the developer (Look Ma, no dynamics!), but by hiding the reality you get caught in lots of unexpected places.
Bad software expects an unchanging perfect world; good software is more realistic. Embrace failure! :-)
Kind regards,

Peter Kriens
On 15 Feb 2015, at 11:09, Christian Schneider <ch...@die-schneider.net> wrote:
Thanks to all of you for the insights.
From the responses I take it that clean shutdown is not in scope of OSGi itself. I agree that it is best solved at the application level. On the other hand, I see that the Quiesce API can at least cover some cases, and so it has its value.

Christian

On 13.02.2015 at 17:55, Raymond Auge wrote:
To my knowledge, what you are speaking of is not intentionally supported by the dynamics of OSGi. This topic comes up all the time, it's funny.
If you must support "in flight" changes, then you have to implement this support in your code using concurrency constructs.
Note that unregistering a service is a synchronous operation during "shutdown" of a bundle, and so with proper concurrency measures in place, a bundle could both be shutting down (meaning it's not reachable by other bundles) and also finishing any ongoing work.
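
One possible concurrency construct for this is a read/write lock: every service call holds the read lock while it runs, and the stop path takes the write lock, which it only obtains once the in-flight calls have finished. This is a sketch under assumptions, not an OSGi API. `QuiescingService` is a hypothetical name, and a real bundle would unregister its service before calling `stop()` so that new callers can no longer find it.

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;
import java.util.function.Supplier;

/** Sketch: lets in-flight calls finish while rejecting calls that arrive after stop(). */
final class QuiescingService {
    private final ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
    private boolean stopped; // guarded by lock

    /** Runs the work under the read lock; many calls can be in flight at once. */
    <T> T call(Supplier<T> work) {
        lock.readLock().lock();
        try {
            if (stopped) {
                throw new IllegalStateException("service stopped");
            }
            return work.get();
        } finally {
            lock.readLock().unlock();
        }
    }

    /**
     * Blocks until all in-flight calls have released the read lock, then marks
     * the service as stopped. Must not be invoked from inside call() on the
     * same thread, because a read lock cannot be upgraded to a write lock.
     */
    void stop() {
        lock.writeLock().lock();
        try {
            stopped = true;
        } finally {
            lock.writeLock().unlock();
        }
    }
}
```

This version waits indefinitely for in-flight calls. A bounded wait could use `tryLock` with a timeout on the write lock instead, at the cost of deciding what to do with calls that are still running when the timeout expires.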
Anyone feel free to correct me, but this is what I've learned in my short experience.
- Ray
_______________________________________________
OSGi Developer Mail List
osgi-dev@mail.osgi.org
https://mail.osgi.org/mailman/listinfo/osgi-dev

--

Christian Schneider

http://www.liquid-reality.de

Open Source Architect
Talend Application Integration Division http://www.talend.com

