> What about the following hypothetical case:
>
> private void doSomething() {
>     // .. long running
>     C myVar = (C) appContext.getBean("C");
>     // .. long running
> }
Okay, first of all: yes, that will eventually result in a
ClassCastException, but there is a different reason for it. If you
"circumvent" dependency management, well then, sorry, there's no way for
the system to know about that dependency. However, the cause of this
problem is not a race condition, it's just missing information. So
basically it will always raise an exception, regardless of how many more
requests the user initiates (i.e. the only "advantage" that synchronizing
requests has in this case is that it won't fail for that particular
request, but it will fail for the following ones as well).
regards,
Bernhard
Werner Punz wrote on 12/12/2009 08:00 PM (GMT):
Bernhard Huemer schrieb:
Okay, I'll tell you how that works in my case, though I'm not really
sure if I got your example entirely right (in fact I am most probably
mistaken). The thing is, if class A somehow references class C, C
already has to be loaded at that time - you cannot even load class
A otherwise. Now if the developer modifies class C, obviously the
daemon thread will notify the system to refresh all relevant beans. If
it turns out that there is a relevant bean of a different class (e.g.
the relevant bean somehow has a dependency on something of type C),
the system will tell the reloading class loader in my case to
forcefully reload that particular class (i.e. assuming that the
relevant bean is an instance of class A, it will also reload A
again, regardless of whether its source actually changed or not). The
purpose of this forceful reload is to correct linkage dependencies,
i.e. if class A on its own depends on class C (e.g. there's a
setter setC(C c)), it will reload A just to ensure that it's
using the correct version of C.
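The bookkeeping behind that forceful reload could be sketched roughly like this - a hypothetical illustration, not the actual module code; the reverse-dependency map and all names are assumptions:

```java
import java.util.*;

/** Hypothetical sketch: when class C changes, also reload every class
 *  that references it, so linkage dependencies stay consistent. */
public class DependencyReloader {
    // Reverse dependency map: class name -> names of classes referencing it.
    private final Map<String, Set<String>> dependents = new HashMap<>();

    public void addDependency(String from, String to) {
        dependents.computeIfAbsent(to, k -> new HashSet<>()).add(from);
    }

    /** Everything that must be reloaded when 'changed' is recompiled;
     *  the result set doubles as a visited set, so cycles are safe too. */
    public Set<String> reloadSet(String changed) {
        Set<String> result = new LinkedHashSet<>();
        Deque<String> work = new ArrayDeque<>(List.of(changed));
        while (!work.isEmpty()) {
            String c = work.poll();
            if (result.add(c)) {
                work.addAll(dependents.getOrDefault(c, Set.of()));
            }
        }
        return result;
    }

    public static void main(String[] args) {
        DependencyReloader r = new DependencyReloader();
        r.addDependency("A", "C"); // e.g. A has a setter setC(C c)
        System.out.println(r.reloadSet("C")); // [C, A]
    }
}
```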
Well, I do the same, but I just drop the beans, since I don't have the
same level of control as you have in Spring.
As far as I know, your system only works on the Spring level. And do you
put the requests on hold while the object and class refresh happens?
What about the following hypothetical case:
private void doSomething() {
    // .. long running
    C myVar = (C) appContext.getBean("C");
    // .. long running
}
Now let's assume the following case:
A is currently processing doSomething, the compile is in full swing
and has recompiled C. You load C via the app context, the running A
then assigns an instance of the new class C to the old C myVar, and
you run into the famous ClassCastException.
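That "new class C vs. old C" failure comes down to class identity: the JVM identifies a class by its name *and* its defining loader, so the same bytes loaded by two loaders yield two incompatible types. A minimal, self-contained demonstration (compiling a throwaway class C into a temp dir is just scaffolding for the demo, not part of any framework):

```java
import javax.tools.ToolProvider;
import java.net.URL;
import java.net.URLClassLoader;
import java.nio.file.*;

public class LoaderIdentityDemo {
    public static void main(String[] args) throws Exception {
        // Compile a trivial class C into a temp directory (demo scaffolding).
        Path dir = Files.createTempDirectory("cdemo");
        Path src = dir.resolve("C.java");
        Files.writeString(src, "public class C {}");
        ToolProvider.getSystemJavaCompiler().run(null, null, null, src.toString());

        // Two isolated loaders, as after a hot reload: "old" vs. "new" loader.
        URL[] cp = { dir.toUri().toURL() };
        Class<?> oldC = new URLClassLoader(cp, null).loadClass("C");
        Class<?> newC = new URLClassLoader(cp, null).loadClass("C");

        System.out.println(oldC.getName().equals(newC.getName())); // true
        System.out.println(oldC == newC);                          // false
        // An instance of the "new" C is not an instance of the "old" C type,
        // which is exactly what surfaces as "C cannot be cast to C".
        Object fresh = newC.getDeclaredConstructor().newInstance();
        System.out.println(oldC.isInstance(fresh));                // false
    }
}
```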
In this case I am somewhat safe in single-user environments because
the refresh happens synchronously, and I still think that
synchronizing the compile operation between requests is the cheapest
way to prevent a situation like that.
So there are two options: either you find a way to prevent doSomething
from starting in the first place, or you wait until doSomething
has finished before doing the compile.
The forceful reload is quite nice, and I use it very similarly, but since
I don't have Spring's dependency capabilities I do it differently. The
issue still remains, though: you will run into a ClassCastException that way.
Give it a try; do something along the lines of:
private void doSomething() {
    // ..
    while (true) {
        C myVar = (C) appContext.getBean("C");
        Thread.sleep(...);
    }
}
and then change C: you will get the ClassCastException here! Because you
cannot terminate doSomething, but the classloader will push the new
C into the old reference!
You cannot really implement it in a different way, I suppose, as
otherwise you'd have to take care of the order in which you refresh
classes and beans (i.e. determine the class that no other class
depends on and refresh that one first, etc. - that doesn't work for
cycles, though).
regards,
Bernhard
Werner Punz wrote on 12/12/2009 03:09 PM (GMT):
Bernhard Huemer schrieb:
> Under normal no locking circumstances, the beans get
> replaced in the middle of the request because someone
> else triggered it for the application singleton, which
> is probably fine but somewhat dirty because in some
> cases this might end up with a temporary classcast
> exception which is resolved then at the following
> request cleanly.
Well, you're listing more and more issues that are only valid if you
refresh beans at the beginning of a request. What you're saying is
that the application is in an inconsistent state from the moment you
recompile classes until the beginning of the next request that
refreshes beans, renderer, etc. for which those recompiled classes
are relevant. However, to be more precise you'd have to say that the
application is in an inconsistent state from the moment you
recompile until all the relevant artifacts are refreshed. As you
refresh artifacts only at the beginning of a request, you'll have to
somehow synchronize requests, granted, but that doesn't mean that
it's necessarily also the case if you'd refresh artifacts in your
daemon thread instead. Ensuring that the recompile/refresh operation
is an atomic one is just so much easier, if you don't have to wait
for the next request for the refresh (as - again - that's where you
refresh artifacts).
The main issue here is to avoid inconsistent states as much as
possible; if you do the refreshing asynchronously, you just push the
inconsistencies one level up.
I will give an example.
The compile and refresh being atomic is fine, that is a common point!
The main issue is the application state for the user.
If you compile and refresh asynchronously without keeping the old states
of the objects, not only of the classes, you basically exchange classes
and objects in the middle of a request. OK, granted, this does not happen
too often, but it can happen!
So what happens is that a) the user has to wait in the middle of
request processing until the atomic compile and refresh is done (or
not, depending on what you want to lock there), and b) to make it
worse, suddenly in the middle of the request you have the beans and
classes exchanged.
OK, this is not too different from what happens if you refresh at
request level without streamlining the requests during the compile
and refresh cycle.
So pretty much you end up with one request in an inconsistent state,
and probably errors.
Anyway, I have given the solutions to the problem, and it does not
matter when you compile: either double-buffer the classes and
objects, or streamline the requests for the duration of the compile
and object refresh!
> What we are talking about here is a 1% corner case which
> imposes 90% extra work in that area, and that is definitely
> a post 1.0 thing to solve.
Granted, but just don't get me wrong. I've never meant to point out
every single tiny, inconvenient and maybe even insignificant issue;
you were the one who brought up the Windows file locking issue
(which, btw., I still doubt even exists, as even Windows provides -
if I'm not mistaken and if not specified otherwise - exclusive read,
write and delete access to only one process at a time). What I'm
saying is: yes, there are certain race conditions, but that's at
least partly a result of your "JSP-like" refresh approach.
I still don't think those issues, except for a longer waiting time, have
anything to do with the JSP-like approach. Granted, you have
to wait for the compiler instead of having it executed in parallel
(which is a fraction of a second), but the rest of the problems with
the inconsistencies of the application state are the same. And
thirdly, in a typical single-developer environment you basically give
the developer back control over when exactly to compile, instead of
enforcing it.
But as I said, that was not even my intention; I just had the JSP logic
in mind when coding it and did not think about asynchronous compiles.
The rest of the application state problems exist in either
approach. All you gain is a faster compile, at the price of taking
away the developer's control over when exactly to compile in a typical
dev environment.
> [...] (the biggest issue simply is the singleton constructs like
> application scoped managed beans, that means double buffer the
> class files so every compile has to go into a separate dir, [...]
Why do you think that you have to use separate directories all the
time? Once the class loader has loaded the class, it's in the main
memory anyway, just reuse the in-memory definition of the class and
then you could basically drop the class file on the file system.
What you mean is probably to somehow freeze the reloading process so
that it only picks up reloaded classes at a certain time, but that
doesn't require you to use separate directories (and again, that's
only required if you refresh artifacts JSP-like).
Not really true - you definitely need a full snapshot; you have
overlooked one corner case:
Look at it this way: bean a references classes b and c, with c loaded
dynamically at a later stage.
By the time the classes a, b and c get recompiled, c has not yet been
loaded. A developer/user hits refresh while the compile is in full
force, or has a running request and thus still holds the old
reference to a. But then, because the classes are exchanged exactly at
that request, b and c get refreshed: b is still
picked up because its old version is in RAM, but c is loaded
dynamically and is not yet in RAM, so you might end up with an error
because something does not match (in the worst case a ClassCastException
along the lines of "c cannot be cast to c"), because for a and b you are
still on the old version while c is loaded from the new version.
So it is either: buffer all classes as a snapshot in RAM for the
"compile" transaction (which with normal classloader logic is only
possible for 95% of cases, due to the lazy initialisation
classloaders in fact do), so that old requests get a consistent state;
or buffer the classes on the hard disk and keep the logic in the
classloader down to the bare minimum - so it is just either RAM or
disk space. The other solution is to compile only when no request is
going on and to block all requests until the compile and replace is done.
Normal classloader logic can deal with most cases, but not with the
fully dynamic part which gets loaded somewhere in the code via
loadClass!
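The snapshot-in-RAM variant could be sketched like this - a hypothetical illustration of the idea only, not the module's code; the class name C and the javac-into-a-temp-dir setup are just demo scaffolding:

```java
import javax.tools.ToolProvider;
import java.nio.file.*;
import java.util.*;

/** Sketch: capture all class bytes at the start of a compile "transaction"
 *  so in-flight requests keep resolving the OLD versions, even for classes
 *  that are only resolved lazily later via loadClass. */
public class SnapshotDemo {
    static class SnapshotClassLoader extends ClassLoader {
        private final Map<String, byte[]> snapshot;
        SnapshotClassLoader(Map<String, byte[]> snapshot) {
            super(null); // isolate from the application loader
            this.snapshot = snapshot;
        }
        @Override
        protected Class<?> findClass(String name) throws ClassNotFoundException {
            byte[] b = snapshot.get(name);
            if (b == null) throw new ClassNotFoundException(name);
            return defineClass(name, b, 0, b.length);
        }
    }

    public static void main(String[] args) throws Exception {
        // Compile a trivial class C into a temp dir (stands in for the app classes).
        Path dir = Files.createTempDirectory("snap");
        Files.writeString(dir.resolve("C.java"), "public class C {}");
        ToolProvider.getSystemJavaCompiler()
                    .run(null, null, null, dir.resolve("C.java").toString());

        // Take the snapshot BEFORE any recompile could overwrite C.class.
        Map<String, byte[]> snapshot = new HashMap<>();
        snapshot.put("C", Files.readAllBytes(dir.resolve("C.class")));

        // Even if C.class changes on disk now, this loader still sees old bytes.
        Class<?> c = new SnapshotClassLoader(snapshot).loadClass("C");
        System.out.println(c.getName()); // C
    }
}
```

The classes-on-disk variant is the same idea with the map backed by a per-compile directory instead of RAM.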
But as I said, this is way too much logic overhead to cover a corner case
which is not really that important for a development environment.
The worst case here is just a lost request. And if we look at
pure scripting languages, they do not even remotely try to solve this:
if the application logic and data structures go haywire, the
developer has to perform the reboot in those languages!
For example, you could do something like this: save the time stamp of the
beginning of the request and only reload class definitions if the
last-modified time stamp of the corresponding class file is less than
the previously saved one (i.e. basically, if the class file has been
recompiled before the beginning of the current request, use it -
which also means you won't care about classes recompiled during the
request). However, that's just an idea; I haven't tried it, as I
don't have to implement something like that in my case.
I am doing that on the bean level to push the changes through to the
session and custom scoped beans; the timestamp part needs a full
snapshot of all classes, but yes, that is definitely the way to identify
when the transactional boundary is reached.
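The timestamp policy above boils down to a tiny pure function; a sketch of just the decision logic (the names are made up, and this is not a working reloader):

```java
public class ReloadGate {
    /** Sketch of the timestamp policy: reload a class only if its class file
     *  changed after we last loaded it AND before the current request began
     *  (mid-request recompiles are deliberately ignored). */
    public static boolean shouldReload(long classFileModified,
                                       long lastLoaded,
                                       long requestStart) {
        return classFileModified > lastLoaded && classFileModified <= requestStart;
    }

    public static void main(String[] args) {
        System.out.println(shouldReload(100, 50, 200)); // true: compiled before the request
        System.out.println(shouldReload(300, 50, 200)); // false: compiled mid-request
        System.out.println(shouldReload(40, 50, 200));  // false: nothing new since last load
    }
}
```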
> And to go back to the original discussion, the compile trigger
> point is mostly a matter of preference. I have to admit doing
> the compile on request start was just because I had JSP's
> behavior in mind when I was coding it; I was not even
> thinking of doing it in parallel in the watchdog daemon thread.
.. which is why I told you about the possibility of doing it that
way now. You know, four eyes can see more than two and I really like
this module, I think it could be a great advantage of MyFaces.
That's why I'm trying to suggest improvements as far as possible. ;-)
Yes indeed... and no offence taken.
regards,
Bernhard
Werner Punz wrote on 12/12/2009 10:31 AM (GMT):
Bernhard Huemer schrieb:
I'd rather have a single predictable triggering point than have
the compiler triggered continuously in an unpredictable manner.
A standalone developer can code and save and can cause continuous
errors. But by the time he hits refresh, he can be pretty sure that
his code should work (well, often it does not, but that is a different
matter).
Even if you compile continuously the developer can introduce
mistakes, save them and the application won't pick them up as it
simply doesn't compile anyway - or do you mean runtime errors?
Just thinking about it - apparently it doesn't really matter at
which point you pick up the changes as long as you pick them up at
all (which you do), which basically means, if the developer
introduces runtime errors at runtime it will affect your
application regardless of whether you recompile it JSP-like or not
(btw. using the term "JSP-like" as a way to express how you manage
compilation isn't really precise either as e.g. the Jasper 2
engine provides background compilation as well - but let's stick
with the usual approach to define what "JSP-like" means).
Anyhow if it works JSP-like in your case, then you can't just
treat users and developers the same. The relationship that any
developer who uses your module is a user of your module doesn't
really matter when it comes to race conditions, so I'd suggest
we'll ignore that fact.
However, what matters is that there are people who issue requests
to the web server, namely the users, and people who actually
modify the source files of those applications, the developers. The
problem with the users' requests being the "compilation trigger"
is apparently that you'll have to deal with race conditions, as
there are multiple possible request threads. If, however, the
developer - or more precisely the daemon thread that checks
for file modifications - triggers compilations, you've only got one
thread (the file monitoring thread) that could possibly access
the compiler, hence no need for synchronization at all in this case!
Well, we've already talked about it a lot anyway, and it's
probably just a matter of preference; I just wanted to point out
some issues and compare different approaches. Maybe others want to
follow that discussion as well, which is why I'm still responding
to these emails.
Actually the trigger point of the compiler is really just a matter
of personal preference, but the concurrency issues go way deeper
than that and mostly are singleton related.
We have application scoped, session scoped and request scoped beans.
Well, what happens if a compile is done in the middle of a request for
someone who hits the site? This happens in both approaches.
Under normal, no-locking circumstances the beans get replaced in
the middle of the request because someone else triggered it for the
application singleton, which is probably fine but somewhat dirty,
because in some cases this might end up with a temporary
ClassCastException which is then resolved cleanly at the following
request.
If you want to solve it cleanly you have various options.
a) Let the requests which are already in progress run out.
Then compile, and while compiling put any new request on hold.
Then let the requests through again.
The compile has to be seen as a transaction boundary: everything
before the compile has to be a single unit, which is not mutable,
and everything after the compile as well.
The problem here starts with long-running requests such as those
Comet frameworks issue; then suddenly the compiler literally has to
wait for ages until it can trigger (until the timeout of the Comet-related
long-running XHR request, if you run for instance on Bayeux
rather than on WebSockets, which are handled differently).
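Option a) maps naturally onto a read-write lock: requests share the read lock, the compile takes the write lock, so a compile waits until in-flight requests drain while new requests block until the swap finishes. A minimal sketch - not the module's actual code, and it inherits exactly the Comet caveat above, since a long-running request holds the read lock:

```java
import java.util.concurrent.locks.ReentrantReadWriteLock;

/** Sketch of option a): drain in-flight requests, hold new ones,
 *  then compile-and-swap atomically. */
public class CompileGate {
    private final ReentrantReadWriteLock gate = new ReentrantReadWriteLock();

    public void handleRequest(Runnable request) {
        gate.readLock().lock();      // many requests may run concurrently
        try { request.run(); }
        finally { gate.readLock().unlock(); }
    }

    public void compileAndSwap(Runnable compile) {
        gate.writeLock().lock();     // waits until all requests have finished
        try { compile.run(); }       // exclusive: no request sees a half-swapped state
        finally { gate.writeLock().unlock(); }
    }

    public static void main(String[] args) {
        CompileGate g = new CompileGate();
        int[] n = {0};
        g.handleRequest(() -> n[0]++);
        g.compileAndSwap(() -> n[0] += 10);
        System.out.println(n[0]); // 11
    }
}
```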
b) Try to double-buffer everything possible so that requests before
and during the compile see a single application state. (The biggest
issue is simply the singleton constructs like application scoped
managed beans: that means double-buffering the class files, so every
compile has to go into a separate dir, and double-buffering the managed
beans, which means the old beans have to be preserved until the last
JSF request which accesses the current state has terminated - so I
even assume we'd need an unlimited nesting depth of the application
state here.)
In short, to sum it up: this is way too much to handle
for my 1.0 version, which is mainly aimed at easing the life of
developers.
I probably will add solution a), but will make it only optionally
turned on, sort of as an additional safety net for production sites
which do not run Comet over JSF (99% of all sites). I am not aiming
for a 100% perfect solution in 1.0, but only for a solution which
should ease the life of developers by reducing the number of
server restarts as much as possible.
What we are talking about here is a 1% corner case which imposes
90% extra work in that area, and that is definitely a post-1.0
thing to solve. After all, the entire library is not done with 1.0;
1.0 is just a first version which aims to solve certain things to
some extent.
And we are not talking about rendering the application in an
unusable state, but about the fact that after compile time, users in a
multi-user environment might get an error for exactly one request -
a situation which cannot happen at all in a single-user dev environment.
So hot-patching a running server, or having multiple developers
programming against a running server, might trigger this, but only
for one request. It simply is not worth it to solve that for 1.0,
although I am sure some users will run into it, hence this
needs to be documented!
And to go back to the original discussion, the compile trigger
point is mostly a matter of preference. I have to admit doing the
compile on request start was just because I had JSP's behavior in
mind when I was coding it; I was not even thinking of doing it
in parallel in the watchdog daemon thread.