> For example, we *always* construct an OWB WebContext for each and every application and *use* it in injection and fire all CDI lifecycle events even when the application is not CDI enabled. If the application isn't CDI enabled then simply no beans are listening for the events and the BeanManager will be nearly empty minus a few built-in beans, but our code doesn't see any difference. This was done intentionally to avoid the scenario where someone adds a beans.xml to an application and suddenly things that worked now blow up, resulting in the person being incorrectly mad at CDI or (correctly) mad at TomEE.
I know it's only an example.

The bad news: doing those improvements can sometimes be tricky. How do you detect whether the app is a CDI app? First we scan all META-INF/beans.xml files. If we don't find any, there might STILL be beans! Any Extension is free to add additional AnnotatedTypes via
http://docs.oracle.com/javaee/6/api/javax/enterprise/inject/spi/BeforeBeanDiscovery.html#addAnnotatedType%28javax.enterprise.inject.spi.AnnotatedType%29
and similar methods! The next step is to scan the classpath for the BDAs with META-INF/beans.xml. Even if you don't have a single CDI annotation, all the beans will still get picked up as @Dependent! Also, there are quite a few CDI lifecycle hooks where you can add your own Beans etc.

The good news: we've done this already for you ;) Well, here it goes: BeanManagerImpl#isInUse()

I guess I should document this a _bit_ better in the future :) At least I documented the member variable itself:

    /**
     * This flag will get set to <code>true</code> if a custom bean
     * (all non-internal beans like {@link org.apache.webbeans.component.BeanManagerBean} etc)
     * gets set.
     */

All you need to do is to check this flag.

LieGrue,
strub
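
To make the two points above concrete, here is a minimal sketch. The CDI calls (BeforeBeanDiscovery#addAnnotatedType, BeanManager#createAnnotatedType) and the OWB entry points (WebBeansContext#getBeanManagerImpl, BeanManagerImpl#isInUse) are real APIs, but the class names and the use of WebBeansContext.currentInstance() to look up the context are only assumptions for illustration -- the actual wiring depends on the integration code.

    import javax.enterprise.event.Observes;
    import javax.enterprise.inject.spi.BeanManager;
    import javax.enterprise.inject.spi.BeforeBeanDiscovery;
    import javax.enterprise.inject.spi.Extension;

    import org.apache.webbeans.config.WebBeansContext;

    public class CdiDetectionSketch {

        /**
         * Why "no beans.xml" does not mean "no beans": a portable Extension
         * (registered via META-INF/services/javax.enterprise.inject.spi.Extension)
         * can still add AnnotatedTypes during BeforeBeanDiscovery.
         */
        public static class HiddenBeansExtension implements Extension {
            void addTypes(@Observes BeforeBeanDiscovery bbd, BeanManager bm) {
                bbd.addAnnotatedType(bm.createAnnotatedType(SomePojo.class));
            }
        }

        /** A plain class that ends up as a @Dependent bean via the extension. */
        public static class SomePojo {
        }

        /**
         * The shortcut described above: let OWB answer the question itself.
         * isInUse() reflects the "custom bean registered" flag documented above;
         * how the WebBeansContext is obtained is an assumption here.
         */
        public static boolean isCdiApp() {
            WebBeansContext ctx = WebBeansContext.currentInstance();
            return ctx.getBeanManagerImpl().isInUse();
        }
    }
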
----- Original Message -----
> From: David Blevins <[email protected]>
> To: [email protected]
> Cc:
> Sent: Thursday, April 26, 2012 2:02 AM
> Subject: Re: Potential improvements and fixes to 1.0.0 binaries
>
> Green TCK run and green build from all the changes.
>
> Going to start merging code into the branch, get that through the TCK and a green build in the CI system.
>
> With any luck we'll have some binaries up by tonight!
>
> Now that I've had some sleep, some details on "faster"....
>
> - The functionality we're running with these much faster numbers is the *same* functionality we were running with the much slower numbers.
>
> It is definitely not the case that these improvements came from yanking things out we needed. I find that kind of "optimization" to be cheap and meaningless -- I'd rather have an honest slow time than a dishonest fast one.
>
> We'd never really done heavy performance optimization on TomEE, so there were a lot of nails sticking up. I hammered as many down as I could in the time we had.
>
> I think of the benefits of optimizations in these categories (in my order of preference):
>
> - Variable
> - Fixed
> - Conditional
>
> Variable is where the benefit of the optimization depends on the app size. A small app might see nothing and a large app might see huge benefit. Say you add an optimization that shaves off 10% of deploy time: our 40 second deploy becomes 36 seconds while our 2 second deploy still feels the same. The percentage of time you save is almost never constant, so saying "10% improvement" is usually a rough indicator at best.
>
> Fixed is like shaving, say, 1s off of a deploy time, and no matter what you'll only get the one second; a 100MB app and a 2MB app would get the same flat 1 second. If the deploy times were 40 seconds and 2 seconds, after the optimization you'd see 39 seconds and 1 second.
>
> Conditional is where the benefit effectively goes away because the boost is predicated on your app not needing some particular kind of functionality and that functionality being disabled. These are of the "if you're not using JSF, why enable MyFaces" kind. I really don't like these -- more on that later.
>
> The last month's worth of performance tuning had all been in Variable benefit optimizations, as I personally find these to have much greater value.
>
> The last two days have largely been Fixed benefit boosts, some Variable benefit boosts (yay) around tld scanning, and exactly one Conditional benefit optimization. The Fixed benefit optimizations were split between the server itself and actual app deploy times. The server Fixed optimizations you get once, the app deploy optimizations you get once per deploy. I tried to focus on app deploy times as much as possible and indeed this is where we saved the most on "startup" time. Who really cares how fast your empty server starts.
>
> On the Conditional benefit optimization, that was actually a bad bug. The spec-defined "metadata-complete" is essentially a Conditional benefit optimization required as part of certification, meant to avoid scanning. We were still scanning.
>
> On that, an explicit note to Neale. I was aware of this bug over the last month of our performance tuning, but intentionally didn't fix it :) I really wanted to tune large, real-world apps and so that's what we did. We made big improvements in actual deploy times of actual apps under the scenario that "everything was on", when in fact, for the apps we were using, not everything should have been on :) I recall you making note of that -- you were spot on. Now that the metadata-complete flag is properly functioning, these apps will boot much faster and probably at a speed that is very much on par with Tomcat, but if we want to get real performance numbers against beta-2 with the latest changes we should edit the web.xml to set metadata-complete="false".
>
> So, final note: aside from "metadata-complete", there were no Conditional benefit optimizations added. They have a place, but I don't really care for that kind of thing and think it is short-sighted. Things tend to slow down or break when you add bits to your app and suddenly you get different behavior. We're still executing all critical paths all the time, period. That's really the only way you can keep them well oiled and moving smoothly. This is actually at the core of OpenEJB/TomEE and I think a major reason for the speed and quality.
>
> For example, we *always* construct an OWB WebContext for each and every application and *use* it in injection and fire all CDI lifecycle events even when the application is not CDI enabled. If the application isn't CDI enabled then simply no beans are listening for the events and the BeanManager will be nearly empty minus a few built-in beans, but our code doesn't see any difference. This was done intentionally to avoid the scenario where someone adds a beans.xml to an application and suddenly things that worked now blow up, resulting in the person being incorrectly mad at CDI or (correctly) mad at TomEE.
>
> In terms of future perf work, I see room for more Variable benefit optimizations and not really too much around Fixed benefit optimizations. Still some scanning (variable) work we can do. There are some JAXB (fixed) optimizations we could do as well. Overall, though, I'd expect things to lean towards the Variable side, as they should.
>
> So, yes, we are "fast", but hopefully the above will break that word down into a more useful description of what fast means for us.
>
>
> -David
>
> On Apr 25, 2012, at 7:48 AM, David Blevins wrote:
>
>> Would be great to get at least some of the performance work into the 1.0.0 binaries as there's a bit over a 3x improvement in Eclipse startup time and a bit over 2x in regular startup time.
>>
>> Aside from performance I did find two bugs:
>>
>> - Web.xml metadata-complete effectively ignored
>>   https://issues.apache.org/jira/browse/TOMEE-166
>>
>> - Omitting <ejb-name> from xml may result in failed deployment
>>   https://issues.apache.org/jira/browse/OPENEJB-1830
>>
>> OPENEJB-1830 is not terrible, but TOMEE-166 is pretty bad. The point of metadata-complete is to avoid scanning, and quite simply it doesn't result in scanning being avoided.
>>
>> We pass the related TCK tests because we correctly ignore the data we've scanned, but applications still have to pay the full price of scanning regardless.
>>
>> A lot of the overall improvements were in the .tld file area. Those were the changes that bought us the most time (over a second off of startup and redeploys).
>>
>> Those are fantastic optimizations, not the "our stuff will be fast, but yours will be slow" kind. If you add a ton of tld libraries in tomcat/lib/, your apps will start much faster than they even do in plain Tomcat because of these optimizations.
>>
>> In short, we cache the results of the scan (just the URLs, not the file contents) and report that back anytime someone asks. If you have 5 apps, you get a 5x improvement on scanning of shared tld libs. This works regardless of how they're shared, too. So if you add a common classloader, for example, children of that classloader won't have to each individually scan the common classloader on each deploy.
>>
>>
>> -David
>>
>
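
The shared-tld caching described in the quoted paragraph boils down to remembering scan results per location so that later apps and redeploys reuse them. Below is a minimal sketch of that idea; the names (SharedTldScanCache, TldScanner) are invented for illustration and this is not the actual TomEE code.

    import java.net.URL;
    import java.util.List;
    import java.util.concurrent.ConcurrentHashMap;
    import java.util.concurrent.ConcurrentMap;

    public final class SharedTldScanCache {

        /** Scan results keyed by the external form of the scanned jar/directory URL. */
        private static final ConcurrentMap<String, List<URL>> CACHE =
                new ConcurrentHashMap<String, List<URL>>();

        private SharedTldScanCache() {
        }

        /** Hypothetical scanner abstraction standing in for the real scanning code. */
        public interface TldScanner {
            List<URL> scan(URL location);
        }

        /** Returns the TLD URLs under the given location, scanning each location at most once. */
        public static List<URL> tldsIn(URL location, TldScanner scanner) {
            String key = location.toExternalForm();
            List<URL> tlds = CACHE.get(key);
            if (tlds == null) {
                tlds = scanner.scan(location);               // the expensive part
                List<URL> raced = CACHE.putIfAbsent(key, tlds);
                if (raced != null) {
                    tlds = raced;                            // another deploy scanned it first
                }
            }
            return tlds;                                     // only URLs cached, never file contents
        }
    }

With five apps sharing the same tomcat/lib/ jars, only the first deploy pays for the scan; the other four just read the cached URL list, which is where the 5x figure above comes from.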
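
For reference, the metadata-complete switch discussed above is the standard attribute on the web.xml root element (Servlet 2.5 and later). A descriptor like the following tells the container it may skip annotation scanning -- the very step that TOMEE-166 reports was still happening:

    <web-app xmlns="http://java.sun.com/xml/ns/javaee"
             version="2.5"
             metadata-complete="true">
        <!-- metadata-complete="true": all deployment metadata is declared in this
             descriptor, so the container is allowed to skip annotation scanning.
             Set it to "false" (or omit it) to force scanning, as suggested for the
             beta-2 comparison above. -->
    </web-app>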
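
On the "we always fire all CDI lifecycle events" point quoted twice above: firing an event against a BeanManager that holds no application beans is simply a no-op, which is why the non-CDI case needs no special handling. A small sketch under that assumption; the event class and method names are made up, only BeanManager#fireEvent is the real CDI 1.0 API:

    import javax.enterprise.inject.spi.BeanManager;

    public final class LifecycleEventsSketch {

        /** Hypothetical payload meaning "a request just started". */
        public static class RequestStartedEvent {
        }

        private LifecycleEventsSketch() {
        }

        public static void onRequestStart(BeanManager beanManager) {
            // No observers (no beans.xml, no user beans): nothing happens, no error.
            // CDI-enabled app: any @Observes RequestStartedEvent method is notified.
            beanManager.fireEvent(new RequestStartedEvent());
        }
    }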
