Re: Proxy.isProxyClass scalability

Peter Levart Tue, 16 Apr 2013 07:20:27 -0700

Hi Mandy,

I prepared a preview variant of j.l.r.Proxy using WeakCache (turned intoan interface and a special FlattenedWeakCache implementation inanticipation to create another variant using two-levels ofConcurrentHashMaps for backing storage, but with same API) just tocompare performance:


https://dl.dropboxusercontent.com/u/101777488/jdk8-tl/proxy-wc/webrev.01/index.html

As the values (Class objects of proxy classes) must be wrapped in aWeakReference, the same instance of WeakReference can be re-used as akey in another ConcurrentHashMap to implement quick look-up forProxy.isProxyClass() method eliminating the need to use ClassValue,which is quite space-hungry.

Comparing the performance, here's a summary of all 3 variants (original,patched using a field in ClassLoader and this variant):



Summary (4 Cores x 2 Threads i7 CPU):

Test Threads ns/op Original Patched (CL field)Patched (WeakCache)======================= ======= ============== =====================================Proxy_getProxyClass 1 2,403.27163.70 206.884 3,039.01202.77 303.388 5,193.58314.47 442.58

Proxy_isProxyClassTrue 1 95.0210.78 41.854 2,266.2910.80 42.328 4,782.2920.53 72.29

Proxy_isProxyClassFalse 1 95.021.36 1.364 2,186.591.36 1.378 4,891.152.72 2.94

Annotation_equals 1 240.10152.29 193.274 1,864.06153.81 195.608 8,639.20262.09 384.72

The improvement is still quite satisfactory, although a little slowerthan the direct-field variant. The scalability is the same as withdirect-field variant.

Space consumption of cache structure, calculated as deep-size of thestructure, ignoring interned Strings, Class and ClassLoader objectsunsing single non-bootstrap ClassLoader for defining the proxy classesand using 32 bit addressing is the following:


original Proxy code:

proxy     size of   delta to
classes   caches    prev.ln.
--------  --------  --------
       0       400       400
       1       768       368
       2       920       152
       3      1072       152
       4      1224       152
       5      1376       152
       6      1528       152
       7      1680       152
       8      1832       152
       9      1984       152
      10      2136       152

Proxy patched with the variant using FlattenedWeakCache, run on currentJDK8/tl tip (still uses old ConcurrentHashMap implementation with segments):


proxy     size of   delta to
classes   caches    prev.ln.
--------  --------  --------
       0       560       560
       1       936       376
       2      1312       376
       3      1688       376
       4      2064       376
       5      2352       288
       6      2728       376
       7      3016       288
       8      3392       376
       9      3592       200
      10      3872       280

...and the same with current JDK8/lambda tip (using new segment-lessConcurrentHashMap):


proxy     size of   delta to
classes   caches    prev.ln.
--------  --------  --------
       0       240       240
       1       584       344
       2       768       184
       3       952       184
       4      1136       184
       5      1320       184
       6      1504       184
       7      1688       184
       8      1872       184
       9      2056       184
      10      2240       184

So with new ConcurrentHashMap the patched Proxy uses about 32 bytes moreper proxy class.

Is this satisfactory or should we also try a variant with two-levels ofConcurrentHashMaps?



Regards, Peter


P.S. Comment to your comment in-line...

On 04/16/2013 12:58 AM, Mandy Chung wrote:

On 4/13/2013 2:59 PM, Peter Levart wrote:
I also devised an alternative caching mechanism with scalability inmind which uses WeakReferences for keys (for example ClassLoader)and values (for example Class) that could be used in this situationin case adding a field to ClassLoader is not an option:
I would also consider any alternative to avoid adding theproxyClassCache field in ClassLoader as Alan commented previously.
My observation of the typical usage of proxies is to use theinterface's class loader to define the proxy class. So is itnecessary to maintain a per-loader cache? The per-loader cache mapsfrom the interface names to a proxy class defined by one loader. Iwould think it's reasonable to assume the number of loaders todefine proxy class with the same set of interfaces is small. Whatif we make the cache as "interface names" as the key to a set ofproxy class suppliers that can have only one proxy class per oneunique defining loader. If the proxy class is being generated i.e.ProxyClassFactory supplier, the loader is available for comparison.When there are more than one matching proxy classes, it would haveto iterate all in the set.
I would assume yes, proxy class for a particular set of interfaces istypically defined by one classloader only. But the API allows tospecify different loaders as long as the interfaces implemented byproxy class are "visible" from the loader that defines the proxyclass. If we're talking about interface names - as opposed tointerfaces - then the possibility that a particular set of interfacenames would want to be used to define proxy classes with differentloaders is even bigger, since an interface name can refer todifferent interfaces with same name (think of interfaces deployed aspart of an app in an application server, say a set of annotationsused by different apps but deployed as part of each individual app).
Agree. I was tempted to consider making weak reference to theinterface classes as the key but in any case the overhead ofClass.getClassLoader() is still a performance hog. Let's moveforward with the alternative you propose.
The scheme you're proposing might be possible, though not simple: Thefactory Supplier<Class> would become a Function<ClassLoader, Class>and would have to maintain it's own set of cached proxy classes.There would be a single ConcurrentMap<List<String>,Function<ClassLoader, Class>> to map sets of interface names tofactory Functions, but the cached classes in a particular factoryFunction would still have to be weakly referenced. I see somedifficulties in implementing such a scheme:- expunging cleared WeakReferences could only reliably clear thecache inside each factory Function but removing the entry from themap of factory Functions when last proxy class for a particular setof interface names is expunged would become a difficult task if notimpossible with all the scalability constraints in mind (justthinking about concurrent requests into same factory Function whereone is requesting new proxy class and the other is expunging clearedWeakReference which represents the last element in the set of cachedproxy classes).- one of my past ideas of implementing scalable Proxy.isProxyClass()was to maintain a Set<Class> in each ClassLoader populated with allthe proxy classes defined by a particular ClassLoader. Benchmarkingsuch solution showed that Class.getClassLoader() is a peformance hog,so I scraped it in favor of ClassValue<Boolean> that is nowincorporated in the patch. In order to "choose" the right proxy classfrom the set of proxy classes inside a particular factory Function,the Class.getClassLoader() method would have to be used, or entrieswould have to (weakly) reference a particular ClassLoader associatedwith each proxy class.
Thanks for reminding me your earlier prototype. I suspect the cost ofClass.getClassLoader() is due to its lookup of the caller class everytime it's called.

Even without SecurityManager installed the performance of nativegetClassLoader0 was a hog. I don't know why? Isn't there an implicitreference to defining ClassLoader from every Class object?

Considering all that, such solution starts to look unappealing. Itmight even be more space-hungry then the presented WeakCache.
WeakCache is currently the following:
ConcurrentMap<WeakReferenceWithInterfaceNames<ClassLoader>,WeakReference<Class>>
another alternative would be:
ConcurrentMap<WeakReference<ClassLoader>,ConcurrentMap<InterfaceNames, WeakReference<Class>>>
...which might need a little less space than WeakCache (only oneWeakReference per proxy class + one per ClassLoader instead of twoWeakReferences per proxy class) but would require two map lookupsduring fast-path retrieval. It might not be performance critical andthe expunging could be performed easily too.
I am fine with either of these alternatives. As you noted, the latterone would save little bit of memory for the cases when several proxyclasses are defined per loader e.g. one per each annotation type.
Mandy

Re: Proxy.isProxyClass scalability

Reply via email to