Index branch merged into trunk...

Emmanuel Lécharny Mon, 30 Apr 2012 18:09:09 -0700

Hi,

just to inform you that the index branch has been merged today with no harm. I just had to fix 3 conflicts, and two bugs I introduced in the branch before the commit.

The server performance is way better for searches, thanks to a few improvements I made over the last 4 days. It was impressive how easy it was to improve the speed with small modifications. The global result is that the server now delivers:

o Object scope search (lookup): 49 880 req/s, compared to 23 081 on the previous trunk
o One Level scope search (5 entries returned): 68 715 entries returned per second, compared to 33 120/s
o Sub Level scope search (10 entries returned): 70 830 entries returned per second, compared to 18 910/s
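To put the figures above in perspective, here is a small sketch that computes the speedup ratio for each scope. The class and method names (SearchSpeedup, speedup) are only illustrative; the numbers are the ones quoted above.

```java
import java.util.Locale;

/** Illustrative helper: ratio between the merged trunk and the previous trunk. */
public class SearchSpeedup {
    /** Returns how many times faster the new rate is compared to the old one. */
    static double speedup(double newRate, double oldRate) {
        return newRate / oldRate;
    }

    public static void main(String[] args) {
        String[] labels = {"Object scope", "One Level scope", "Sub Level scope"};
        double[] merged = {49880, 68715, 70830};    // after the index branch merge
        double[] previous = {23081, 33120, 18910};  // previous trunk

        for (int i = 0; i < labels.length; i++) {
            System.out.println(String.format(Locale.ROOT, "%-16s x%.2f",
                labels[i], speedup(merged[i], previous[i])));
        }
    }
}
```

That is roughly a 2.2x gain for lookups, 2.1x for One Level searches, and about 3.7x for Sub Level searches.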

There is room for more improvement, but it will be more complex. The areas that can be improved are:

o Get rid of the extra getSearchControls() call in interceptors. This is the easiest fix.
o Review the way we handle entry modifications before we return them. Currently, we clone the entry and remove the attributes the user has not requested. See DIRSERVER-1719 for more explanation on this subject. Note that the filtering of attributes represents around 9% of the global CPU time.
o Getting the ID back from a Dn is a very costly operation (19% of the global CPU time), and the longer the DN, the longer the operation: for each RDN, we have to do a lookup in the RdnIndex. The only solution would be to have a Dn -> ID cache somewhere. This would boost the server performance, that's for sure.
o Fetching an entry from the backend costs 38% of the global time, out of which 29% represents the cost of cloning the entry. If we could avoid doing this clone (see above), we may get some major performance increase.
o When evaluating an entry to see if it matches the filter, we use the reverseIndex, which is also a costly operation. We should re-evaluate whether it wouldn't be better to use the MatchingRules comparator to do that instead (reverse lookups account for 4% of the used CPU time).
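As a rough illustration of the Dn -> ID cache idea mentioned above, here is a minimal sketch of a bounded LRU cache keyed by the normalized DN. Everything in it is an assumption made for the example: the DnIdCache class, the use of String for both the DN and the ID, and the rdnIndexLookup function standing in for the per-RDN walk through the RdnIndex. It is not the server's actual code.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.Function;

/** Hypothetical bounded Dn -> entry ID cache with LRU eviction (not ApacheDS code). */
public class DnIdCache {
    private final Map<String, String> cache;
    private long hits;
    private long misses;

    public DnIdCache(final int capacity) {
        // accessOrder = true keeps the least recently used mapping first
        this.cache = new LinkedHashMap<String, String>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<String, String> eldest) {
                return size() > capacity;
            }
        };
    }

    /**
     * Returns the ID for a normalized DN, calling the (assumed) slow
     * RdnIndex-based lookup only when the DN is not cached.
     */
    public synchronized String getId(String normalizedDn,
                                     Function<String, String> rdnIndexLookup) {
        String id = cache.get(normalizedDn);
        if (id != null) {
            hits++;
            return id;
        }
        misses++;
        id = rdnIndexLookup.apply(normalizedDn);
        if (id != null) {
            cache.put(normalizedDn, id);
        }
        return id;
    }

    /** Must be called when an entry is deleted, renamed or moved. */
    public synchronized void invalidate(String normalizedDn) {
        cache.remove(normalizedDn);
    }

    public synchronized long getHits() { return hits; }

    public synchronized long getMisses() { return misses; }
}
```

The hard part is not the cache itself but keeping it consistent: a Delete or Move changes the DN of the entry (and, for a Move, of all its descendants), so those mappings would have to be invalidated, or the whole cache cleared, whenever the RdnIndex is updated.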

One interesting result is that the LRUCache.get() operation represents 13% of the used time. This is definitely not small. There is probably some room for improvement here, but this is way more complex...

All those numbers have been collected using YourKit on a Lookup test (150 000 lookups on one single element have been done).

There are also some improvements to expect on the Add/Delete/Move operations, as we have to delete/add the keys in the RdnIndex. This is something I'm going to work on tomorrow.

One more thing: the numbers I get when running the server-integ search perf are way below (from 2900 to 5400 per second). This is perfectly normal. When going through the network, we pay some extra price:

o the client code eats 57% of all the time it takes to run the test
o On the server, normalizing the incoming Dn costs 7% of the processing time
o the entries encoding is very expensive

All in all, on the server, unless we test it on a different machine than the injectors, reliable measurements are pretty much impossible to do. There is too much noise...

I'd be interested in conducting larger tests on a multi-core server, with lots of memory and a lot of entries, with external injectors, to see what kind of performance we can get...

In the next few days, I will probably fix some pending bugs. I think we can cut an M7 release by the end of this week, and make it available by next week.


Thanks !

--
Regards,
Cordialement,
Emmanuel Lécharny
www.iktek.com
