[E-devel] Memory optimizing EO

Marcel Hollerbach Tue, 18 Feb 2020 07:10:17 -0800

Hi,

here a little draft of what i will work on beginning on the 10th ofmarch. So until there, its time for discussion.


== Introduction ==

Eo is a OOP framework implementation. Classes are basicallyimplementations of a set of interfaces / parent class / mixins. EachInterface / parent class / mixin, consits of a set of functions.

The API a Class provides is called the absolut set of functions of a class

Each element in the absolut set of functions has to have aimplementation. Which has to be stored somewhere, this piece of code andstructure is called vtable. Every entry in the vtable consits of theimplementation of a API, and the source Class. (Which is needed forprivat data calculation, which is another topic)

Right now, each function has a ID (fid), this id is globally unique, andjust incremented for each and every function that gets registered to EO.Each vtable of a class has the space to store *all* functions that havebeen registered to the current point.Right now this happens in a way where the function ID of each functionis split into a upper part and a lower part (based on the bitrepresentation), this then results in the dich-chain1 ID (dc1) anddich-chain2 ID (dc2) which are used to get to the correct slot (pseudocode like: vtable->dichchain1[dc1]->dichchain2[dc2]). If a class needsto store the function X, then dich-chain1 at space dc1 gets a allocateddichchain2, which consists of 0's expect in the slot dc2, there we storethe implementation and the source.


== The problem ==

The problem with this approach is that we waste quite a lot of memory,esp. for widgets that do not implement a lot of API which have a closefid, leading to the fact that we allocate a lot of dich chain2s whereonly a few slots are really used. If you go and messure how many slotswe allocate and how many we use we are having a mean value of roundabout 0.36, which is quite bad IMO. In total, we have 35296 slotsallocated, and we use 16807. (This is already honoring the eooptimizations we have, the referenced slots are NOT added to this)


== The Idea number 1 ==

What we can do to improve this is: we change the heuristic how weallocate fids. We could increment for each class/interface/mixin one dc1counter, and for each function in this class/interface/mixin weincrement a class/interface/mixin privat counter, which will be the dc2.We then combine them together to the fid via something likedc1*10000+dc2. When a eo call then happens, we simply decompose the fidlike we do it now, and call the two dich chains, like we do it now.What this changes in the memory layout is that we fully use each andevery dichchain2 that we allocate.

The downside of that method is that we get a longer dichchain1 array,right now with elementary_test we are having 32 pointer long dichchain1at max, meaning we are allocating in total 2KB over all classes we have.

With this new idea the dichchain1 will be 190 elements long meaning 70KBover all classes we have.However, due to the savings in the dichchain2's we are saving roundabout 144KB of allocated data, (1 slot is 8byte). Which means we arestill having a safe of round about 74KB.

This is something which I would implement, additional min / maxcheckings or COW on the dichchain1 are probably even saving more.


== The Idea number 2 ==
After Idea 1 is implemented, we could work on this.

Right now we are allocating one dc1 slot *per* class we have. Right nowwe have 26 mixins 90 regulars 13 abstracts and 61 Interfaces (Summing upto 190). Which results in the length of the dc1. To improve thesituation we could go and say that each dc1 ID of a regular class is themax of all functions including those of the parent class + 1. Thatmeans, that fid's are not globally unique anymore, but they are stillunique within the type they are defined.The more interesting thing is, that all regular APIs are now in the samedc1 slot, which means, it is enough to allocate the size of the dc1 asthe size of the sum of iterfaces and mixins, ending up with somethinglower than 15KB per class. Bringing us to saving 130KB of allocated data(compared to the state that we have right now).

Another "downside" of that approach is also that we loose the ability ofmulticlass inheritance, however, we only have one widget that uses thatability, and this does not work anyways, so we could just deprecate it ithink.

Funcy side effect of this idea: if we resolve a regular API call to aobject, the dc1[0] slot will always be the same. That means, we couldstore that pointer in parallel to the vtable pointer, saving us oneindirection PER call resolve. (Right now we have 2).

Any ideas, thoughts about this ? I will start the work on that on the10.03.2020 :)


Greetings,
   bu5hm4n

PS: you can find a plot about our usage ratio in EO in the attached plot.

_______________________________________________
enlightenment-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/enlightenment-devel

[E-devel] Memory optimizing EO

Reply via email to