On Thursday, 4 August 2016 at 11:35:41 UTC, crimaniak wrote:
On Tuesday, 2 August 2016 at 22:06:38 UTC, Mark "J" Twain wrote:
Instead, a better solution would be to use variables:

if (n*length > m*capacity) expand(l*length)

Some time ago I played with self-optimizing cache layer.
Problem: the time to obtain cache items is unknown and server-dependent. For example, the network can be involved; sometimes it is localhost, sometimes a server on another continent. The time to cache an item is unknown too: for example, if the memcached extension is not installed on a server, the fallback to the disk backend slows the cache down a lot. So we don't know in advance whether it is worth caching at all.
Solution: the cache measures its own performance and acts accordingly.


I did not think of this in terms of networking. I suppose it can be used there too but, as you mention, latency could be a factor.

So I did the following:
1. A functional interface instead of a save/load-type interface: cacheLevel.get(functor_to_get_item) allows measuring the time it takes to obtain an item.
2. All measuring/controlling logic lives in a separate class behind an AbstractStatist interface; the CacheLevel class only calls hooks on it. By swapping the AbstractStatist implementation I can switch between measuring statistics and using them, or work as a plain cache (EmptyStatist with all hooks empty). My implementation skips all items not worth caching (calcTime/calcHits*totalHits-totalTime < 0), but smarter things are possible (like caching only the most efficient items that fit in the cache size).
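If I read that right, a rough sketch of those hooks in D would look something like the following. AbstractStatist, EmptyStatist, CacheLevel and get are from your description; the method names and the in-memory backend are my guesses at the details.

import std.datetime.stopwatch : StopWatch, AutoStart;
import core.time : Duration;

// Hook interface: the cache level only reports events, all policy lives here.
interface AbstractStatist
{
    bool worthCaching(string key, Duration calcTime);
    void registerHit(string key);
    void registerMiss(string key, Duration calcTime);
}

// "Work as usual cache": all hooks empty, everything gets cached.
class EmptyStatist : AbstractStatist
{
    bool worthCaching(string key, Duration calcTime) { return true; }
    void registerHit(string key) {}
    void registerMiss(string key, Duration calcTime) {}
}

class CacheLevel
{
    private string[string] store;      // toy backend: in-memory strings
    private AbstractStatist statist;

    this(AbstractStatist statist) { this.statist = statist; }

    // Functional interface: we get to time the functor that produces the item.
    string get(string key, string delegate() functor)
    {
        if (auto p = key in store) { statist.registerHit(key); return *p; }

        auto sw = StopWatch(AutoStart.yes);
        auto value = functor();
        sw.stop();

        statist.registerMiss(key, sw.peek);
        if (statist.worthCaching(key, sw.peek))
            store[key] = value;
        return value;
    }
}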

If it is more complex than what I described, it would have to be thought out a great deal. My goal was to simply have the program expose optimization points (variables) and then allow an optimizer to change those to find better points. The program itself would be virtually unmodified: no code to interact with the optimization process except using variables instead of constants (which is minimal and necessary).
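Concretely, on the program side that is about all I have in mind. The Tunable struct and every name below are hypothetical, just to illustrate the shape:

// Hypothetical sketch: a constant becomes a named, bounded "optimization point"
// that an external optimizer is allowed to nudge between runs.
struct Tunable
{
    string name;
    double value;       // current setting, used where the constant used to be
    double min, max;    // allowed range
    double step;        // granularity the optimizer may move it by
}

// Instead of `if (length * 2 > capacity) expand(length * 3)` with hard-coded 2 and 3:
Tunable growThreshold = Tunable("growThreshold", 2.0, 1.1, 4.0, 0.1);
Tunable growFactor    = Tunable("growFactor",    3.0, 1.5, 8.0, 0.5);

void maybeExpand(ref size_t length, ref size_t capacity)
{
    if (length * growThreshold.value > capacity)
        capacity = cast(size_t)(length * growFactor.value);
}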

Exposing an interface for the program itself to guide the optimization process seems like a lot more work. But, of course, ultimately is better as it allows more information to flow in to the optimization process. But this design is beyond what I'm willing to achieve(this way could be months or years to get done right), while my method could take just a few hours to code up, and is rather general, although a bit dumb(fire and forget and hope for the best).

From my experience I can draw some conclusions:
1. Measure mode and control mode need to be separated. You can't gather accurate statistics while changing the system's behavior according to the current state of those statistics.
2. Statistics differ between applications, but for a specific application under specific conditions they can, in most cases, be approximated as constant.
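Put into code, I imagine the separation in point 1 looking roughly like this. It is only a sketch; the enum, the names and the caching criterion are all mine, not your formula:

// Sketch: statistics are only acted on once gathered with the policy frozen.
enum CacheMode { measure, control }

struct Stats
{
    ulong hits;
    ulong misses;
    double calcTime = 0;   // total seconds spent producing missed items
}

struct SelfTuningPolicy
{
    CacheMode mode = CacheMode.measure;
    Stats frozen;                  // snapshot used for decisions in control mode
    Stats current;                 // always being updated
    double cacheOverhead = 0.001;  // assumed cost of one store+lookup, seconds

    // Freeze the statistics gathered so far and start acting on them.
    void promote()
    {
        frozen = current;
        current = Stats.init;
        mode = CacheMode.control;
    }

    bool shouldCache()
    {
        // Measure mode: behave like a plain cache and just collect numbers;
        // never steer behavior from half-collected, self-influenced statistics.
        if (mode == CacheMode.measure) return true;
        if (frozen.misses == 0) return true;
        // Control mode: worth caching if producing an item costs more on
        // average than caching it (toy criterion).
        return frozen.calcTime / frozen.misses > cacheOverhead;
    }
}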


Yes, it is tricky to make the algorithm stable. This is why I think a simple optimizer would need to do this over long periods (months of program use). Because there are so many aberrations (other programs, user behavior, etc.), these can only be removed statistically by repeated use. Essentially, low-pass the data to remove all the spikes, then compare the averaged result with the previous one.
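Concretely I picture something as dumb as an exponential moving average per setting, only compared after many samples. A sketch, with made-up numbers:

// Sketch: smooth noisy per-run measurements (run time, memory, ...) with an
// exponential moving average so single spikes don't steer the optimizer.
struct SmoothedMetric
{
    double ema = 0;
    ulong samples = 0;
    enum double alpha = 0.01;   // small alpha == aggressive low-pass

    void add(double observation)
    {
        ema = samples == 0 ? observation : alpha * observation + (1 - alpha) * ema;
        ++samples;
    }

    // Only trust a comparison after enough repeated uses.
    bool betterThan(const SmoothedMetric previous, ulong minSamples = 1000) const
    {
        return samples >= minSamples
            && previous.samples >= minSamples
            && ema < previous.ema;   // lower cost (time, memory) is better
    }
}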


So for the array allocation strategy a more realistic scenario is, I think, the following:
1. The application is compiled in some 'array_debug' mode: a statist trait added to the array collects usage statistics and writes optimal constants at application exit.
2. The programmer configures the array allocator in the application according to these constants.
3. The application is built in release mode with the optimal allocation strategy and no statist overhead, and works fast. Users are happy.
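A sketch of what step 1 could look like in D; the array_debug version identifier and the counters are just an example, the real statist trait would track whatever the allocator needs:

// Sketch: in an "array_debug" build, collect usage statistics and dump them
// at program exit so the programmer can derive allocator constants from them.
version (array_debug)
{
    import std.stdio : stderr;

    struct ArrayStats
    {
        size_t appends;
        size_t reallocations;
        size_t maxLength;
    }

    __gshared ArrayStats arrayStats;

    // The instrumented append would bump these counters.
    void noteAppend(size_t newLength, bool reallocated)
    {
        arrayStats.appends++;
        if (reallocated) arrayStats.reallocations++;
        if (newLength > arrayStats.maxLength) arrayStats.maxLength = newLength;
    }

    shared static ~this()
    {
        // "Writes optimal constants at the application exit" -- here just the
        // raw numbers a programmer could turn into allocator settings.
        stderr.writefln("array_debug: appends=%s reallocations=%s maxLength=%s",
                        arrayStats.appends, arrayStats.reallocations, arrayStats.maxLength);
    }
}

The build for step 1 would then use something like dmd -version=array_debug, and the release build of step 3 simply omits it.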

This is a more static, profiling type of optimization. I am talking about something a bit different. Both methods could be used together for a better result, but mine is aimed at simplicity. We generally set constants blindly for things that affect performance. Let's simply turn those constants into variables and let a global blind optimizer try to find better values than the ones we "blindly" set. There is no guarantee it would find a better result, and it may even introduce program instability. But all of this can be roughly measured by CPU and memory usage, and given enough parameters there are probably at least several optimal points the optimizer could find.
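The optimizer side could be as blind as the following. Again only a sketch: Tunable is the same hypothetical struct as in the earlier sketch, repeated here to keep this self-contained.

import std.random : uniform;

struct Tunable { string name; double value, min, max, step; }

// Between runs, nudge one exposed variable by one step inside its range.
void nudge(ref Tunable t)
{
    immutable delta = uniform(0, 2) ? t.step : -t.step;   // coin flip: up or down
    immutable candidate = t.value + delta;
    if (candidate >= t.min && candidate <= t.max)
        t.value = candidate;
}

// Keep the change only if the smoothed cost (time, memory, ...) went down.
void acceptOrRevert(ref Tunable t, double previousValue, double newCost, double oldCost)
{
    // Fire and forget, hope for the best: no model, just keep what measured better.
    if (newCost >= oldCost)
        t.value = previousValue;
}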

Ultimately, for just a little work (setting the variables and specifying their ranges and step, say), we could have most programs being created, in D for now at least, optimizing themselves to some degree while they are being used, after they have been shipped. This, I believe, is unheard of. It represents the next level in program optimization.

Imagine one day a program that could optimize itself depending on the user's hardware, the user's habits, etc. Well, this method attempts to get that ball rolling and does all those things in a general way (albeit an ignorant one, but maybe just as effective). It's hard to know how effective until it is done, but we do know it can't be any worse than what we already have.





