Re: [amibroker] Re: The EASIEST way to use new optimizer engines

Dennis Brown Sat, 28 Jun 2008 15:13:50 -0700

Tomasz,

Thank you for doing all the work on researching a good optimizer and providing a solution for your customers. It was on my list of things to do and I had started to research it recently. I just ran out of time for the present.

Even though I don't use AA mode for my work, I will still benefit from your generous work.

My plan was to make an optimizer chart that runs in a different tab from my main chart. That lets me do a lot of things using my Flexible Parameters scheme. Since all my parameters (and I have hundreds of them to define a system) are accessible to other charts via static variables+ChartID, I can modify the working parameters and get results for each pass through a simple messaging interlock. I can run the whole process via AFL (I could even translate the optimizer code to AFL to make it easier to modify, but I have not decided if that is wise yet). Also since all the parameter names are available from their file names with my system, the optimizer can have a list of Params to pick from and save out its own settings as a Param set. It could be quite a rich environment for optimizing my complex systems.

This might not make a lot of sense to most, but I just wanted to let you know that you have enabled me to do much more than imagined by your openness (which is not always the case with others). Now, just where to find the time to do everything on my list. LOL


Best regards,
Dennis

On Jun 28, 2008, at 3:30 PM, Tomasz Janeczko wrote:

Yes you are right. Windows is a black box. It truly is. And several people were so worried that they developed Linux :-)
Visual Studio - fortunately Microsoft DOES provide source code for MFC and you can single-step into source down to C++ or assembly level on ANY code produced by Visual Studio. Without that I wouldn't use it. Many times I needed to investigate the sources of MFC to find why things work one way or another. There are zillions of details that do not exist in Microsoft help files, but can be revealed when analysing MFC sources.
The main thing that decides whether you need sources or not is the ability to validate results.
If you have Windows function SetWindowText that just sets the text for the window, it is easy to verify if it works or not, without having source code. The same with AmiBroker's AFL functions. It is easy to verify if 10-day simple moving average is correct or not.
It is like 2*2: the result is 4 and everyone knows that. With "intelligent search" algorithms the answers are not that obvious. Very few evolutionary algorithms have strong mathematical proofs behind them.
That's why having source files is a huge advantage, and that's why all non-exhaustive optimizers from AmiBroker come with the source code.
Best regards,
Tomasz Janeczko
amibroker.com
----- Original Message -----
From: Fred Tonetti
To: [email protected]
Sent: Saturday, June 28, 2008 9:01 PM
Subject: RE: [amibroker] Re: The EASIEST way to use new optimizer engines
Thanks for your comments about the individual engines … What I have or haven’t read has nothing to do with how someone chose to implement something. IO too is capable of performing multiple runs or what I termed “passes” but this is implemented somewhat differently as documented.
One last thought about “black boxes” … By your definition Windows would be a black box as would Visual Studio or for that matter all of AmiBroker with the exception of the plugins where the code was supplied.
Should we not use these tools because they are black boxes ?
From: [email protected] [mailto:[EMAIL PROTECTED]] On Behalf Of Tomasz Janeczko
Sent: Saturday, June 28, 2008 6:12 AM
To: [email protected]
Subject: Re: [amibroker] Re: The EASIEST way to use new optimizer engines
Hello,
Fred>"fairly standard simple algorithms with some tweaks that after tons of experimentation "
The PSO was first described in 1995 by James Kennedy and Russell C. Eberhart, since then LOTS of people developed their own algorithms based on PSO.
There are at least 20 DIFFERENT public PSO algorithms that I know of, all producing different results. What is "fairly standard" then?
I am pretty sure that you are not using Standard 2007 (as "spso"), are you???
Unless source code for IO is provided, it *IS* a black box.
Fred> How does one intelligently decide how many runs and tests to use for PSO & Tribes based on differing number of variables to be optimized ?
You actually answered yourself: you decide "after tons of experimentation". It depends on the problem under test, its complexity, etc, etc.
No stochastic non-exhaustive method gives you a guarantee of finding the global max/min, regardless of the number of tests, if that number is smaller than exhaustive. The easiest answer is: specify as large a number of tests as is reasonable for you in terms of the time required to complete.
Another simple piece of advice is to multiply the number of tests by 10 with each added dimension. That may lead to overestimating the number of tests required, but it is quite safe.
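The "multiply by 10 per added dimension" advice can be written down as a small calculation. This is only a sketch in Python, not anything taken from the AmiBroker plugins: the function name and the baseline (1000 tests at 2 dimensions, taken from the default figure quoted elsewhere in this thread) are assumptions for illustration.

```python
def suggested_tests(dimensions, base_tests=1000, base_dimensions=2):
    """Rule-of-thumb test budget: multiply by 10 for each dimension above the baseline.

    base_tests and base_dimensions are assumed starting points, not plugin settings.
    """
    if dimensions < 1:
        raise ValueError("dimensions must be at least 1")
    extra_dimensions = max(0, dimensions - base_dimensions)
    return base_tests * 10 ** extra_dimensions


for n in (2, 3, 5):
    print(n, suggested_tests(n))
```

The budget grows very quickly with this rule, which is why it is described as safe but likely to overestimate.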
In case you did not notice this is a very first version that is subject to improvements. I want to keep things simple to use and do not require people to read 60+ page doc to be able to run first optimization.
Therefore the work is being done to provide "reasonable" default/automatic values so optimization can be run without specifying anything.
Fred> What happens differently for these two engines when one specifies 5 runs of 1000 tests versus 1 run of 5000 tests ?
Well, if you read that many scientific papers on intelligent methods, you should already know the difference, as it is the most basic thing.
TEST (or evaluation) is single backtest (or evaluation of objective function value).
RUN is one full run of the algorithm (finding optimum value).
Each run simply RESTARTS the entire optimization process from the new beginning (new initial random population).
Therefore each run may lead to finding different local max/min (if it does not find global one).
Once you know the basics the difference is obvious.
5 RUNS of 1000 tests is simply doing 5 times the 1000-backtest PSO optimization.
1 RUN of 5000 tests is simply doing 5000-backtest PSO optimization ONCE only.
Now if the problem is relatively simple and 1000 tests are enough to find global max, 5x1000 is more likely to find global maximum because there is less chance of getting stuck in a local max, as subsequent runs will start from different initial random population.
The difference will be if problem is complex enough (has many dimensions). In that case running 1x5000 is more likely to produce better result.
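The RUN versus TEST distinction can be illustrated with a toy optimizer. The sketch below is Python and uses a plain random-restart hill climber, not PSO or Tribes; the objective function, step size and seed are all made up for illustration. It only shows that both settings spend the same total number of evaluations while distributing them differently.

```python
import math
import random


def objective(x):
    # Toy multimodal function on [0, 10]; stands in for one backtest result.
    return math.sin(3 * x) + 0.1 * x


def one_run(tests, rng):
    """One RUN: a fresh random start followed by `tests` evaluations in total."""
    best_x = rng.uniform(0, 10)
    best_f = objective(best_x)              # first evaluation
    for _ in range(tests - 1):              # remaining evaluations
        candidate = min(10.0, max(0.0, best_x + rng.gauss(0, 0.2)))
        f = objective(candidate)
        if f > best_f:
            best_x, best_f = candidate, f
    return best_f


def optimize(runs, tests, seed=0):
    """Restart the search `runs` times and keep the best result over all runs."""
    rng = random.Random(seed)
    results = [one_run(tests, rng) for _ in range(runs)]
    return max(results), runs * tests


best_a, evals_a = optimize(runs=5, tests=1000)   # five restarts, 1000 tests each
best_b, evals_b = optimize(runs=1, tests=5000)   # one long search
print(evals_a, evals_b)
print(best_a, best_b)
```

Which setting ends up with the better value depends on the problem, as described above; the toy function here is too simple to say anything about real trading systems.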
Actually this can be used as a stop condition. You can for example say that you want to restart (make another run) until two (or three) subsequent runs produce the same maximum.
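A stop condition of this kind might look as follows. This is a hypothetical Python sketch, not code from any of the plugins: `run_once` stands for "perform one full run and return its maximum", and `max_runs` and `tol` are safety parameters added here as assumptions.

```python
def restart_until_stable(run_once, repeats=2, max_runs=20, tol=1e-9):
    """Call run_once() until `repeats` consecutive runs return the same maximum.

    Returns the best maximum seen and the number of runs performed.
    """
    history = []
    streak = 0
    while len(history) < max_runs:
        value = run_once()
        if history and abs(value - history[-1]) <= tol:
            streak += 1          # same maximum as the previous run
        else:
            streak = 1           # a different maximum starts a new streak
        history.append(value)
        if streak >= repeats:
            break
    return max(history), len(history)


# Usage with a scripted sequence of per-run maxima (hypothetical values).
scripted = iter([1.2, 1.5, 1.7, 1.7, 1.9])
print(restart_until_stable(lambda: next(scripted)))
```

With the scripted values the loop stops after the fourth run, because the third and fourth runs agree.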

CMA-ES is slightly different in terms of how RUN is interpreted.
Currently the CMA-ES plugin implements G-CMA-ES flavour (i.e. global search with increasing population size).
As it is written in the READ ME
http://www.amibroker.com/devlog/wp-content/uploads/2008/06/readme5130.html
You may vary it using OptimizerSetOption("Runs", N ) call, where N should be in range 1..10.
Specifying more than 10 runs is not recommended, although possible.
**** Note that each run uses TWICE the size of population of previous run so it grows exponentially.
Therefore with 10 runs you end up with population 2^10 greater (1024 times) than the first run. ****
So each subsequent CMA-ES run will take TWICE as much time as previous one and TWICE the population size.
Of course this can be changed (the source code is available and well documented).
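The doubling can be tabulated with a few lines of Python. This is a sketch under two assumptions that are not stated in the plugin documentation quoted here: the first run is taken as the baseline (multiplier 1), and run time is taken to scale in proportion to population size.

```python
def population_multipliers(runs):
    """Population of each run relative to the first run, doubling on every restart."""
    return [2 ** k for k in range(runs)]


multipliers = population_multipliers(10)
print(multipliers)        # relative population for runs 1..10
print(multipliers[-1])    # multiplier of the tenth run
print(sum(multipliers))   # total work for all ten runs, in units of the first run
```

Under these assumptions the tenth run alone is 512 times the first, and the ten runs together cost 1023 first-run units, which is on the order of the 2^10 figure mentioned above.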
Fred> How should one set up CMA-ES so that it produces superior results in less time for problems like the one I outlined i.e. that are of a type that can not be solved by exhaustive search ?
Just use one run.

OptimizerSetOption("Runs", 1 );

it will produce results in less time.
Doing so is actually equivalent to running L-CMA-ES (local search).

Best regards,
Tomasz Janeczko
amibroker.com
----- Original Message -----
From: Fred Tonetti
To: [email protected]
Sent: Saturday, June 28, 2008 10:46 AM
Subject: RE: [amibroker] Re: The EASIEST way to use new optimizer engines
TJ,
IO, which was preceded by PSO, was initially an experiment to determine whether or not it could even be done and then whether or not it was a worthwhile tool to have.
Following that it was and is for the most part a give back to the community as most of the bells and whistles are FREEWARE in a user friendly format. Stating that it is a black box is absurd as it uses fairly standard simple algorithms with some tweaks that after tons of experimentation I know to be of benefit and users have control over all aspects of how the algorithms work from their AFL if they choose to use them without having to research them on the internet as there’s 60+ pages of documentation about what has been implemented, how it works and the associated feature/functions …
Frankly I could care less if anyone ever bought a copy with the more advanced features as the fees associated with those features were put on simply to reduce the amount of support that would no doubt be required if the entire community used them.
What I want to compare is the usefulness of the different engines for different types of problems and how long they take to arrive at relatively decent results to solve problems that can not be solved by exhaustive search and to that end I have already asked several straight forward questions that for whatever reason you have chosen to ignore … So I’ll try them again …
- How does one intelligently decide how many runs and tests to use for PSO & Tribes based on differing number of variables to be optimized ?
- What happens differently for these two engines when one specifies 5 runs of 1000 tests versus 1 run of 5000 tests ?
- How should one set up CMA-ES so that it produces superior results in less time for problems like the one I outlined i.e. that are of a type that can not be solved by exhaustive search ?
These are basic questions about the use of the intelligent optimization engines that you have chosen to include in the product which I would think lots of folks would want the answers to without having to search the internet.
Personally I’ve already read way beyond my share of scientific papers on intelligent optimization.
From: [email protected] [mailto:[EMAIL PROTECTED]] On Behalf Of Tomasz Janeczko
Sent: Saturday, June 28, 2008 3:59 AM
To: [email protected]
Subject: Re: [amibroker] Re: The EASIEST way to use new optimizer engines
Fred,
I don't know why you took on some kind of mission of criticizing the latest developments. Maybe this is because you are selling IO while the AB optimizer is offered as a free upgrade and that makes you angry.
I don't know why this is so, because actually you can benefit from that too - I have provided full source code so everything is open for innovation and improvement, unlike black box IO.
The fact is that you are comparing APPLES TO ORANGES.
You should really READ the documentation I have provided and visit links I have provided.
CMA-ES DEFAULTS are well suited for tests that are replacement of exhaustive searches.
They are however too large for 15 variables. For example CMA-ES by default will use 900 * (N + 3) * (N + 3) max evaluations. It converges much quicker, therefore the estimate displayed in the progress bar is calculated as follows: 30 * (N + 3) * (N + 3)
You are comparing 1000 evaluations of PSO with CONSTANT population size to 10000+ evaluations of CMAE with GROWING population size default settings.
You are comparing an elephant to an ant.
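The two formulas quoted above are easy to evaluate for a given number of optimized variables N. The Python below is only a calculator for those formulas; the function names are made up for illustration and are not part of the plugin.

```python
def cmaes_default_max_evaluations(n):
    """Default evaluation cap quoted above: 900 * (N + 3) * (N + 3)."""
    return 900 * (n + 3) * (n + 3)


def cmaes_progress_estimate(n):
    """Progress-bar estimate quoted above: 30 * (N + 3) * (N + 3)."""
    return 30 * (n + 3) * (n + 3)


for n in (3, 15):
    print(n, cmaes_default_max_evaluations(n), cmaes_progress_estimate(n))
```

For N = 15 the estimate comes to 9720 evaluations and the cap to 291600, compared with the 5 x 1000 = 5000 tests used for PSO and Tribes in the comparison being discussed.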

If you want to COMPARE things you need to set up IDENTICAL conditions.
That would be:

OptimizerSetOption("Runs", 1 );
OptimizerSetOption("MaxEval", 10000 );

With *IDENTICAL* conditions, CMA-ES will run faster.

Best regards,
Tomasz Janeczko
amibroker.com
----- Original Message -----
From: Fred Tonetti
To: [email protected]
Sent: Saturday, June 28, 2008 7:15 AM
Subject: RE: [amibroker] Re: The EASIEST way to use new optimizer engines
It is somewhat meaningless to compare intelligent optimizers with exhaustive search due to the fact that for most real world problems exhaustive search would need more time than the universe has been around to solve them … It is also somewhat meaningless to compare intelligent optimizers with each other based on problems that are solvable by exhaustive search.
In regards to the imbedded PSO & Tribes algorithms you state …
“You should increase the number of evaluations with increasing number of dimensions. The default 1000 is good for 2 or maximum 3 dimensions” …
Can you provide any guidance as to what relationship should exist between the number of dimensions and the number of tests ? i.e. what’s a reasonable number of tests for 5 dimensions, 10, 100 ?
Can you explain the difference between 1 run with 5000 tests and 5 runs with 1000 tests ?
As far as CMAE is concerned … Maybe I’m missing something but it doesn’t seem that CMAE has anything in terms of speed over AB’s PSO or Tribes …
I tried CMAE out on a real world intelligent optimization problem with 15 variables trading 100 symbols by adding the required statement to the AFL …
Run time for CMAE to complete was 459 minutes …
Run times for AB’s PSO and Tribes to complete with 5 runs and 1000 tests was in the neighborhood of 75 minutes each with results being the same as CMAE.
As an FYI …
Run times for IO’s DE and PS to complete via their own internal decision making process w/o the help of additional cores ( servers ) was in the same neighborhood with times of 72 and 53 minutes respectively.
With the help of additional cores ( 7 ) IO’s DE and PSO ran to completion in 11 and 8 minutes respectively …
From: [email protected] [mailto:[EMAIL PROTECTED]] On Behalf Of Tomasz Janeczko
Sent: Friday, June 27, 2008 8:05 PM
To: [email protected]
Subject: Re: [amibroker] Re: The EASIEST way to use new optimizer engines
FYI: using new optimizer engine (cmae) to optimize seemingly
simple 3 parameter (ranging 1..100) system gives speed up
of more than 1000 times, as cmae optimizer is able to find best
value in less than 1000 backtests compared to one million backtests
using exhaustive search. It also outperforms PSO usually by factor of 10.
That is 500 times faster than you would get from exhaustive opt using your dual core
and 5 times faster than PSO on dual core.
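The arithmetic behind these figures can be reproduced in Python. This is a sketch under stated assumptions: the function and parameter names are hypothetical, CMA-ES is taken to run on a single core, and exhaustive search and PSO are assumed to scale perfectly across two cores.

```python
def speedups(values_per_parameter=100, parameters=3, cmae_backtests=1000,
             pso_factor=10, cores=2):
    """Speed-up of CMA-ES on one core versus exhaustive search and PSO.

    Assumes perfect scaling of the competing methods across `cores`.
    """
    exhaustive_backtests = values_per_parameter ** parameters
    vs_exhaustive = exhaustive_backtests / cmae_backtests
    return {
        "exhaustive_backtests": exhaustive_backtests,
        "vs_exhaustive_single_core": vs_exhaustive,
        "vs_exhaustive_multi_core": vs_exhaustive / cores,
        "vs_pso_multi_core": pso_factor / cores,
    }


print(speedups())
```

Three parameters with 100 values each give 100^3 = 1,000,000 exhaustive backtests, so 1000 CMA-ES backtests is a factor of 1000, or 500 once the exhaustive search is spread over two cores.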

CMA-ES delivers MORE in terms of speed with LESS development time.

Best regards,
Tomasz Janeczko
amibroker.com








