On Dec 1, 2008, at 12:23 AM, Mark Boon <[EMAIL PROTECTED]> wrote:


On 30-nov-08, at 16:51, Jason House wrote:

You've claimed to be non-statistical, so I'm hoping the following is useful... You can compute the confidence that you made an improvement as:
Phi(# of standard deviations)
where Phi is the standard normal CDF (Phi(z) = (1 + erf(z/sqrt(2)))/2), and # of standard deviations =
(win rate - 0.5) / (0.5 / sqrt(#games))
The denominator, 0.5/sqrt(#games), is roughly the standard deviation of a measured win rate near 50% over #games games.

Erf has no closed-form expression, so in practice people use lookup tables to translate between standard deviations and confidence levels. More commonly, people set a goal confidence up front and translate it directly into a number of standard deviations (3.0 for 99.87%). This situation calls for a one-tailed test.

After about 20 or 30 games, this approximation is accurate and can be used for early termination of your test.
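The calculation above can be sketched in a few lines of Python (the helper name is mine, not anything from the thread; it uses the normal approximation, so trust it only after a few dozen games):

```python
import math

def confidence_of_improvement(wins, games):
    """Confidence that the true win rate exceeds 0.5, given an observed
    record of `wins` out of `games` (normal approximation, one-tailed)."""
    win_rate = wins / games
    # Standard deviation of a measured win rate near 50% is ~0.5/sqrt(n).
    sigma = 0.5 / math.sqrt(games)
    z = (win_rate - 0.5) / sigma  # number of standard deviations
    # One-tailed confidence via the standard normal CDF:
    # Phi(z) = (1 + erf(z/sqrt(2))) / 2
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# e.g. 60 wins in 100 games gives z = 2.0, confidence ~ 97.7%
```

With 60 wins in 100 games the win rate is 0.6, sigma is 0.05, so z = 2.0 and the confidence comes out around 97.7%.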


Lately I use twogtp for my test runs. It computes the winning percentage and puts a ± value after it in parentheses. Is that the value of one standard deviation? (I had always assumed so.) Even after 1,000 games it stays in the 1.5% neighbourhood.

Sounds like it.
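I haven't checked twogtp's source, but a one-standard-deviation error bar on a measured win rate would be the binomial standard error, which matches the numbers quoted above:

```python
import math

def win_rate_error_bar(wins, games):
    """One standard deviation of the observed win rate (binomial
    standard error): sqrt(p * (1 - p) / n)."""
    p = wins / games
    return math.sqrt(p * (1.0 - p) / games)

# At 1,000 games with p near 0.5 this is sqrt(0.25/1000) ~ 0.0158,
# i.e. about 1.6%, consistent with the "1.5% neighbourhood" observation.
```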


Maybe after 20-30 games the approximation is usually accurate. But if you perform tests often, you'll occasionally bump into the unlikely event where what you thought was a big improvement turns out to be no improvement at all, or the other way around. Only when I see 20+ games with a zero winning percentage do I stop the test, assuming I made a mistake.

The 20 or 30 game caveat would really only apply for extreme winning or losing streaks. Up until that point, confidence levels are not as high as one might expect from the approximation.
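For streaks and small samples, the exact binomial tail (under the null hypothesis of a fair 50% coin) avoids that over-optimism; a sketch, with a function name of my own choosing:

```python
import math

def exact_confidence(wins, games):
    """Exact one-tailed confidence that the engine is better than 50%:
    one minus the probability that a fair coin scores >= `wins` wins."""
    p_tail = sum(math.comb(games, k) for k in range(wins, games + 1)) / 2.0 ** games
    return 1.0 - p_tail

# For a 10-0 streak the normal approximation gives z = sqrt(10) ~ 3.16
# standard deviations (~99.92% confidence), while the exact tail is
# 1 - 1/1024 ~ 99.90% -- the approximation slightly overstates it.
```

The gap is small here but grows for shorter streaks, which is the caveat above: early on, the true confidence is a bit lower than the normal approximation suggests.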




Mark

_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
