[computer-go] Rating variability on CGOS

Brian Sheppard Thu, 08 Oct 2009 15:33:11 -0700

>One must be very careful about proclaiming wild transitivity issues.  I'm
>not saying it's not an issue, there is some going on with every program on
>CGOS, but with less than 500 games between any two players you are going
>to get error margins of +/- 30-50 ELO or something like that.


Actually we are certain that significant differences are being observed. If
we pool the Pachi and Pebbles data, then the null hypothesis is that
Valkyria defeats both programs by 79%. The observed data differs by at least
3.5 standard deviations.

Note that we are talking about 150 rating points.




_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

[computer-go] Rating variability on CGOS

Reply via email to