This is interesting. Any comments as to why level 8 should be better than level 10?
Thanks for your attention. Terry McIntyre <[EMAIL PROTECTED]> ----- Forwarded Message ---- From: Weston Markham <[EMAIL PROTECTED]> To: computer-go <[EMAIL PROTECTED]> Sent: Wednesday, February 6, 2008 1:49:44 PM Subject: Re: [computer-go] Is Rémi correct? I know that other people have mentioned this sort of thing already, but the result of level 8 being better than level 10 matches my own experience with slightly older versions of gnugo. As I recall, 8 was the best, 9 a little worse, and 10 worse again. Increasing the level seems to improve play after that, but it dramatically increases the time. Weston On Feb 6, 2008 12:48 PM, Don Dailey <[EMAIL PROTECTED]> wrote: > Here is an update from the new 1000 game test using gungo at level 8 > instead of 10. > > Rank Name Elo + - games score oppo. draws > 1 Gnugo-3.7.11 1800 34 30 2186 97% 1137 0% > 2 Mogo_03 1507 48 56 186 16% 1800 0% > 3 Mogo_02 1202 43 51 1000 3% 1800 0% > 4 Mogo_01 1003 70 96 1000 1% 1800 0% > > The test, at this point, seems to indicate that gnugo at level 8 is > stronger than at level 10 because mogo is not doing as well as in the > previous test. It will be more meaningful when we get to levels close > to gnugo's strength. > > - Don > > > > As promised, to answer Rémi, I did a study with mogo vs Gnu at various > > levels. There is NO self play involved, Gnugo-3.7.11 is the only > > opponent for progressively higher rated version of Mogo. > > > > Here are the raw results so far: > > > > Rank Name Elo + - games score oppo. draws > > 1 Mogo_10 2319 72 60 500 95% 1800 0% > > 2 Mogo_11 2284 94 74 259 94% 1800 0% > > 3 Mogo_09 2234 57 49 500 92% 1800 0% > > 4 Mogo_08 2124 43 39 500 87% 1800 0% > > 5 Mogo_07 2016 35 33 500 78% 1800 0% > > 6 Mogo_06 1961 32 30 500 72% 1800 0% > > 7 Mogo_05 1814 28 28 500 52% 1800 0% > > 8 Gnugo-3.7.11 1800 13 13 5259 44% 1823 0% > > 9 Mogo_04 1711 29 29 500 37% 1800 0% > > 10 Mogo_03 1534 35 38 500 18% 1800 0% > > 11 Mogo_02 1281 60 72 500 5% 1800 0% > > 12 Mogo_01 1004 115 178 500 1% 1800 0% > > > > > > The issue is whether self-play results distort the rating of programs. > > In this case, we are only testing whether it distorts the ratings of > > Mogo since no other programs were tested. > > > > In the following table, I played up to 500 games between Gnugo and Mogo > > at various levels. The levels are the exact levels that correspond to > > the big scalability study. In the middle column I listed the > > ratings as computed by bayeselo in games against ONLY Gnugo and set the > > default rating of Gnugo to 1800, just as in the study. > > > > Unfortunately, I used level 10 in the gnugo only games but in the big > > study we use level 8. It's my understanding there is little difference > > between these 2 but we can probably assume Mogo might be a little better > > than indicated relative to the big scalability study. > > > > It looks like there indeed is a lot of distortion at the low end of the > > scale. Mogo seems much stronger at low levels than the larger > > scalability study indicated. > > > > At the higher levels, we also get a mismatch, where Mogo's rating > > doesn't seem as high when playing only Gnugo. This is as Rémi > > claims. > > > > One thing to note is that at higher levels it's more difficult to get an > > accurate rating. Mogo_10 is winning 95% of it's games against Gnugo, > > and an extra win or loss every few games can make a lot of difference. > > However I am inclined to believe this is real since it seems to hold for > > several upper levels. At level 7 it's only 42 ELO, but at levels > > beyond this it's over 100 ELO. > > > > I've never doubted that there is some intransivity between programs, but > > I am a little surprised that it is this much. Even if the comparison is > > slightly unfair due to Mogo playing a stronger version of Gnugo in this > > study, it's still seems like it must be at least 100 ELO. > > > > > > vers vs Gnu Study > > ---- ------ ----- > > 01 1004 688 > > 02 1281 1093 > > 03 1534 1331 > > 04 1711 1554 > > 05 1814 1751 > > 06 1961 1971 > > 07 2016 2058 > > 08 2124 2270 > > 09 2234 2347 > > 10 2319 2470 > > > > > > My suggestion to improve this situation is to play a few thousands games > > against a well rated Gnugo and set up mogo as a second anchor. > > > > - Don > > > > > > > > _______________________________________________ > > computer-go mailing list > > [EMAIL PROTECTED] > > http://www.computer-go.org/mailman/listinfo/computer-go/ > > > > > _______________________________________________ > computer-go mailing list > [EMAIL PROTECTED] > http://www.computer-go.org/mailman/listinfo/computer-go/ > _______________________________________________ computer-go mailing list [EMAIL PROTECTED] http://www.computer-go.org/mailman/listinfo/computer-go/ ____________________________________________________________________________________ Looking for last minute shopping deals? Find them fast with Yahoo! Search. http://tools.search.yahoo.com/newsearch/category.php?category=shopping _______________________________________________ gnugo-devel mailing list gnugo-devel@gnu.org http://lists.gnu.org/mailman/listinfo/gnugo-devel