[Bug-gnubg] Version 1.0 ?

Michael Petch Thu, 02 May 2013 12:17:22 -0700

Hi All,

I'm of the opinion that, given the stability of the current builds on
Unix and Windows platforms and what appears to be a measurable increase
in strength and better performance at 3-ply (Depreli studies suggest
it), we should consider ourselves at 1.0. If we aren't at 1.0 yet, then
we probably will never have a reason to be.


We have a stable product. Yes, it has some known bugs, but in general I
think we can finally pull it out of testing after more than a decade.
Does anyone have any objections?

If there aren't any sizable objections, it may require some
documentation tweaking (to bump the version from 0.91 to 1.0), and
possibly moving the version of the weights file to 1.0 as well.

Questions, comments, etc.: please feel free to make them known.
Preliminary (not complete) Depreli results are attached, using XG2
rollouts as of 20130427. A regression (using v0.91) still has to be done
against all decisions in the original match files to see if more
positions need rollouts beyond those in the current Depreli list. This
would likely increase the totals, but I don't believe the increase will
be substantial.

-- 
Michael Petch
GNU Backgammon Developer
OpenPGP FingerPrint=D81C 6A0D 987E 7DA5 3219 6715 466A 2ACE 5CAE 3304

Preliminary Depreli Results for GNUBG
Using Depreli positions from 04-27-2013
(5427 Decisions)

Bot Description                    Chequer Error  Missed Double   Wrong Double     Wrong Take     Wrong Pass  All Cube Errors     All Errors
--------------------------------------------------------------------------------------------------------------------------------------------
GNUBG v0.91 4-ply                   9.1071(1138)   1.1315(  47)   0.5550(  30)   1.4345(  36)   0.3077(   3)     3.4287( 116)  12.5358(1254)
GNUBG v0.91 3-ply Grandmaster      12.8651(1391)   0.7775(  27)   1.8143(  76)   0.8975(  25)   0.4208(   6)     3.9101( 134)  16.7752(1525)
GNUBG v0.91 2-ply CH(LF)/3-ply CU  15.8649(1545)   0.7775(  27)   1.8143(  76)   0.8975(  25)   0.4208(   6)     3.9101( 134)  19.7750(1679)
GNUBG v0.91 Supremo                15.8649(1545)   1.0627(  39)   1.2697(  58)   1.5792(  35)   0.4201(   3)     4.3317( 135)  20.1966(1680)  Incomplete/Close
GNUBG v0.91 2-ply WorldClass       16.6188(1559)   1.0627(  39)   1.2697(  58)   1.5792(  35)   0.4201(   3)     4.3317( 135)  20.9505(1694)
GNUBG v0.90 4-ply                  16.0467(1414)   3.2647( 108)   0.8644(  27)   4.5741(  58)   0.0969(   2)     8.8001( 195)  24.8468(1609)
GNUBG v0.90 Supremo                25.5800(1805)   3.4119(  97)   1.8000(  53)   5.0106(  59)   0.1567(   3)    10.3792( 212)  35.9592(2017)  Incomplete/Close
GNUBG v0.90 2-ply WorldClass       25.9465(1814)   3.4119(  97)   1.8000(  53)   5.0106(  59)   0.1567(   3)    10.3792( 212)  36.3257(2026)
GNUBG v0.90 3-ply Grandmaster      22.0227(1794)   0.8041(  19)  10.0117( 187)   1.1159(  19)   5.9247(  44)    17.8564( 269)  39.8791(2063)
GNUBG v0.90 2-ply CH(LF)/3-ply CU  25.5800(1805)   0.8041(  19)  10.0117( 187)   1.1159(  19)   5.9247(  44)    17.8564( 269)  43.4364(2074)
--------------------------------------------------------------------------------------------------------------------------------------------

Notes:

1. Incomplete: There are 7 moves that need to be rolled out for supremo, but
   the final result won't change much.
2. Positions were also analysed with GNUBG v0.14.3. They were almost identical
   to v0.90. This is to be expected given that both versions are based on the
   same neural net weights file (v0.15). Differences between the two are
   changes in the crashed/contact/race net processing and other small changes
   over the years to the neural net code.
3. 2-ply CH(LF)/3-ply CU is an experiment. 2-ply checker with large filter,
   3-ply cube.
4. All plies have pruning enabled for chequer and cube decision.
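
For anyone wanting to sanity-check the table: the two totals columns appear to
be plain sums of the component columns (All Cube Errors = Missed Double +
Wrong Double + Wrong Take + Wrong Pass, and All Errors = Chequer Error + All
Cube Errors), for both the figure and the count in parentheses. That is an
inference from the numbers, not something stated in the results. A minimal
Python sketch checking it on the "GNUBG v0.91 4-ply" row follows; the
dictionary keys and the check_row helper are made up for illustration.

```python
# Values copied from the "GNUBG v0.91 4-ply" row: (figure, count in parentheses).
# The key names are hypothetical labels, not anything GNUBG itself uses.
row = {
    "chequer": (9.1071, 1138),
    "missed_double": (1.1315, 47),
    "wrong_double": (0.5550, 30),
    "wrong_take": (1.4345, 36),
    "wrong_pass": (0.3077, 3),
    "all_cube": (3.4287, 116),
    "all_errors": (12.5358, 1254),
}

CUBE_KEYS = ("missed_double", "wrong_double", "wrong_take", "wrong_pass")


def check_row(row, tol=1e-3):
    """Return True if both totals columns equal the sums of their components."""
    cube_err = sum(row[k][0] for k in CUBE_KEYS)
    cube_cnt = sum(row[k][1] for k in CUBE_KEYS)
    total_err = row["chequer"][0] + row["all_cube"][0]
    total_cnt = row["chequer"][1] + row["all_cube"][1]
    return (
        abs(cube_err - row["all_cube"][0]) < tol      # cube figure adds up
        and cube_cnt == row["all_cube"][1]            # cube count adds up
        and abs(total_err - row["all_errors"][0]) < tol
        and total_cnt == row["all_errors"][1]
    )


print(check_row(row))
```

The tolerance only absorbs floating-point rounding of the four-decimal
figures; the counts are compared exactly.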

_______________________________________________
Bug-gnubg mailing list
[email protected]
https://lists.gnu.org/mailman/listinfo/bug-gnubg
