very interesting. I wonder if it is possible to systematically find more of
those positions.

-Joseph


On Sun, 8 Dec 2019 at 07:52, Nikos Papachristou <[email protected]> wrote:

>
>
> On Thu, Dec 5, 2019 at 6:34 PM Timothy Y. Chow <[email protected]>
> wrote:
>
>>
>> > Regarding expected results, I also believe that backgammon bots are
>> very
>> > close to perfection and whatever improvements (from any approach) will
>> > be marginal.
>>
>> In order to determine whether a new network is doing better than the old
>> network, it helps to have examples of positions where the old network is
>> clearly playing poorly.  Here's one example of a game that I played
>> against eXtreme Gammon where the bot made a lot of obvious blunders:
>>
>> http://timothychow.net/cg/Games/7pt2015-05-24e%20Game%202.htm
>>
>> For example, search for "10/8 6/4(3)".  The bot's ridiculous play here
>> would not be among the top 50 plays of any halfway decent human player.
>> Admittedly this was XG but I would expect GNU to behave similarly, if not
>> in these specific positions then in similar ones.
>>
>>
> I analyzed the positioned you mention using GNU Backgammon 1.06.002-mingw
> 32-Bit 20180802.
> Since I am not am experienced GNUBG user, if any GNUBG dev spots anything
> wrong with the following, feel free to correct me.
>
> I started with a 4-ply evaluation and the "correct" move is No 49 in the
> list of best moves.
> ID: AABCgDsTg4MAAA:AQHpACAAAAAE
>
>     1. Cubeful 4-ply    18/16 13/7*                  Eq.: +0.138
>        0.574 0.000 0.000 - 0.426 0.104 0.060
>         4-ply cubeful prune [4ply]
>     2. Cubeful 4-ply    10/4 6/4                     Eq.: +0.136 (-0.003)
>        0.564 0.000 0.000 - 0.436 0.096 0.047
>         4-ply cubeful prune [4ply]
>     3. Cubeful 4-ply    23/21 13/7*                  Eq.: +0.117 (-0.022)
>        0.569 0.000 0.000 - 0.431 0.113 0.060
>         4-ply cubeful prune [4ply]
>     4. Cubeful 4-ply    18/16 10/6 5/3*              Eq.: +0.109 (-0.029)
>        0.566 0.000 0.000 - 0.434 0.111 0.063
>         4-ply cubeful prune [4ply]
> ...
>     49. Cubeful 0-ply    13/7* 5/3*                   Eq.: +0.115 (-0.023)
>        0.552 0.000 0.000 - 0.448 0.087 0.050
>         0-ply cubeful prune [expert]
>
> What happened is that a good move got pruned at 0-ply because the default
> move filter for 4-ply eval at 0-ply is 16. So the best move did not reach
> deeper plies for better evaluation. I suspect something similar happened in
> your game with XG.
> Then I changed this setting to 50, and after waiting a minute or two, the
> move gets to the number 1 spot:
>
>     1. Cubeful 4-ply    13/7* 5/3*                   Eq.: +0.164
>        0.582 0.000 0.000 - 0.418 0.099 0.060
>         4-ply cubeful prune [4ply]
>     2. Cubeful 4-ply    18/16 13/7*                  Eq.: +0.138 (-0.026)
>        0.574 0.000 0.000 - 0.426 0.104 0.060
>         4-ply cubeful prune [4ply]
>     3. Cubeful 4-ply    10/4 6/4                     Eq.: +0.136 (-0.028)
>        0.564 0.000 0.000 - 0.436 0.096 0.047
>         4-ply cubeful prune [4ply]
>     4. Cubeful 4-ply    13/7* 10/8                   Eq.: +0.125 (-0.039)
>        0.571 0.000 0.000 - 0.429 0.110 0.060
>         4-ply cubeful prune [4ply]
>     5. Cubeful 2-ply    23/21 13/7*                  Eq.: +0.153 (-0.011)
>        0.582 0.000 0.000 - 0.418 0.109 0.061
>         2-ply cubeful prune [world class]
>
> The moral: If one needs to experience the full power of the bg bots one
> needs to change the default settings which are configured for the average
> user. Whatever errors bots occasionally make at their evaluations, they
> make up by searching deeper.
>
> Nikos
>
> Playing around with positions like this will quickly disabuse anyone of
>> the illusion that "backgammon bots are very close to perfection."
>>
>> As I recall, in the past, people have tried specifically training neural
>> nets on positions like these, as well as "snake" positions where you have
>> to roll a prime for a long distance, and the problem was that it seemed
>> to
>> degrade performance on other types of positions.  It's possible that, as
>> Papachristou suggests, recent incremental improvements in regularization
>> algorithms might be good enough to overcome these difficulties.
>> Anecdotal
>> evidence from Robert Wachtel's revised version of "In the Game Until the
>> End" suggests that Xavier was able to improve eXtreme Gammon's post-coup
>> classique play significantly, without a wholesale switch to modern deep
>> learning methods.
>>
>> Tim
>>
>>

Reply via email to