Re: [Computer-go] CGOS source on github

2021-01-21 Thread David Wu
One tricky thing is that there are some major nonlinearities between
different bots early in the opening that break Elo model assumptions quite
blatantly at these higher levels.

The most noticeable case of this is with Mi Yuting's flying dagger joseki.
I've noticed for example that in particular matchups between different
pairs of bots (e.g. one particular KataGo net as white versus ELF as black,
or one version of LZ as black versus some other version as white), maybe as
many as 30% of games will enter into this joseki and the preferences for
the bots may happen by chance to line up such that consistently they will
play down a path where one side hits a blind spot and begins the game with
an early disadvantage. Each different bot may have different preferences
such that arbitrarily each possible pairing randomly runs into such a trap
or not.

And, having significant early-game temperature in the bot itself doesn't
always help as much as you would think because this particular joseki is so
sharp that a particular bot could easily have such a strong preference for
one path or another (even when it is ultimately wrong) so as to override
any reasonable temperature. Sometimes, adding temperature or extra
randomness simply only mildly changes the frequency of the sequence, or
just varies the time before the joseki and trap/blunder happens anyways.

If games are to begin from the empty board, I'm not sure there's an easy
way around this except having a very large variety of opponents.

One thing that I'm pretty sure would mostly "fix" the problem (in the sense
of producing a smoother metric of general strength in a variety of
positions not heavily affected by just a few key lines) would be to
semi-arbitrarily take a very large sampling of positions from a wide range
of human professional games, from say, move 20, and have bots play starting
from these sampled positions, in pairs once with each color. This would
still include many AI openings, because of the way human pros in the last
3-4 years have quickly integrated and experimented with them, but would
also introduce a lot more variety in general than would occur in any
head-to-head matchup.

This is almost surely a *smaller *problem than simply having enough games
mixing between different long-running bots to anchor the Elo system. And it
is not the only way major nontransitivities can show up, (e.g. ladders).
But to take a leaf from computer Chess, playing from sampled forced
openings seems to be a common practice there and maybe it's worth
considering in computer Go as well, even if it only fixes what is currently
the smaller of the issues.


On Thu, Jan 21, 2021 at 12:01 PM Rémi Coulom  wrote:

> Thanks for computing the new rating list.
>
> I feel it did not fix anything. The old Zen, cronus, etc.have almost no
> change at all.
>
> So it is not a good fix, in my opinion. No need to change anything to the
> official ratings.
>
> The fundamental problem seems that the Elo rating model is too wrong for
> this data, and there is no easy fix for that.
>
> Long ago, I had thought about using a more complex multi-dimensional Elo
> model. The CGOS data may be a good opportunity to try it. I will try when I
> have some free time.
>
> Rémi
> ___
> Computer-go mailing list
> Computer-go@computer-go.org
> http://computer-go.org/mailman/listinfo/computer-go
>
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go


Re: [Computer-go] 12 UEC Cup in March

2021-01-21 Thread lijun
您好!我是江苏君华特种工程塑料制品有限公司的总经理李军,我的手机号码为13382868677,感谢您发来的邮件!我会尽快处理答复您!
Hello,I am Jolly Li, the general manager of Jiangsu Junhua High Performance 
Engineering Plastic Products Co.,Ltd. Thanks for your mail ,it is well 
received,I will get back to you as soon as possible.
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go


[Computer-go] 12 UEC Cup in March

2021-01-21 Thread Rémi Coulom
Hi,

I had not noticed, but there will be an online UEC Cup in March:
http://entcog.c.ooco.jp/entcog/new_uec/

It seems that the English page there has the wrong date:
http://entcog.c.ooco.jp/entcog/new_uec/en/outline.html

If I understand correctly it will take place on March 20th-21st.

I will register.

Rémi
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go


Re: [Computer-go] CGOS source on github

2021-01-21 Thread Rémi Coulom
Thanks for computing the new rating list.

I feel it did not fix anything. The old Zen, cronus, etc.have almost no
change at all.

So it is not a good fix, in my opinion. No need to change anything to the
official ratings.

The fundamental problem seems that the Elo rating model is too wrong for
this data, and there is no easy fix for that.

Long ago, I had thought about using a more complex multi-dimensional Elo
model. The CGOS data may be a good opportunity to try it. I will try when I
have some free time.

Rémi
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go


Re: [Computer-go] CGOS source on github

2021-01-21 Thread Hiroshi Yamashita

Hi,

This is original BayesElo. I updated manually. This is latest.
http://www.yss-aya.com/cgos/19x19/bayes.html
CrazyStone-18.044065
CrazyStone-81b-TiV  4032
Zen-15.7-3c1g   3999
CrazyStone-57-TiV   3618

This renames CrazyStone-57-TiV to CrazyStone-18.04.
http://www.yss-aya.com/cgos/19x19/bayes_20210121_rename_CrazyStone-57-TiV_to_CrazyStone-18.04.html
CrazyStone-81b-TiV  4051
Zen-15.7-3c1g   3968
CrazyStone-18.043778

Would you like to continue to rename CrazyStone-57-TiV?
The drift looks fixed, but a little for other programs.

Thanks,
Hiroshi Yamashita
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go


Re: [Computer-go] CGOS source on github

2021-01-21 Thread Rémi Coulom
I checked, and CrazyStone-57-TiV is using the same neural network and
hardware as CrazyStone-18.04. Batch size, cuDNN version, and time
management heuristics may have changed, but I expect that strength should
be almost identical. CrazyStone-57-TiV may be a little stronger.

So it seems that the rating drift over 3 years is about 450 Elo points, and
the "All Time Ranks" are a bit meaningless.

Can you produce a list where CrazyStone-57-TiV is renamed to
CrazyStone-18.04? It may be enough to fix the drift.

I need the machine for something else, so I disconnected the GPU version.
CrazyStone-81-15po is running 15 playouts per move on the CPU of a small
machine, and will stay.

Rémi
___
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go