Re: [computer-go] Biasing nodes according to pattern gammas

Rémi Coulom Wed, 16 Dec 2009 11:57:57 -0800

Jacques Basaldúa wrote:

 > Petr Baudis wrote:

> > I wonder now, do you use separate set of gammas for simulations andnode> > biasing? Since I've found that the performance seems very bad if Idon't

 > > include some time-expensive features, since the gammas are then very
 > > off; I will probably simply generate two gamma sets, but perhaps it's
 > > enough to do some trick like merging features by computing weighted
 > > (geometric?) averages?


 > Rémi Coulom answered:
 > I learn two sets of gammas separately for the two sets of features.

I don't get it. Why do you need two sets one for the tree and one
for the playouts? To learn gammas, I use a database of games.
The patterns compete, some of them win. This is computed using a
Bradley-Terry model. At that time moves are just moves, not tree moves
or simulation moves. When that offline learned model 'best' fits the
moves played (55000 games x 100 moves each in my case) I am done, I
have a set of gamma values.

I use these for playouts and biasing the tree. What else are you doing?
How do you compute a set for the playouts and a set for the tree?
Do you adjust gamma values one by one playing games?

The problem is that I am not using the same set of features for biasingthe tree, and for playouts. Playouts only use fast light features. Thetree part uses slow complex features. In particular, I use patterns ofradius 3 and 4 in the tree, and only radius 3 in the playouts. When 3x3patterns are learnt together with r=4 patterns, they get different gammas.


Rémi
_______________________________________________
computer-go mailing list
[email protected]
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] Biasing nodes according to pattern gammas

Reply via email to