[Computer-go] UCB-1 tuned policy

Igor Polyakov Tue, 14 Apr 2015 03:37:49 -0700

I implemented UCB1-tuned in my basic UCB-1 go player, but it doesn'tseem like it makes a difference in self-play.

It seems like it's able to run 5-25% more simulations, which means it'sprobably exploiting deeper (and has less steps until it runs out of roomto play legal moves), but I have yet to see any strength improvements on9x9 boards.

As far as I understand, the only thing that's different is the formula.Has anyone actually seen any difference between the two algorithms?

_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go

[Computer-go] UCB-1 tuned policy

Reply via email to