Re: [Computer-go] UCB-1 tuned policy

Erik van der Werf Thu, 16 Apr 2015 00:17:39 -0700

Many observed that, but not everyone.
On 16 Apr 2015 07:38, "David Fotland" <[email protected]> wrote:


> I didn’t notice a difference.  Like everyone else, once I had RAVE
> implemented and added biases to the tree move selection, I found the UCT
> term made the program weaker, so I removed it.
>
> David
>
> > -----Original Message-----
> > From: Computer-go [mailto:[email protected]] On Behalf Of
> > Igor Polyakov
> > Sent: Tuesday, April 14, 2015 3:37 AM
> > To: [email protected]
> > Subject: [Computer-go] UCB-1 tuned policy
> >
> > I implemented UCB1-tuned in my basic UCB-1 go player, but it doesn't seem
> > like it makes a difference in self-play.
> >
> > It seems like it's able to run 5-25% more simulations, which means it's
> > probably exploiting deeper (and has fewer steps until it runs out of room
> > to play legal moves), but I have yet to see any strength improvements on
> > 9x9 boards.
> >
> > As far as I understand, the only thing that's different is the formula.
> > Has anyone actually seen any difference between the two algorithms?
> > _______________________________________________
> > Computer-go mailing list
> > [email protected]
> > http://computer-go.org/mailman/listinfo/computer-go
>

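For reference, the two formulas being compared in the quoted question can be sketched as follows. This is a minimal Python sketch using the standard UCB1 and UCB1-tuned definitions (Auer et al., 2002), assuming win/loss rewards in {0, 1}. The function and parameter names are hypothetical and are not taken from any poster's program.

```python
import math


def ucb1(wins, visits, total_visits):
    """UCB1 score: mean reward plus the standard exploration bonus.

    wins, visits: statistics of the candidate move (visits must be > 0).
    total_visits: number of simulations through the parent position.
    """
    mean = wins / visits
    return mean + math.sqrt(2.0 * math.log(total_visits) / visits)


def ucb1_tuned(wins, visits, total_visits):
    """UCB1-tuned score: the bonus is scaled by a variance estimate.

    Assumes each simulation returns 0 or 1, so the sample variance of
    the rewards is mean * (1 - mean).
    """
    mean = wins / visits
    log_term = math.log(total_visits) / visits
    # Upper confidence bound on the variance of this move's reward.
    variance_bound = mean * (1.0 - mean) + math.sqrt(2.0 * log_term)
    # 1/4 is the largest possible variance of a reward in [0, 1].
    return mean + math.sqrt(log_term * min(0.25, variance_bound))


if __name__ == "__main__":
    # Hypothetical statistics: 5 wins in 10 visits, 100 parent visits.
    print(ucb1(5, 10, 100), ucb1_tuned(5, 10, 100))
```

Because the variance factor is capped at 1/4, while UCB1 uses a constant factor of 2, the UCB1-tuned bonus is never larger than the UCB1 bonus for the same statistics. That may be consistent with the observation above that the tuned version exploits more deeply.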
