Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-07 Thread Erik van der Werf
Hmm, sounds like I should experiment some more. Thanks! Erik On Thu, Feb 7, 2008 at 1:11 AM, Hideki Kato [EMAIL PROTECTED] wrote: Hi Erik, My program is very based on MoGo's report and the paper. Yes, I used FPU of 1.15. -Hideki Erik van der Werf: [EMAIL PROTECTED]: Hi Hideki,

RE: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Gian-Carlo Pascutto
I also implemented RAVE in Mango. There was a few points of improvements (around 60 Elo points with gnugo as reference), but as much as in the paper of Gelly and Silver :( (around 250 Elo points if I remember well) It might be that the effect of RAVE depends a lot on the simulation strategy.

RE: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Chaslot G (MICC)
I also implemented RAVE in Mango. There was a few points of improvements (around 60 Elo points with gnugo as reference), but as much as in the paper of Gelly and Silver :( (around 250 Elo points if I remember well) It might be that the effect of RAVE depends a lot on the simulation strategy.

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Erik van der Werf
Hi Guillaume, I think we talked about this before, but others may be interested as well. In my opinion the ICML paper on Rave has several weaknesses. It's been a while since I read the paper, but here are some I remember: (1) They compared Rave to plain UCT. If they would have compared it to a

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Sylvain Gelly
Hi Erik, (1) They compared Rave to plain UCT. If they would have compared it to a more sophisticated implementation (like the best Mogo before Rave) they probably could not have shown a spectacular improvement. The best Mogo before Rave was very close to plain UCT with the sequence-like

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Erik van der Werf
Hi Sylvain, Thanks for your reply! How do you like your new job? Do you miss CompGo? ;-) On Wed, Feb 6, 2008 at 2:20 PM, Sylvain Gelly [EMAIL PROTECTED] wrote: (1) They compared Rave to plain UCT. If they would have compared it to a more sophisticated implementation (like the best Mogo

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Hideki Kato
I found some data. GGMC Go v2r6, against GNU Go 3.7.10 level 10, 9x9, komi 7.5, 3000 playouts/move, 2000 games match: Without RAVE: winning rate was 23.1 +- 0.9% (-209 +- 9 ELO) With RAVE: winning rate was 65.3 +- 1.1% (+110 +- 8 ELO) Though this includes some other improvements, most

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Jason House
On Feb 6, 2008 11:39 AM, Gian-Carlo Pascutto [EMAIL PROTECTED] wrote: Hideki Kato wrote: 4) Before back-propagating the value of each playout, I setup a color table for all intersections of the board for speed-up, in fact (initialized with EMPTY). That is, fill the board (table[move] =

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Gian-Carlo Pascutto
Hideki Kato wrote: 4) Before back-propagating the value of each playout, I setup a color table for all intersections of the board for speed-up, in fact (initialized with EMPTY). That is, fill the board (table[move] = color) by tracing the moves and the colors returned by the playout forward

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Hideki Kato
Gian-Carlo Pascutto: [EMAIL PROTECTED]: Hideki Kato wrote: 4) Before back-propagating the value of each playout, I setup a color table for all intersections of the board for speed-up, in fact (initialized with EMPTY). That is, fill the board (table[move] = color) by tracing the moves and

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Erik van der Werf
Hi Hideki, Your results look similar to those of Mogo as reported in their icml paper. When you ran this experiment, did you use anything like FPU or progressive widening, or did you use Levente's original design which always selects unvisited moves first? Regards, Erik On Wed, Feb 6, 2008 at

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Sylvain Gelly
Hi Erik, In the ICML version of UCT without RAVE, you did not use your First Play Urgency, right? I think that using FPU has an effect similar to what others reported with their progressive widening. From what I've seen it looks like plain UCT, without FPU or progressive widening, has more

Re: [computer-go] More UCT / Monte-Carlo questions (Effect of rave)

2008-02-06 Thread Hideki Kato
Hi Erik, My program is very based on MoGo's report and the paper. Yes, I used FPU of 1.15. -Hideki Erik van der Werf: [EMAIL PROTECTED]: Hi Hideki, Your results look similar to those of Mogo as reported in their icml paper. When you ran this experiment, did you use anything like FPU or