Re: [computer-go] MoGo paper at ICML

2007-06-23 Thread Yamato
The cumulative result is only given using the prior knowledge on top
of RAVE, but it could have been done the other way round and give the
same type of results. Each particular improvement is somehow
independent of the others.

I think I don't understand that.
What do you mean for the other way round?

All experiments (except the default policy) were played against GnuGo
level 10, not level 8.

That's surprising news to me...

--
Yamato
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] Congratulations to GNU and to MoGoBot19!

2007-06-23 Thread Brian Slesinsky

Maybe another way to put it:

In Fischer time, the time allowed to play the game is simply a
function of the number of moves in the game.  If white moves last,
this time is same for both players, otherwise black gets slightly
more.  At the beginning of the game, the time on the clock is the
amount of time left for a zero-move game where the player ponders
until running out of time.  After each move, the time is increased
because we now know that the game will be a little longer.

The practical upper limit on the amount of time a game will take to
play is limited by the maximum length of a game in moves.  If you can
estimate the number of moves in the game and add it to the clock, that
is the true time left in some sense, assuming you don't actually run
out of time and end the game early.

The way the clock works ensures a minimum amount of time for each move
in the end game. You don't really get more time by moving faster than
the amount given to you after each move; it just means that you're
using less of your total time (determined by the number of moves) so
there is more for the end game.

- Brian
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo paper at ICML

2007-06-23 Thread Yamato
Using prior knowledge on normal uct, and this was the use of prior
knowledge brought about the same improvement.

You mean, there is more improvement when using both?

It was gnugo default level, and we thought default was 8, but default is
actually 10. I don't see why it is so surprising, I guess it does not change
really the level of gnugo.

Because I have tested my program against GNU Go level 8 to compare the 
winning rate with MoGo. Now I see surely there is almost no change.

--
Yamato
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo paper at ICML

2007-06-23 Thread Sylvain Gelly

2007/6/23, Yamato [EMAIL PROTECTED]:


Using prior knowledge on normal uct, and this was the use of prior
knowledge brought about the same improvement.

You mean, there is more improvement when using both?



I mean that there is no need to have AMAF to get improvement by using prior
knowledge.


It was gnugo default level, and we thought default was 8, but default is
actually 10. I don't see why it is so surprising, I guess it does not
change
really the level of gnugo.

Because I have tested my program against GNU Go level 8 to compare the
winning rate with MoGo. Now I see surely there is almost no change.



Ok, I understand.

Sylvain
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MoGo paper at ICML

2007-06-23 Thread Yamato
 Using prior knowledge on normal uct, and this was the use of prior
 knowledge brought about the same improvement.

 You mean, there is more improvement when using both?

I mean that there is no need to have AMAF to get improvement by using prior
knowledge.

Sorry, what is AMAF?

And I have another question; Don't you use Q_RLGO anymore?
If so, would you explain the detail of the Q_MoGo heuristic?

--
Yamato
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo paper at ICML

2007-06-23 Thread Yamato
 Sorry, what is AMAF?

Sorry: All Moves As First :)

OK, I see.

Q_RLGO is not used in MoGo's versions which play online.
Q_MoGo(s,a) is:
- if (self atari(s,a)): 0
- if one pattern, among the patterns used in MoGo's simulation policy,
matches for move a in position s, then 1
- else 0.5

Thanks for that. These values were more extreme than my expectation.
I thought you use values like 0.4 or 0.6.

--
Yamato
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/