Re: [computer-go] MoGo v.s. Kim rematch (Jason House's paper)

2008-09-27 Thread Jason House



Sent from my iPhone

On Sep 24, 2008, at 5:16 PM, Jason House [EMAIL PROTECTED]  
wrote:


On Sep 24, 2008, at 2:40 PM, Jacques Basaldúa [EMAIL PROTECTED] wr 
ote:




Therefore, the variance of the normal that best approximates the  
distribution of both RAVE and

wins/(wins + losses) is the same n·p·(1-p)


See above, it's slightly different.


If this is true, the variance you are measuring from the samples  
does not contain any information
about the precision of the estimators. If someone understands this  
better, please explain it to

the list.


This will get covered in my next revision. A proper discussion is  
too much to type with my thumb...


My paper writing time is less than I hadhiped, so here's a quick and  
dirty answer.


For a fixed win rate, the probabilities of a specific number of wins  
and losses follows the binomial distribution. That distribution keeps  
p (probability of winning) constant and the number of observed wins  
and losses variable.


When trying to reverse this process, the wins and losses are kept  
constant and p varies. Essentially prob(p=x) is proportional to  
(x^wins)(1-x)^losses.


This is a Beta distribution with known mean, mode, variance, etc...  
It's these values which should be used for approximating the win rate  
estimator as a normal distion. Does that makes sense?


I've glossed over a very important detail when reversing. Bayes  
Theorem requires some extra a priori information. My preferred  
handling alters the reversed equation's exponents a bit but the basic  
conclusion (of a beta distribution) is the same.___

computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch (Jason House's paper)

2008-09-27 Thread Álvaro Begué
On Fri, Sep 26, 2008 at 9:29 AM, Jason House
[EMAIL PROTECTED] wrote:


 Sent from my iPhone

 On Sep 24, 2008, at 5:16 PM, Jason House [EMAIL PROTECTED]
 wrote:

 On Sep 24, 2008, at 2:40 PM, Jacques Basaldúa [EMAIL PROTECTED] wrote:


 Therefore, the variance of the normal that best approximates the
 distribution of both RAVE and
 wins/(wins + losses) is the same n·p·(1-p)

 See above, it's slightly different.


 If this is true, the variance you are measuring from the samples does not
 contain any information
 about the precision of the estimators. If someone understands this
 better, please explain it to
 the list.

 This will get covered in my next revision. A proper discussion is too much
 to type with my thumb...

 My paper writing time is less than I hadhiped, so here's a quick and dirty
 answer.

 For a fixed win rate, the probabilities of a specific number of wins and
 losses follows the binomial distribution. That distribution keeps p
 (probability of winning) constant and the number of observed wins and losses
 variable.

 When trying to reverse this process, the wins and losses are kept constant
 and p varies. Essentially prob(p=x) is proportional to (x^wins)(1-x)^losses.

 This is a Beta distribution with known mean, mode, variance, etc... It's
 these values which should be used for approximating the win rate estimator
 as a normal distion. Does that makes sense?

 I've glossed over a very important detail when reversing. Bayes Theorem
 requires some extra a priori information. My preferred handling alters the
 reversed equation's exponents a bit but the basic conclusion (of a beta
 distribution) is the same.

Maybe I can say it a little more precisely. Before we have collected
any data, let's use a uniform prior for p. After we sample the move a
number of times, we obtain w wins and l losses. Bayes's theorem tells
us that the posterior probability distribution is a beta distribution
B(w+1,l+1) (see http://en.wikipedia.org/wiki/Beta_distribution for
details).

An implication of this is that the expected value of p after w wins
and l losses is (w+1)/(w+l+2). This is the same as initializing w=l=1
before you have any information and then using w/(w+l) as your winning
rate, which some people have done intuitively, but it's clear that
it's not just a kludge. I'll use the letter r for the value of the
winning rate.

The estimate of the variance is (w+1)*(l+1)/((w+l+2)^2*(w+l+3)), which
is r*(1-r)/(w+l+3). The simple UCB formula uses an estimate of the
variance that is simply 1/visits, so perhaps one should modify the
formula by multiplying that estimate by r*(1-r), which means that the
variance is smaller in positions that look like clear victories for
one side. I don't know if this makes any difference in practice, but I
doubt it.

Álvaro.
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch (Jason House's paper)

2008-09-27 Thread Jason House



Sent from my iPhone

On Sep 27, 2008, at 10:14 AM, Álvaro Begué [EMAIL PROTECTED]  
wrote:



On Fri, Sep 26, 2008 at 9:29 AM, Jason House
[EMAIL PROTECTED] wrote:



Sent from my iPhone

On Sep 24, 2008, at 5:16 PM, Jason House  
[EMAIL PROTECTED]

wrote:

On Sep 24, 2008, at 2:40 PM, Jacques Basaldúa [EMAIL PROTECTED] 
 wrote:





Therefore, the variance of the normal that best approximates the
distribution of both RAVE and
wins/(wins + losses) is the same n·p·(1-p)


See above, it's slightly different.


If this is true, the variance you are measuring from the samples  
does not

contain any information
about the precision of the estimators. If someone understands this
better, please explain it to
the list.


This will get covered in my next revision. A proper discussion is  
too much

to type with my thumb...


My paper writing time is less than I hadhiped, so here's a quick  
and dirty

answer.

For a fixed win rate, the probabilities of a specific number of  
wins and

losses follows the binomial distribution. That distribution keeps p
(probability of winning) constant and the number of observed wins  
and losses

variable.

When trying to reverse this process, the wins and losses are kept  
constant
and p varies. Essentially prob(p=x) is proportional to (x^wins)(1- 
x)^losses.


This is a Beta distribution with known mean, mode, variance, etc...  
It's
these values which should be used for approximating the win rate  
estimator

as a normal distion. Does that makes sense?

I've glossed over a very important detail when reversing. Bayes  
Theorem
requires some extra a priori information. My preferred handling  
alters the
reversed equation's exponents a bit but the basic conclusion (of a  
beta

distribution) is the same.


Maybe I can say it a little more precisely. Before we have collected
any data, let's use a uniform prior for p. After we sample the move a
number of times, we obtain w wins and l losses. Bayes's theorem tells
us that the posterior probability distribution is a beta distribution
B(w+1,l+1) (see http://en.wikipedia.org/wiki/Beta_distribution for
details).



When I originally posted about this stuff and modifying the UCB  
formula, the uniform prior was a major sticking point for people. It  
is my preferred handling when no domain knowledge/heuristics are used.





An implication of this is that the expected value of p after w wins
and l losses is (w+1)/(w+l+2). This is the same as initializing w=l=1
before you have any information and then using w/(w+l) as your winning
rate, which some people have done intuitively, but it's clear that
it's not just a kludge. I'll use the letter r for the value of the
winning rate.

The estimate of the variance is (w+1)*(l+1)/((w+l+2)^2*(w+l+3)), which
is r*(1-r)/(w+l+3). The simple UCB formula uses an estimate of the
variance that is simply 1/visits, so perhaps one should modify the
formula by multiplying that estimate by r*(1-r), which means that the
variance is smaller in positions that look like clear victories for
one side. I don't know if this makes any difference in practice, but I
doubt it.



The UCB1-Tuned formula aimed at a similar effect. I know the MoGo team  
abandoned it because they didn't see a strength gain, just added  
complexity. In fact, they don't even use UCB's at all anymore.


Also, note that when initializing wins and losses to 1, n=wins+losses  
and p=wins/n, the variance becomes p(1-p)/(n+1) which is very close to  
a nieve binomial distribution of p(1-p)/n with win and loss  
initialization.





___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-25 Thread Jason House
On Sep 25, 2008, at 12:12 AM, David Fotland [EMAIL PROTECTED] 
games.com wrote:


This is an interesting idea, but do you have any actual results?  If  
you

implement this kind of rave formula do you get a stronger program?


I don't have enough data to say anything except that it did not have a  
large (100 ELO+) change.


I switched to this method just before the July KGS tournament. Both  
the old and new versions were untuned, but based on parameters I've  
heard on this list. My testing on CGOS is tough to draw conclusions  
from.






David


-Original Message-
From: [EMAIL PROTECTED] [mailto:computer-go-
[EMAIL PROTECTED] On Behalf Of Jason House
Sent: Wednesday, September 24, 2008 4:34 AM
To: computer-go
Subject: Re: [computer-go] MoGo v.s. Kim rematch

On Tue, 2008-09-23 at 18:08 -0300, Douglas Drumond wrote:

Attached is a quick write up of what I was talking about with some

math.


PS: Any tips on cleanup and making it a mini publication would be

appreciated.  I've never published a paper before.  Would this be too
small?



Better add an abstract, but what I missed most was bibliography.


Ask and you shall receive :)
Actually, I spent most of my free time learning Tex/Lyx, so there are
very few changes in this version.  I'm out of time for a while, so I
figured I'd just share what I have so far.





[]'s




Douglas Drumond
-
Computer Engineering
FEEC/IC - Unicamp
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/




___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


RE: [computer-go] MoGo v.s. Kim rematch

2008-09-24 Thread David Fotland
This is an interesting idea, but do you have any actual results?  If you
implement this kind of rave formula do you get a stronger program?

David

 -Original Message-
 From: [EMAIL PROTECTED] [mailto:computer-go-
 [EMAIL PROTECTED] On Behalf Of Jason House
 Sent: Wednesday, September 24, 2008 4:34 AM
 To: computer-go
 Subject: Re: [computer-go] MoGo v.s. Kim rematch
 
 On Tue, 2008-09-23 at 18:08 -0300, Douglas Drumond wrote:
   Attached is a quick write up of what I was talking about with some
 math.
  
   PS: Any tips on cleanup and making it a mini publication would be
 appreciated.  I've never published a paper before.  Would this be too
 small?
 
 
  Better add an abstract, but what I missed most was bibliography.
 
 Ask and you shall receive :)
 Actually, I spent most of my free time learning Tex/Lyx, so there are
 very few changes in this version.  I'm out of time for a while, so I
 figured I'd just share what I have so far.
 
 
 
 
  []'s
 
 
 
 
  Douglas Drumond
  -
  Computer Engineering
  FEEC/IC - Unicamp
  ___
  computer-go mailing list
  computer-go@computer-go.org
  http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-23 Thread David Doshay

On 22, Sep 2008, at 10:50 PM, Hideki Kato wrote:



David Doshay: [EMAIL PROTECTED]:

It was 800, just like last time, but the networking had been upgraded
from ethernet to infiniband. Olivier said that this should have  
been a

good improvement because he felt that communication overhead was
significant.


Really previous Huygens used Ethernet?  It's hard to believe...

Hideki


I thought so as well, but Olivier wrote to me:

Begin forwarded message:


From: Olivier Teytaud [EMAIL PROTECTED]
Date: 6, September 2008 2:07:42 AM PDT
To: David Doshay [EMAIL PROTECTED]
Subject: Re: [computer-go] yet a mogo vs human game

Hi David,
...

We will have at least the same number of cores, probably more, and  
we will very likely have a better hardware -
the infiniband network should be available, and this makes a big  
difference.


...
Best regards,
Olivier



So, perhaps Huygens has both and they were not using it last time,
or maybe they brought Huygens up with E-net and then upgraded.

But Mogo did not use it for the Portland exhibition, but did use
infiniband for the rematch.

Cheers,
David
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-23 Thread Hideki Kato
David,

I've found a description that Infiniband was improved from 2 x 4X IB 
(20 Gbps) to 8 x 8X IB (160 Gbps) on Jun 2008 at the bottom of 6th 
page of a pdf about Huygens system:
https://www.os3.nl/_media/2007-2008/courses/inr/week7/sne_20080320_walter.pdf

I guess that is the better hardware Olivier wrote.

Hideki

David Doshay: [EMAIL PROTECTED]:
On 22, Sep 2008, at 10:50 PM, Hideki Kato wrote:


 David Doshay: [EMAIL PROTECTED]:
 It was 800, just like last time, but the networking had been upgraded
 from ethernet to infiniband. Olivier said that this should have  
 been a
 good improvement because he felt that communication overhead was
 significant.

 Really previous Huygens used Ethernet?  It's hard to believe...

 Hideki

I thought so as well, but Olivier wrote to me:

Begin forwarded message:

 From: Olivier Teytaud [EMAIL PROTECTED]
 Date: 6, September 2008 2:07:42 AM PDT
 To: David Doshay [EMAIL PROTECTED]
 Subject: Re: [computer-go] yet a mogo vs human game

 Hi David,
 ...

 We will have at least the same number of cores, probably more, and  
 we will very likely have a better hardware -
 the infiniband network should be available, and this makes a big  
 difference.

 ...
 Best regards,
 Olivier


So, perhaps Huygens has both and they were not using it last time,
or maybe they brought Huygens up with E-net and then upgraded.

But Mogo did not use it for the Portland exhibition, but did use
infiniband for the rematch.

Cheers,
David
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
--
[EMAIL PROTECTED] (Kato)
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-23 Thread Jason House
On Mon, Sep 22, 2008 at 1:21 PM, Łukasz Lew [EMAIL PROTECTED] wrote:

 Hi,

 On Mon, Sep 22, 2008 at 17:58, Jason House [EMAIL PROTECTED]
 wrote:
  On Sep 22, 2008, at 7:59 AM, Magnus Persson [EMAIL PROTECTED]
 wrote:
 
  The results of the math are most easilly expressed in terms of inverse
  variance (iv=1/variance)
 
  Combined mean = sum( mean * iv )
  Combined iv = sum( iv )
 
  I'll try to do a real write-up if anyone is interested.

 I am very interested. :)

 Lukasz



Attached is a quick write up of what I was talking about with some math.

PS: Any tips on cleanup and making it a mini publication would be
appreciated.  I've never published a paper before.  Would this be too small?


RAVE.doc
Description: MS-Word document
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MoGo v.s. Kim rematch

2008-09-23 Thread Douglas Drumond
 Attached is a quick write up of what I was talking about with some math.

 PS: Any tips on cleanup and making it a mini publication would be 
 appreciated.  I've never published a paper before.  Would this be too small?


Better add an abstract, but what I missed most was bibliography.


[]'s




Douglas Drumond
-
Computer Engineering
FEEC/IC - Unicamp
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-23 Thread Darren Cook
 It was 800, just like last time, but the networking had been upgraded
 from ethernet to infiniband. Olivier said that this should have been a
 good improvement because he felt that communication overhead was
 significant.

I thought Olivier had previously said there was very little overhead. E.g.:
 http://computer-go.org/pipermail/computer-go/2008-May/015068.html
 http://www.mail-archive.com/computer-go@computer-go.org/msg07953.html

I took this 95% to mean that giving Mogo 760 (800x0.95) times the
thinking time on a single core would be the same strength as Mogo on 800
cores.

Assuming a rule of thumb that doubling the playouts is worth one rank
(*), increasing network speed will surely have very little effect on the
strength?

Darren

*: That is from memory of discussion here; I hope somebody will correct
me with a more accurate thumb.

 We will have at least the same number of cores, probably more, and we
 will very likely have a better hardware -
 the infiniband network should be available, and this makes a big
 difference.


-- 
Darren Cook, Software Researcher/Developer
http://dcook.org/mlsn/ (English-Japanese-German-Chinese-Arabic
open source dictionary/semantic network)
http://dcook.org/work/ (About me and my work)
http://dcook.org/blogs.html (My blogs and articles)
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread Magnus Persson

Quoting Mark Boon [EMAIL PROTECTED]:


Playing out that fake ladder in the first game meant an instant loss.
Surprising. And embarassing. Any information on the number of
processors used?



The interesting question is if there is a silly bug or something more  
sophisticated.


I have struggled with ladders in Valkyria and it is often really hard  
to tell what causes these problems. In Leksand most games on 19x19  
where lost in a ways similar to the recent mogo game. I could not find  
an obvious problem with the playouts at least not in terms of an  
easily fixable bug. Note that Valkyria reads out 99% of all ladders  
correctly both in the tree and in the playouts.


What I realized was that AMAF in combination with heavy playouts  
causes some serious biases, for some kinds of very bad moves such that  
AMAF completely misevaluate them.


In the case of the ladders the heavy playouts of Valkyria correctly  
prunes playing out ladders for the loser. But sometimes in the  
playouts the ladder is broken and after that there is a chance that  
the stones escape anyway. This means that almost always when the  
escaping move is played it is a good move! Thus AMAF will assign a  
very good score to this move


My solutions to this was simply to turn off AMAF-eval for all shapes  
commonly misevaluated for ladders. But I think this problem is true  
for many shapes
in general. What makes ladders special is that the problem repeats it  
self and the effect get stronger and thus even more likely the larger  
the ladder gets.


I think a better solution would be to modify AMAF in some way to avoid  
these problems, or perhaps change the playouts in a way to balance the  
problem. Does anyone know something to do about it or have any ideas?


-Magnus


___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread Don Dailey
I'm curious about a couple of things in particular.   Is this a bug and
how much time would be required for Mogo to have played the correct move
if it wasn't. 

Of course I'm also interested in ways to solve this with less deep
searches or better play-outs.

- Don
 

On Mon, 2008-09-22 at 13:59 +0200, Magnus Persson wrote:
 Quoting Mark Boon [EMAIL PROTECTED]:
 
  Playing out that fake ladder in the first game meant an instant loss.
  Surprising. And embarassing. Any information on the number of
  processors used?
 
 
 The interesting question is if there is a silly bug or something more  
 sophisticated.
 
 I have struggled with ladders in Valkyria and it is often really hard  
 to tell what causes these problems. In Leksand most games on 19x19  
 where lost in a ways similar to the recent mogo game. I could not find  
 an obvious problem with the playouts at least not in terms of an  
 easily fixable bug. Note that Valkyria reads out 99% of all ladders  
 correctly both in the tree and in the playouts.
 
 What I realized was that AMAF in combination with heavy playouts  
 causes some serious biases, for some kinds of very bad moves such that  
 AMAF completely misevaluate them.
 
 In the case of the ladders the heavy playouts of Valkyria correctly  
 prunes playing out ladders for the loser. But sometimes in the  
 playouts the ladder is broken and after that there is a chance that  
 the stones escape anyway. This means that almost always when the  
 escaping move is played it is a good move! Thus AMAF will assign a  
 very good score to this move
 
 My solutions to this was simply to turn off AMAF-eval for all shapes  
 commonly misevaluated for ladders. But I think this problem is true  
 for many shapes
 in general. What makes ladders special is that the problem repeats it  
 self and the effect get stronger and thus even more likely the larger  
 the ladder gets.
 
 I think a better solution would be to modify AMAF in some way to avoid  
 these problems, or perhaps change the playouts in a way to balance the  
 problem. Does anyone know something to do about it or have any ideas?
 
 -Magnus
 
 
 ___
 computer-go mailing list
 computer-go@computer-go.org
 http://www.computer-go.org/mailman/listinfo/computer-go/


signature.asc
Description: This is a digitally signed message part
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread terry mcintyre
 Consider this as tentative, since I heard it about 3rd-hand, but I believe the 
number of processors used to have been 3000.


Congratulations to the Mogo team; good luck improving your program to deal with 
the ladder and life-and-death issues.

Looking forward to further information!

I have always wondered if AMAF is a feature or a bug. There are many situations 
where the order of moves is crucial; A before B wins, B before A loses; ladders 
are a classic example where the ordering of moves is utterly important. AMAF 
seems to assume that order doesn't matter. Of course, there are many other 
positions where this assumption is true; that is why it sometimes yields an 
improvement in processing speed, but it seems risky.

Ladders are also a classic case where two patterns can look very similar, but 
be very different. When you capture a ladder, you are in a very good position. 
But if the stones under attack have just one extra liberty, the position may 
look like a ladder, but your target will escape, and your stones will be full 
of cutting points; the proper evaluation for that position would be much 
harsher. More generally, whenever I see a Monte Carlo program lose, it is 
usually a semeai where being one liberty behind or one ahead makes all the 
difference. We call these capturing races in English for a reason; being 
ahead or behind by one liberty matters a great deal. To make life interesting, 
there are loose ladder constructs where an extra liberty does not help the 
fleeing stones; they still get corraled and captured.

These corner cases are tough, but many games hinge on correctly reading out the 
exact consequences of life-and-death struggles.

Terry McIntyre [EMAIL PROTECTED]


Go is very hard. The more I learn about it, the less I know. -Jie Li, 9 dan

 On Mon, 2008-09-22 at 13:59 +0200, Magnus Persson wrote:
  Quoting Mark Boon :
  
   Playing out that fake ladder in the first game meant an instant loss.
   Surprising. And embarassing. Any information on the number of
   processors used?


  
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread Don Dailey
I think AMAF is a feature not a bug.   It's only a matter of how
judiciously it's applied.   

Also, almost any evaluation feature in a game playing program is a bug -
meaning it is an imperfect approximation of what you really want.  

Of course it could turn out that AMAF got them in trouble in this game.
The Mogo team will probably analyze the reason for the problem.But
as long as they are playing strong professional players they are going
to have something to debug and analyze!

- Don


 
On Mon, 2008-09-22 at 06:06 -0700, terry mcintyre wrote:
 Consider this as tentative, since I heard it about 3rd-hand, but I believe 
 the number of processors used to have been 3000.
 
 
 Congratulations to the Mogo team; good luck improving your program to deal 
 with the ladder and life-and-death issues.
 
 Looking forward to further information!
 
 I have always wondered if AMAF is a feature or a bug. There are many 
 situations where the order of moves is crucial; A before B wins, B before A 
 loses; ladders are a classic example where the ordering of moves is utterly 
 important. AMAF seems to assume that order doesn't matter. Of course, there 
 are many other positions where this assumption is true; that is why it 
 sometimes yields an improvement in processing speed, but it seems risky.
 
 Ladders are also a classic case where two patterns can look very similar, but 
 be very different. When you capture a ladder, you are in a very good 
 position. But if the stones under attack have just one extra liberty, the 
 position may look like a ladder, but your target will escape, and your 
 stones will be full of cutting points; the proper evaluation for that 
 position would be much harsher. More generally, whenever I see a Monte Carlo 
 program lose, it is usually a semeai where being one liberty behind or one 
 ahead makes all the difference. We call these capturing races in English 
 for a reason; being ahead or behind by one liberty matters a great deal. To 
 make life interesting, there are loose ladder constructs where an extra 
 liberty does not help the fleeing stones; they still get corraled and 
 captured.
 
 These corner cases are tough, but many games hinge on correctly reading out 
 the exact consequences of life-and-death struggles.
 
 Terry McIntyre [EMAIL PROTECTED]
 
 
 Go is very hard. The more I learn about it, the less I know. -Jie Li, 9 dan
 
  On Mon, 2008-09-22 at 13:59 +0200, Magnus Persson wrote:
   Quoting Mark Boon :
   
Playing out that fake ladder in the first game meant an instant loss.
Surprising. And embarassing. Any information on the number of
processors used?
 
 
   
 ___
 computer-go mailing list
 computer-go@computer-go.org
 http://www.computer-go.org/mailman/listinfo/computer-go/


signature.asc
Description: This is a digitally signed message part
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

RE: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread David Fotland
AMAF certainly helps to do move ordering when there is little other
information.  With good prior heuristics or enough actual playouts, it
should not be weighted very highly.  AMAF finds good moves, but it often
bias heavily for or against moves.  In ManyFaces, AMAF (actually RAVE) is
worth between 5% and 10% wins against gnugo.

I've seen similar ladder problems, and it is not AMAF, it's caused by the
playouts, when they can't read ladders.  It's easy to add various hacks to
prevent playing out simple ladders, but the one in this game had an extra
liberty (if I remember correctly).  A general solution is a little tricky.

David

 -Original Message-
 From: [EMAIL PROTECTED] [mailto:computer-go-
 [EMAIL PROTECTED] On Behalf Of Don Dailey
 Sent: Monday, September 22, 2008 6:23 AM
 To: computer-go
 Subject: Re: [computer-go] MoGo v.s. Kim rematch
 
 I think AMAF is a feature not a bug.   It's only a matter of how
 judiciously it's applied.
 
 Also, almost any evaluation feature in a game playing program is a bug
 - meaning it is an imperfect approximation of what you really want.
 
 Of course it could turn out that AMAF got them in trouble in this game.
 The Mogo team will probably analyze the reason for the problem.But
 as long as they are playing strong professional players they are going
 to have something to debug and analyze!
 
 - Don
 
 
 
 On Mon, 2008-09-22 at 06:06 -0700, terry mcintyre wrote:
  Consider this as tentative, since I heard it about 3rd-hand, but I
 believe the number of processors used to have been 3000.
 
 
  Congratulations to the Mogo team; good luck improving your program to
 deal with the ladder and life-and-death issues.
 
  Looking forward to further information!
 
  I have always wondered if AMAF is a feature or a bug. There are many
 situations where the order of moves is crucial; A before B wins, B
 before A loses; ladders are a classic example where the ordering of
 moves is utterly important. AMAF seems to assume that order doesn't
 matter. Of course, there are many other positions where this assumption
 is true; that is why it sometimes yields an improvement in processing
 speed, but it seems risky.
 
  Ladders are also a classic case where two patterns can look very
 similar, but be very different. When you capture a ladder, you are in a
 very good position. But if the stones under attack have just one extra
 liberty, the position may look like a ladder, but your target will
 escape, and your stones will be full of cutting points; the proper
 evaluation for that position would be much harsher. More generally,
 whenever I see a Monte Carlo program lose, it is usually a semeai where
 being one liberty behind or one ahead makes all the difference. We call
 these capturing races in English for a reason; being ahead or behind
 by one liberty matters a great deal. To make life interesting, there
 are loose ladder constructs where an extra liberty does not help the
 fleeing stones; they still get corraled and captured.
 
  These corner cases are tough, but many games hinge on correctly
 reading out the exact consequences of life-and-death struggles.
 
  Terry McIntyre [EMAIL PROTECTED]
 
 
  Go is very hard. The more I learn about it, the less I know. -Jie
  Li, 9 dan
 
   On Mon, 2008-09-22 at 13:59 +0200, Magnus Persson wrote:
Quoting Mark Boon :
   
 Playing out that fake ladder in the first game meant an instant
 loss.
 Surprising. And embarassing. Any information on the number of
 processors used?
 
 
 
  ___
  computer-go mailing list
  computer-go@computer-go.org
  http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread Jason House
On Sep 22, 2008, at 7:59 AM, Magnus Persson [EMAIL PROTECTED]  
wrote:


In the case of the ladders the heavy playouts of Valkyria correctly  
prunes playing out ladders for the loser. But sometimes in the  
playouts the ladder is broken and after that there is a chance that  
the stones escape anyway. This means that almost always when the  
escaping move is played it is a good move! Thus AMAF will assign a  
very good score to this move


My solutions to this was simply to turn off AMAF-eval for all shapes  
commonly misevaluated for ladders. But I think this problem is true  
for many shapes
in general. What makes ladders special is that the problem repeats  
it self and the effect get stronger and thus even more likely the  
larger the ladder gets.


I think a better solution would be to modify AMAF in some way to  
avoid these problems, or perhaps change the playouts in a way to  
balance the problem. Does anyone know something to do about it or  
have any ideas?


My RAVE formulation includes a per-move parameter for RAVE confidence.  
This allows heuristics to fix situations like above. Sadly, my bot  
isn't mature enough to take advantage yet :(


The concept I used for the derivation is simple. I treat everything as  
gaussian estimators. It's easy to find the max of the distribution. I  
then use the same trick as bayeselo to estimate variance. I then add a  
Gaussian noise term to represent RAVE bias.








The results of the math are most easilly expressed in terms of inverse  
variance (iv=1/variance)


Combined mean = sum( mean * iv )
Combined iv = sum( iv )

I'll try to do a real write-up if anyone is interested.
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread David Doshay
It was 800, just like last time, but the networking had been upgraded  
from ethernet to infiniband. Olivier said that this should have been a  
good improvement because he felt that communication overhead was  
significant.


Cheers,
David



On 22, Sep 2008, at 6:06 AM, terry mcintyre wrote:

Consider this as tentative, since I heard it about 3rd-hand, but I  
believe the number of processors used to have been 3000.


___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-22 Thread Hideki Kato

David Doshay: [EMAIL PROTECTED]:
It was 800, just like last time, but the networking had been upgraded  
from ethernet to infiniband. Olivier said that this should have been a  
good improvement because he felt that communication overhead was  
significant.

Really previous Huygens used Ethernet?  It's hard to believe...

Hideki

Cheers,
David



On 22, Sep 2008, at 6:06 AM, terry mcintyre wrote:

 Consider this as tentative, since I heard it about 3rd-hand, but I  
 believe the number of processors used to have been 3000.

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
--
[EMAIL PROTECTED] (Kato)
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] MoGo v.s. Kim rematch

2008-09-21 Thread mingwu
Anyone knows the result, or better the game sgf?


On Sat, Sep 6, 2008 at 6:57 AM, Don Dailey [EMAIL PROTECTED] wrote:

 Great news!   Look forward to seeing it happen.  I hope Mogo has some
 great hardware.

 - Don


 On Fri, 2008-09-05 at 15:54 -0700, David Doshay wrote:
  MoGo and Myungwan Kim will hold an exhibition rematch at the Cotsen
  Open on Saturday September 20. The exhibition will start at about 5pm
  Pacific Daylight time.
 
  As probably known by all on this list, MoGo won the last game, held at
  the US Go Congress in Portland Oregon, when it was given a 9 stone
  handicap and played with a one hour time limit.
 
  At this time the expected handicap will be 7, and it is not clear if
  there will be one game or two. It is not known at this time how many
  cores MoGo will be running on. Mr Kim has asked for MoGo to be given
  90 minutes because he saw how much the increase in time from 15
  minutes to one hour increased its playing strength. Mr. Kim has also
  asked that there be only one or at most 2 blitz games at the start.
 
  The MoGo team also wants to have some 9x9 games, but Mr Kim does not
  feel familiar enough with 9x9 to play those games, but he is searching
  for an alternate strong player. MoGo has some new features for 9x9,
  and the team is anxious to see the newest code in action.
 
 
 
  Cheers,
  David
 
 
 
  ___
  computer-go mailing list
  computer-go@computer-go.org
  http://www.computer-go.org/mailman/listinfo/computer-go/

 ___
 computer-go mailing list
 computer-go@computer-go.org
 http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MoGo v.s. Kim rematch

2008-09-21 Thread dhillismail
There is a discussion at godiscussions
http://www.godiscussions.com/forum/showthread.php?t=7154
?The sgf's for the two games played are on page 2

Dave Hillis

-Original Message-
From: mingwu [EMAIL PROTECTED]
To: [EMAIL PROTECTED]; computer-go computer-go@computer-go.org
Sent: Sun, 21 Sep 2008 9:22 pm
Subject: Re: [computer-go] MoGo v.s. Kim rematch



Anyone knows the result, or better the game sgf?



On Sat, Sep 6, 2008 at 6:57 AM, Don Dailey [EMAIL PROTECTED] wrote:

Great news! ? Look forward to seeing it happen. ?I hope Mogo has some
great hardware.

- Don






On Fri, 2008-09-05 at 15:54 -0700, David Doshay wrote:
 MoGo and Myungwan Kim will hold an exhibition rematch at the Cotsen
 Open on Saturday September 20. The exhibition will start at about 5pm
 Pacific Daylight time.

 As probably known by all on this list, MoGo won the last game, held at
 the US Go Congress in Portland Oregon, when it was given a 9 stone
 handicap and played with a one hour time limit.

 At this time the expected handicap will be 7, and it is not clear if
 there will be one game or two. It is not known at this time how many
 cores MoGo will be running on. Mr Kim has asked for MoGo to be given
 90 minutes because he saw how much the increase in time from 15
 minutes to one hour increased its playing strength. Mr. Kim has also
 asked that there be only one or at most 2 blitz games at the start.

 The MoGo team also wants to have some 9x9 games, but Mr Kim does not
 feel familiar enough with 9x9 to play those games, but he is searching
 for an alternate strong player. MoGo has some new features for 9x9,
 and the team is anxious to see the newest code in action.



 Cheers,
 David



 ___
 computer-go mailing list
 computer-go@computer-go.org
 http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/









___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/

Re: [computer-go] MoGo v.s. Kim rematch

2008-09-05 Thread Don Dailey
Great news!   Look forward to seeing it happen.  I hope Mogo has some
great hardware.

- Don


On Fri, 2008-09-05 at 15:54 -0700, David Doshay wrote:
 MoGo and Myungwan Kim will hold an exhibition rematch at the Cotsen  
 Open on Saturday September 20. The exhibition will start at about 5pm  
 Pacific Daylight time.
 
 As probably known by all on this list, MoGo won the last game, held at  
 the US Go Congress in Portland Oregon, when it was given a 9 stone  
 handicap and played with a one hour time limit.
 
 At this time the expected handicap will be 7, and it is not clear if  
 there will be one game or two. It is not known at this time how many  
 cores MoGo will be running on. Mr Kim has asked for MoGo to be given  
 90 minutes because he saw how much the increase in time from 15  
 minutes to one hour increased its playing strength. Mr. Kim has also  
 asked that there be only one or at most 2 blitz games at the start.
 
 The MoGo team also wants to have some 9x9 games, but Mr Kim does not  
 feel familiar enough with 9x9 to play those games, but he is searching  
 for an alternate strong player. MoGo has some new features for 9x9,  
 and the team is anxious to see the newest code in action.
 
 
 
 Cheers,
 David
 
 
 
 ___
 computer-go mailing list
 computer-go@computer-go.org
 http://www.computer-go.org/mailman/listinfo/computer-go/

___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/