Re: [computer-go] Re: Depth-first UCT

2008-08-12 Thread Stuart A. Yeates
On Wed, Aug 13, 2008 at 1:29 PM, Hideki Kato [EMAIL PROTECTED] wrote:
 RĂ©mi Coulom: [EMAIL PROTECTED]:
Gian-Carlo Pascutto wrote:

 The *paper* about MTD(f) is extremely interesting because it shows
 that many best-first algorithms can be rewritten as depth-first
 algorithms.

 It happened for SSS, it happened for proof-number search.

 Who will make it happen for UCT?



Actually, there was a paper presented at the 2007 Computer-Game Workshop
in Japan entitled Depth-First UCT and its Application to Go, by
Haruhiro Yoshimoto, Akihiro Kishimoto, Tomoyuki Kaneko, and Kazuki Yoshizoe.

 Abstract in English:
 The UCT algorithm is a representative best-first search algorithm that
 has been popularly used in Monte Carlo Go.  This paper presents the
 DFUCT (Depth-First UCT) algorithm, which is an efficient depth-first
 variant of UCT.  Experimental results in Go show that DFUCT achieves
 3% improvement in running time, while the solving abilities of DFUCT
 and UCT are comparable.

 This paper is written in Japanese.

Thank you for the translation.

cheers
stuart
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] Re: Depth-first UCT

2008-08-12 Thread Zach Wegner
Interesting. Could you (or someone else) explain how DFUCT works? I'd
imagine it doesn't save all the nodes in memory, but that seems rather
counterintuitive.
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] Re: Depth-first UCT

2008-08-12 Thread Hideki Kato
Zach Wegner: [EMAIL PROTECTED]:
Interesting. Could you (or someone else) explain how DFUCT works? I'd
imagine it doesn't save all the nodes in memory, but that seems rather
counterintuitive.

As far as I understand,

The basic idea is that DFUCT continues searching avoiding going back 
to root node if possible.  To do this, DFUCT saves the number of wins, 
the number of visits and the sum of number of visits of the brothers 
of two nodes at root those ucb values are the maximum (first) and next 
to the maximum (second), in a stack.

The authors claimed in the paper that although DFUCT does not behave 
the same as UCT when the third node at root comes up to the second, 
it's so rare that can be ignored.

Hideki
--
[EMAIL PROTECTED] (Kato)
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/


Re: [computer-go] Re: Depth-first UCT

2008-08-12 Thread Hideki Kato
Oops.

Hideki Kato: [EMAIL PROTECTED]:
Zach Wegner: [EMAIL PROTECTED]:
Interesting. Could you (or someone else) explain how DFUCT works? I'd
imagine it doesn't save all the nodes in memory, but that seems rather
counterintuitive.

As far as I understand,

The basic idea is that DFUCT continues searching avoiding going back 
to root node if possible.  To do this, DFUCT saves the number of wins, 
the number of visits and the sum of number of visits of the brothers 
of two nodes at root those ucb values are the maximum (first) and next 
two nodes should be two arcs (moves).

to the maximum (second), in a stack.

The authors claimed in the paper that although DFUCT does not behave 
the same as UCT when the third node at root comes up to the second, 
third node should be third arc.

it's so rare that can be ignored.

Sorry for mistakes.

Hideki
--
[EMAIL PROTECTED] (Kato)
___
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/