[computer-go] A nearest-neighbor heuristic

Peter Drake Wed, 07 Mar 2007 21:38:59 -0800

First, a general hypothesis on heuristics: one should apply heuristics to the first few moves beyond the fringe of the UCT tree, and not later. It's important that these early moves be good, but it's not worth the time to make later moves good. Thoughts? Is anyone already using this idea?
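
To make the hypothesis concrete, here is a minimal Python sketch of a playout move chooser that consults a heuristic only for the first few moves past the fringe and falls back to uniform random play afterwards. The names choose_playout_move, depth_past_fringe, and heuristic_depth are hypothetical, and the cutoff of 3 is an arbitrary placeholder, not a tuned value.

```python
import random

def choose_playout_move(legal_moves, depth_past_fringe, heuristic,
                        heuristic_depth=3, rng=random):
    """Pick a playout move for a position beyond the UCT tree fringe.

    legal_moves: non-empty list of legal moves (representation is assumed).
    depth_past_fringe: 0 for the first move after leaving the tree.
    heuristic: callable taking legal_moves and returning a suggested move.
    """
    if depth_past_fringe < heuristic_depth:
        move = heuristic(legal_moves)
        # Only trust the suggestion if it is actually legal here.
        if move in legal_moves:
            return move
    # Later moves (or an unusable suggestion): cheap uniform random choice.
    return rng.choice(legal_moves)
```

The heuristic cost is paid at most heuristic_depth times per playout, however long the playout runs.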

Now, a specific heuristic I'd like to try. If anyone can point out anything horribly wrong with it before I go to the trouble to implement it, that'd be nice. :-)

Maintain a set of if-then rules, perhaps 1000 of them. Each rule consists of a board configuration and a suggested move. (Initially, they're all identically [<empty board> => E5] for 9x9.) As the game progresses, this population of rules will change.

When it's time to heuristically choose a move, compare the current board configuration against the "if" part of each rule. Play the move from the closest match (nearest neighbor). There's room for creativity in the definition of nearness, but something like Hamming distance might suffice.
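
The lookup could be sketched roughly as follows in Python. The board encoding (a flat tuple of 81 cells, 0 = empty, 1 = black, 2 = white) and the names Rule, hamming, and suggest_move are assumptions made for illustration only.

```python
from typing import NamedTuple, Tuple

SIZE = 9
EMPTY_BOARD = (0,) * (SIZE * SIZE)  # assumed encoding: 0 empty, 1 black, 2 white
E5 = 4 * SIZE + 4                   # index of the center point on 9x9

class Rule(NamedTuple):
    board: Tuple[int, ...]  # the "if" part: a board configuration
    move: int               # the "then" part: index of the suggested point

def hamming(a, b):
    """Number of points at which two boards differ."""
    return sum(x != y for x, y in zip(a, b))

def suggest_move(rules, board):
    """Return the move of the rule whose board is nearest to `board`."""
    nearest = min(rules, key=lambda r: hamming(r.board, board))
    return nearest.move

# Initial population: 1000 copies of [<empty board> => E5].
rules = [Rule(EMPTY_BOARD, E5)] * 1000
```

This scans all rules linearly, so each lookup costs about 1000 x 81 comparisons. The sketch also does not check that the suggested move is legal on the current board; a caller would need a fallback for that case.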

The population of rules is updated during the game. We might do this, for example, whenever a move becomes the best move from its UCT node. (Note that I'm using "best" here to mean "most likely to win" and not "highest UCT value".) When this happens, ask the population what it would do given this board configuration. If the answer is the move in question, do nothing. Otherwise, overwrite the oldest rule with a new rule suggesting this move for this configuration.
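
A rough Python sketch of this update step, under the assumption that rules are (board, move) pairs held in a bounded collections.deque, so that appending to a full population discards the oldest rule. The function names are hypothetical, and the Hamming-distance lookup is repeated here so the example stands alone.

```python
from collections import deque

def hamming(a, b):
    """Number of points at which two boards differ."""
    return sum(x != y for x, y in zip(a, b))

def suggest_move(rules, board):
    """Move of the nearest rule; ties go to the oldest matching rule."""
    return min(rules, key=lambda rule: hamming(rule[0], board))[1]

def update_rules(rules, board, best_move):
    """Call when best_move becomes the best move from its UCT node.

    Returns True if a new rule replaced the oldest one, and False if the
    population already suggests best_move for this board.
    """
    if suggest_move(rules, board) == best_move:
        return False
    # A deque with maxlen drops its leftmost (oldest) entry on append.
    rules.append((board, best_move))
    return True

# Tiny population for illustration; the real one might hold ~1000 rules.
empty = (0,) * 81
rules = deque([(empty, 40)] * 4, maxlen=4)
```

Because the population has a fixed size, rules learned early in the game age out as newer positions are recorded.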

My hope is that this heuristic will suggest the move that has been most effective on similar boards.


Peter Drake
http://www.lclark.edu/~drake/



_______________________________________________
computer-go mailing list
computer-go@computer-go.org
http://www.computer-go.org/mailman/listinfo/computer-go/
