How about you read the paper first? The conversation would make much more
sense if you actually spent some time trying to understand the details of
what they did. :) <-- (mandatory smiley to indicate I am not upset or
anything)
On Sun, Jan 31, 2016 at 10:20 AM, Greg Schmidt
The articles I've read so far about AlphaGo mention both MCTS and
RL/Q-Learning. Since MCTS (and certainly UCT) keeps statistics on wins and
propagates that information up the tree, that in and of itself would seem to
constitute RL, so how does it make sense to have both? It seems redundant
On Sun, Jan 31, 2016 at 03:20:16PM +, Greg Schmidt wrote:
> The articles I've read so far about AlphaGo mention both MCTS and
> RL/Q-Learning. Since MCTS (and certainly UCT) keeps statistics on wins and
> propagates that information up the tree, that in and of itself would seem to
>