Re: [Computer-go] Accelerating Self-Play Learning in Go

2019-03-03 Thread Álvaro Begué
Since before AlphaGo was announced, I have thought the way forward was to generate games that are played out to the bitter end while maximizing score, and then to use the final ownership of each point as a prediction target. I am very glad that someone has had the time to put this idea (and many others!) into practice.
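
To make "final ownership" concrete, here is a minimal sketch of how such a training target could be derived from a finished game. It is only an illustration, not code from the paper or from any engine. It assumes area-style scoring, assumes the game really was played out so that all dead stones have already been captured, and uses a hypothetical board encoding (+1 black, -1 white, 0 empty). The name `final_ownership` is made up for this example.

```python
from collections import deque

# Hypothetical encoding for this sketch: +1 black, -1 white, 0 empty.
BLACK, WHITE, EMPTY = 1, -1, 0


def final_ownership(board):
    """Return a per-point ownership map for a finished position.

    Assumes dead stones were already captured. Stones own their own
    points; an empty region belongs to a color only if that color is
    the sole one bordering it, otherwise it is neutral (0).
    """
    rows, cols = len(board), len(board[0])
    owner = [row[:] for row in board]
    seen = [[False] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if board[r][c] != EMPTY or seen[r][c]:
                continue
            # Flood-fill one empty region, recording the bordering colors.
            region, borders = [], set()
            queue = deque([(r, c)])
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                region.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if not (0 <= ny < rows and 0 <= nx < cols):
                        continue
                    if board[ny][nx] == EMPTY:
                        if not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                    else:
                        borders.add(board[ny][nx])
            value = borders.pop() if len(borders) == 1 else 0
            for y, x in region:
                owner[y][x] = value
    return owner


if __name__ == "__main__":
    tiny = [
        [EMPTY, BLACK, WHITE, EMPTY],
        [BLACK, BLACK, WHITE, WHITE],
    ]
    for row in final_ownership(tiny):
        print(row)
```

A network trained this way would get one such map per game as an extra target alongside the win/loss result, and summing the map gives the area score difference before komi.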

[Computer-go] Accelerating Self-Play Learning in Go

2019-03-03 Thread David Wu
For anyone on this list who is interested but doesn't follow the Leela Zero discussion or reddit threads: I recently released a paper on ways to improve the efficiency of AlphaZero-like learning in Go. Several of the ideas I tried deviate a little from "pure zero" (e.g. ladder detection, predicting board