> ...if it is hard to have "the good starting point" such as a trained
> policy from human expert game records, what is a way to devise one.

My first thought was to look at the DeepMind research on learning to
play video games (which I think either pre-dates the AlphaGo research,
or was done in parallel with it):

  https://deepmind.com/research/dqn/

It just learns from trial and error, no expert game records:

  http://www.theverge.com/2016/6/9/11893002/google-ai-deepmind-atari-montezumas-revenge

Darren

-- 
Darren Cook, Software Researcher/Developer
My New Book: Practical Machine Learning with H2O:
  http://shop.oreilly.com/product/0636920053170.do
_______________________________________________
Computer-go mailing list
Computer-go@computer-go.org
http://computer-go.org/mailman/listinfo/computer-go