Congratulations, people at DeepMind :-)

I like the fact that AlphaGo uses many forms of learning (as humans do!):
- imitation learning (on expert games, learning an actor policy);
- learning by playing (self-play, policy gradient), incidentally generating games;
- use of those games for teaching a second deep network (supervised learning);
- real-time learning with Monte Carlo simulations (including RAVE?).

==> just beautiful :-)

2016-01-27 21:18 GMT+01:00 Yamato <[email protected]>:

> Congratulations Aja.
>
> Do you have a plan to run AlphaGo on KGS?
>
> It must be a 9d!
>
> Yamato
> _______________________________________________
> Computer-go mailing list
> [email protected]
> http://computer-go.org/mailman/listinfo/computer-go
>

--
=========================================================
Olivier Teytaud, [email protected], TAO, LRI, UMR 8623(CNRS - Univ. Paris-Sud),
bat 490 Univ. Paris-Sud F-91405 Orsay Cedex France
http://www.slideshare.net/teytaud
