Hello Shangtong, great to see you back!
> It has been two years since last time I coded in mlpack for mean shift and > CNN. > Now I’m a MSc student in University of Alberta supervised by Prof Richard > Sutton. My primary interest is reinforcement learning. I wrote python code for > the book Reinforcement Learning: An Introduction(2nd Edition). However I don’t > have much experience with DeepRL, so this project interests me. You did some great work there! I think I'll go and add the book and repository as another reference to the project idea, might be helpful to dive into the topic. > To warm up, I proposed a framework for Q-Learning with an implementation of > DQN > with experience replay and target network. The pull request is here I test it > in Mountain Car task. That is a really nice PR, I'll take a closer look once I get the chance, and we can discuss details over there. Thanks, Marcus > On 13 Mar 2017, at 16:14, Shangtong Zhang <[email protected]> > wrote: > > Hello everyone, > > I’m Shangtong Zhang. It has been two years since last time I coded in mlpack > for mean shift and CNN. Now I’m a MSc student in University of Alberta > supervised by Prof Richard Sutton. My primary interest is reinforcement > learning. I wrote python code > <https://github.com/ShangtongZhang/reinforcement-learning-an-introduction> > for the book Reinforcement Learning: An Introduction(2nd Edition). However I > don’t have much experience with DeepRL, so this project interests me. > > To warm up, I proposed a framework for Q-Learning with an implementation of > DQN with experience replay and target network. The pull request is here > <https://github.com/mlpack/mlpack/pull/934> I test it in Mountain Car task. > The PR is just to show my design, it’s not ready to merge. I think we need to > support batch update for our network component to make deep RL more > efficient. Looking forward to any feedback. > > Thanks, > > Shangtong Zhang, > First year graduate student, > Department of Computing Science, > University of Alberta > Github <https://github.com/ShangtongZhang> | Stackoverflow > <http://stackoverflow.com/users/3650053/slardar-zhang> > _______________________________________________ > mlpack mailing list > [email protected] > http://knife.lugatgt.org/cgi-bin/mailman/listinfo/mlpack
_______________________________________________ mlpack mailing list [email protected] http://knife.lugatgt.org/cgi-bin/mailman/listinfo/mlpack
