Dear Marcus,

I am Rohan Raj,aka luffy1996 on github and IRC channel. I am currently
drafting my gsoc proposal for mlpack in reinforcement learning project . I
would like to know your views on this proposal.

I am planning to go ahead with PPO  , ACER and  ACKTR  implementation on
mlpack for GSOC 2018 . I would also like to mention that I am planning to
give 3 - 3.5 weeks for each of these algorithms. This makes my tentative
schedule as
Week 1 : Implementation of continuous action space games for Actor critic
Algorithms.
Week 2 - 4 : ACER
Week 5 - 7 : A2C(synchronous A3C) and PPO
Week 8 - 10 : ACKTR
Week 11 - 12 : Bug fixing and final submission.

It would be grate if you can provide your input to this tentative schedule.
I would be happy to add changes based on your reviews and suggestions.

PPO : (https://arxiv.org/abs/1707.06347)
ACER : (https://arxiv.org/abs/1611.01224)
ACKTR : (https://arxiv.org/abs/1708.05144)

Rohan Raj
Indian Institute of Technology Guwahati
Assam , India
Phone : +91 8723990557

ᐧ
_______________________________________________
mlpack mailing list
mlpack@lists.mlpack.org
http://knife.lugatgt.org/cgi-bin/mailman/listinfo/mlpack

Reply via email to