@FoConrad @sxjscience I also implemented PPO using mxnet-gluon, but the performance is much worse than the OpenAi baseline. Is it really caused by the Adam optimizer in MXNet? Have you solved this problem?
[ Full content available at: https://github.com/apache/incubator-mxnet/issues/10563 ] This message was relayed via gitbox.apache.org for [email protected]
