https://github.com/jjkke88/RL_toolbox
Python · 43
6 years ago
Reinforcement learning toolbox; contains TRPO and A3C algorithms for continuous action spaces
MIT License