https://github.com/jjkke88/RL_toolbox
Python · 43
7 years ago
Reinforcement learning toolbox containing TRPO and A3C algorithms for continuous action spaces
MIT License