Python324Updated 5 months ago backpropagationcomputation-graphcomputational-graphsdirect-policy-searchfinite-differencegymilqgilqg-mujocoilqrmodel-basedmujocomujoco-dynamicsmujoco-pyopenai-gympolicy-gradientpolicy-gradientspolicy-optimizationpytorchreinforcement-learning