67 results for “topic:stable-baselines”
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Stable-Baselines tutorial for Journées Nationales de la Recherche en Robotique 2019
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
Mirror of Stable-Baselines: a fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Colab notebooks part of the documentation of Stable Baselines reinforcement learning library
An open, minimalist Gymnasium environment for autonomous coordination in wireless mobile networks.
A collection of pre-trained RL agents using Stable Baselines3
Stable Baselines官方文档中文版
NFVdeep: Deep Reinforcement Learning for Online Orchestration of Service Function Chains
Pytorch Implementation of Policy Distillation for control, which has well-trained teachers via stable_baselines3.
A well-documented A2C written in PyTorch
MovieLens recommendation system using reinforcement learning (GYM + PPO)
RL Reach is a platform for running reproducible reinforcement learning experiments.
Mirror Descent Policy Optimization
A graphical interface for reinforcement learning and gym-based environments.
Representing robots as graphs for reinforcement-learning in PyBullet locomotion environments.
A trading bitcoin agent was created with deep reinforcement learning implementations.
Attention-based Curiosity-driven Exploration in Deep Reinforcement Learning
Reinforcement learning with Godot
RL training for quadruped robot(mit minicheetah) various gaits in different velocity based on MPC controller.
A highly-customizable OpenAI gym environment to train & evaluate RL agents trading stocks and crypto.
Distributed Online Service Coordination Using Deep Reinforcement Learning
Training in bursts for defending against adversarial policies
An open source robot reinforcement learing plantform using stable-baselines and OpenAI Gym
RL vs. 1/n and Mean-Variance in the Portfolio Allocation Problem. A Bachelor's thesis at Utrecht University.
The code release of "Real-time Active Vision for a Humanoid Soccer Robot Using Deep Reinforcement Learning" paper, ICAART 2021
This project combines the power of reinforcement learning with a custom raycasting game engine written in Golang, reminiscent of the classic Wolf3D game.
一些簡單的遊戲,用來驗證強化學習中使用動作遮罩的效果及影響