43 results for “topic:advantage-actor-critic”
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Scalable, event-driven, deep-learning-friendly backtesting library
A PyTorch library for building deep reinforcement learning agents.
PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.
PyTorch C++ Reinforcement Learning
Code for Hands On Intelligent Agents with OpenAI Gym book to get started and learn to build deep reinforcement learning agents using PyTorch
Tutorials for reinforcement learning in PyTorch and Gym by implementing a few of the popular algorithms. [IN PROGRESS]
Code accompanying the blog post "Deep Reinforcement Learning with TensorFlow 2.1"
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
Curiosity-driven Exploration by Self-supervised Prediction
Reinforcing Your Learning of Reinforcement Learning
A well-documented A2C written in PyTorch
[ICRA 2023] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot
Deep Reinforcement Learning in Autonomous Driving: the A3C algorithm used to make a car learn to drive in TORCS; Python 3.5, Tensorflow, tensorboard, numpy, gym-torcs, ubuntu, latex
The friendly robot that beats you in Yahtzee 🤖 🎲
MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay(HER) bitflip-DQN example. +prioritized replay.
Official implementation of the AAAI 2021 paper Deep Bayesian Quadrature Policy Optimization.
Deep reinforcement learning package for torch7
No description provided.
🌊 Implement advanced algorithms for USV path planning using reinforcement and imitation learning, ensuring efficient and safe navigation in complex environments.
It's a Raspberry Pi Pokémon that gamifies WiFi Hacking by learning from its surrounding WiFi environment utilising deep Reinforcement Learning.
Implementations of deep reinforcement learning algorithms.
Solving CartPole-v1 environment in Keras with Advantage Actor Critic (A2C) algorithm an Deep Reinforcement Learning algorithm
No description provided.
This repository contains the implementation of the K-workers, n-step Advantage Actor-Critic (A2C) algorithm applied to the CartPole environment, as part of a reinforcement learning project for the EPFL Spring Semester 2024 course on Artificial Neural Networks and Reinforcement Learning.
PPO and A2C implementations for a noisy blackbox environment challenge
The pytorch implemetation of a2c