409 results for “topic:markov-decision-processes”
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
A C++ framework for MDPs and POMDPs with Python bindings
Curso de Álgebra Lineal
Extensible Combinatorial Optimization Learning Environments
A JuMP extension for Stochastic Dual Dynamic Programming
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
An Automata Learning Library Written in Python
R.L. methods and techniques.
Coding Demos from the School of AI's Move37 Course
🌲 Stanford CS 228 - Probabilistic Graphical Models
A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.
Implementation of value iteration algorithm for calculating an optimal MDP policy
WrightEagle Base Code for RoboCup Soccer Simulation 2D
Reinforcement Learning Short Course
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Online algorithms for solving large-scale dynamic vehicle routing problems with stochastic requests
Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.
Reinforcement Learning in JavaScript
AWS Last Mile Route Sequence Optimization
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in MCTS field.
🐍 AI that learns to play Snake using Q-Learning (Reinforcement Learning)
A curated list of online resources for probabilistic planning: papers, software and research groups around the world!
Implementation of Tsallis Actor Critic method
Rich literature review and discussion on the implementation of "Hierarchical Decision-Making for Autonomous Driving"
My solutions to Yandex Practical Reinforcement Learning course in PyTorch and Tensorflow
A Reinforcement Learning Library for C++11/14
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations