174 results for “topic:mdp”
Personal notes about scientific and research works on "Decision-Making for Autonomous Driving"
A simple framework for experimenting with Reinforcement Learning in Python.
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning (DRL) for Mobile Edge Computing (MEC) | This algorithm captures the dynamics of the MEC environment by integrating the Dueling Double Deep Q-Network (D3QN) model with Long Short-Term Memory (LSTM) networks.
A Modern Probabilistic Model Checker
(Experimental, a lot of bugs) Automatic fingering generator for piano scores, determining optimal fingering using Model-Based Reinforcement Learning, written in the Julia language.
Solving POMDP using Recurrent networks
Java Market Data Handler for CME Market Data (MDP 3.0)
Online Replanning in Belief Space for Partially Observable Task and Motion Problems
Modeling agents with probabilistic programs
A minimalist, low-latency, HFT CME MDP3.0 C++ market data feed handler and pcap file reader (MDP 3.0)
Agent Git: Agent Version Control, Open-Branching, and Reinforcement Learning MDP for Agentic AI. A Standalone Agentic AI Infrastructure Layer for LangGraph Ecosystems
Make it easy to specify simple MDPs that are compatible with the OpenAI Gym.
Hierarchical Online Planning and Reinforcement Learning on Taxi
Hands-on workshop for websphere MQ programming
Feature selection for maximizing expected cumulative reward
Using reinforcement learning and genetic algorithms to improve traffic flow and reduce vehicle waiting times in a single-lane two-way junction simulator by coordinating traffic signal schedules.
Minimal Policy Search Toolbox
a High-Performance Distributed Solver for Large-Scale Markov Decision Processes (MDP) relying on Inexact Policy Iteration; for Python and C++
Code for "Counterfactual Explanations in Sequential Decision Making Under Uncertainty", NeurIPS 2021
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Group 14's MDP project
CS7641 - Machine Learning - Assignment 4 - Markov Decision Processes
This repository contains the MATLAB code to devise an optimal policy for the motion of the robot given the obstacles and world boundaries. This file contains implementation to a specific environment wiht known parameters and obstacles, but can easily be modified or generalized for any environment. The code was linked to the V-Rep simulation environment and tested.
Imandra Modelling Language CME MDP Model
Probabilistic planning in continuous state-action MDPs in TensorFlow.
Pathfinding Using Reinforcement Learning
Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation
MDP-ProbLog is a framework to represent and solve (infinite-horizon) MDPs specified by probabilistic logic programming.
Hosts domain and instance RDDL files, covering problems from a wide range of disciplines, integration with the pyRDDLGym ecosystem.
In- and post- process methods for optimizing explanations path based on newly defined quantitative explanation metrics