"topic:q-learning-algorithm" — Search

167 results for “topic:q-learning-algorithm”

Tabular methods for reinforcement learning

algorithm cliffwalking gridworld gridworld-cliff gridworld-environment policy-evaluation policy-iteration q-learning q-learning-algorithm q-learning-vs-sarsa reinforcement-learning reinforcement-learning-agent reinforcement-learning-algorithms sarsa sarsa-algorithm sarsa-learning tabular-environments tabular-methods tabular-q-learning value-iteration

BY571/Normalized-Advantage-Function-NAF-

PyTorch implementation of the Q-Learning Algorithm Normalized Advantage Function for continuous control problems + PER and N-step Method

Jupyter Notebook · 2813 · Updated 7 months ago

continuous-control ddpg-algorithm dqn n-step-bootstrapping naf normalized-advantage-functions prioritized-experience-replay q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms

TimKoornstra/automatic-piano-fingering

This repository contains the code for automatically generating piano fingerings using a reinforcement learning agent that uses Q-Learning.

Python · 163 · Updated 4 months ago

fingering piano piano-fingering python q-learning q-learning-algorithm reinforcement-learning

PoCInnovation/Open-Zero

Open-zero is a research project that aims to reproduce various projects from DeepMind.

Python · 140 · Updated 2 weeks ago

a3c a3c-algorithm chess-ai deepmind gym-environment pytorch q-learning-algorithm reinforcement-learning

akshayratnawat/ReachingTargetLocation_ReinforcementLearning_Webots

The objective is to teach a robot to find and reach the target object in the minimum number of steps, taking the shortest path and avoiding obstacles such as humans and walls, using reinforcement learning algorithms.

Python · 121 · Updated 1 year ago

ddpg-algorithm deep-reinforcement-learning deepq-learning deterministic-policy-gradients epuck-robot policy-gradient q-learning-algorithm reinforcement-learning robotics webots

bahadiraraz/paper-game

Turn based strategy game with AI

Python · 120 · Updated 1 year ago

game keras-tensorflow pygame python q-learning-algorithm

paulinamoskwa/q-learning-gridworld

Implementation of Q-learning to solve GridWorld

Python · 91 · Updated 3 months ago

from-scratch gridworld pygame q-learning q-learning-algorithm reinforcement-learning rl

ftmoztl/car-parking-with-reinforcement-learning

Q-learning application to find an optimal parking slot

Jupyter Notebook · 81 · Updated 1 year ago

agent-based-modeling car-parking dynamic-programming hyperparameter-tuning optimization pillow python-3 q-learning-algorithm reinforcement-learning

naseridev/notch

Q-Learning Based Pathfinding in Dynamic Grid Environments

Rust · 80 · Updated 1 month ago

q-learning-algorithm rust

brendadenisse16/Advanced-Reinforcement-Learning-for-Financial-Option-Pricing

Implementation of Q-Learning, Double Q-Learning, and LSPI for pricing American options under the Black-Scholes model

60 · Updated 5 days ago

q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms

MauroLuzzatto/Q-Learning-Demo-Play-nChain

This repository contains a Jupyter Notebook with an implementation of a Q-Learning agent that learns to solve the n-Chain OpenAI Gym environment.

Jupyter Notebook · 52 · Updated 4 months ago

demo gym jupyter-notebook openai-gym python q-learning q-learning-algorithm reinforcement-learning

CristianCosci/Reinforcement_Learning_Mouse_vs_Cat

Two intelligent agents (cat and mouse) compete with each other to achieve their goal. Agents are trained through reinforcement learning (Q-learning).

Python · 50 · Updated 2 years ago

artificial-intelligence pygame q-learning-algorithm reinforcement-learning reinforcement-learning-agent reinforcement-learning-environments

AntoniovanDijck/BlackJackRL

Deep Q Learning blackbox strategies for casino games

Jupyter Notebook · 51 · Updated 3 months ago

blackjack deep-learning deep-neural-networks deep-q-network deep-reinforcement-learning machine-learning mlx muzero q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms rlx tensorflow torch

imm-rl-lab/q-learning_with_bounded_naf

The implementation for the paper Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis // NeurIPS 2022

Python · 51 · Updated 7 months ago

naf optimal-control q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments

viniciusenari/Q-Learning-and-SARSA-Mountain-Car-v0

Demonstration of Q-Learning and SARSA algorithms utilizing Python and OpenAI GYM

Python · 53 · Updated 3 months ago

machine-learning python q-learning q-learning-algorithm q-learning-vs-sarsa reinforcement-learning sarsa-algorithm sarsa-learning

StepanTita/q-learning

A Python-based platformer infused with Q-Learning and dynamic level creation from simple JSON files.

Python · 50 · Updated 1 year ago

epsilon-greedy game-ai machine-learning machine-learning-algorithms platformer-game python q-learning q-learning-algorithm reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments reinforcement-learning-playground

cathydou/reflection-agent-maze

A reinforcement learning agent with reflection capabilities for dynamic maze navigation. Implements dual memory system, real-time adaptation, and environment change detection. Open source with research papers and documentation.

Python · 52 · Updated 4 months ago

ai dynamic-environments machine-learning maze-navigation meta-learning q-learning q-learning-algorithm reflection-agent reinforcement-learning

Shyamyar/grid-path-qlearning

A docking robot in a grid environment, trained with Q-learning

Python · 40 · Updated 10 months ago

pathfinding q-learning-algorithm reinforcement-learning

Mhijazi16/Game-Optimization

🕹️ Welcome to Game-Optimization, a repository dedicated to exploring and implementing various optimization algorithms to solve complex games. This project initially focuses on solving the classic game Sokoban using the Q-learning algorithm, with plans to extend to genetic algorithms and other optimization techniques in the future.

C++ · 40 · Updated 1 year ago

cpp games genetic-algorithm optimization-algorithms q-learning q-learning-algorithm sokoban-game

Samthesimpsons/Project-Reinforcement-Learning-Wordle-Solver

SUTD 50.021 Artificial Intelligence Project - Wordle Solver using Reinforcement Learning

Python · 42 · Updated 2 months ago

agglomerative-clustering levenshtein-distance python q-learning-algorithm reinforcement-learning-algorithms wordle-solver

LauraKarimova/Big_Data_Research_Project

The 3D bin packing problem is a combinatorial optimization problem that involves fitting a given set of items of various sizes into a container of a specific size such that the total volume of the items is as close to the volume of the container as possible.

Jupyter Notebook · 30 · Updated 3 months ago

deep-q-network model-free-rl q-learning-algorithm reinforcement-learning

nilskruse/mdp

Markov decision process master thesis

Rust · 30 · Updated 2 years ago

markov-decision-processes mdp q-learning q-learning-algorithm q-learning-lambda reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-environments rust sarsa sarsa-lambda

phamduyaaaa/Play-All-ToyText-with-Q-Learning

Q-Learning applied to Gymnasium's Toy Text environments: FrozenLake, CliffWalking, BlackJack, and Taxi.

Python · 30 · Updated 4 months ago

gymnasium q-learning-algorithm

lx10077/AveQLearning

Code for the AISTATS 2023 paper, A Statistical Analysis of Polyak-Ruppert Averaged Q-learning.

Python · 31 · Updated 9 months ago

confidence-intervals q-learning-algorithm

ChaitanyaC22/Numerical_TicTacToe_Agent_using_Reinforcement_Learning

Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.

Jupyter Notebook · 31 · Updated 1 year ago

actions convergence episodes epsilon-decay epsilon-greedy hyperparameter-tuning learning-rate markov-decision-process mdp-framework model-building policy q-learning q-learning-algorithm q-value q-value-iteration reinforcement-learning rewards rl states

sahandkhoshdel99/Intelligent-Systems-ML-

No description provided.

Jupyter Notebook · 30 · Updated 1 year ago

affinity-propagation armijo-backtrack bias-variance-tradeoff bootsrapping clustering-algorithms decision-trees fuzzy-systems genetic-algorithm gradient-descent k-means-clustering knn-classifier mlp-classifier naive-bayes-classifier neural-networks optimzation q-learning-algorithm random-forest-classifier simulated-annealing stochastic-gradient-descent svm-classifier

officialarijit/DQLFS

Dynamic Q-Learning Based Feature Selection approach

MATLAB · 20 · Updated 1 year ago

machine-learning q-learning q-learning-algorithm

ghazaleh-mahmoodi/Neural_Networks

This repository contains Python implementations of various networks and methods, such as MLP, Hopfield, Kohonen, ART, LVQ1, genetic algorithms, AdaBoost, fuzzy systems, and CNN.

Jupyter Notebook · 20 · Updated 3 years ago

adaboost art cnn convolutional-neural-networks frozenlake fully-connected-deep-neural-network fuzzy-logic genetic-algorithm hopfield-network keras kohonen-map lvq mlp mnist mountaincar-coninuous neural-networks python q-learning-algorithm

showman-sharma/taxi-v3-learning

In this project, we tried two learning algorithms for hierarchical RL on the Taxi-v3 environment from OpenAI Gym, SMDP Q-Learning and Intra-Option Q-Learning, and contrasted them with two other methods that involve hardcoding based on human understanding. We conclude that the solutions learned by the machine are far superior to the human-designed ones for this problem. Intra-Option Q-Learning outperforms SMDP Q-Learning because it makes better use of the SARS samples (similar to experience replay). Our algorithms even outperform the Hardcoded Agent. We also demonstrated the strong effect of state compression on model performance.

Jupyter Notebook · 20 · Updated 1 year ago

machine-learning openai-gym opengy q-learning-algorithm reinforcement-learning semi-markov-decision-process taxi-v3

hussein-shamy/Bachelor-Thesis

A collaborative repository for our Bachelor's thesis, focused on optimizing the Cell Outage Compensation (COC) algorithm in Self-Organizing Networks (SONs). Leveraging AI-Hardware Acceleration, the project aims to bolster 5G network reliability, particularly for emerging technologies like autonomous driving.

TeX · 20 · Updated 1 year ago

5g ai-accelerators fpga q-learning-algorithm self-organizing-network thesis
