"topic:reinforcement-learning" — Search

15,630 results for “topic:reinforcement-learning”

List of Computer Science courses with video lectures.

algorithmsbioinformaticscomputational-biologycomputational-physicscomputer-architecturecomputer-sciencecomputer-visiondatabase-systemsdatabasesdeep-learningembedded-systemsmachine-learningquantum-computingreinforcement-learningroboticssecuritysystemsweb-development

labmlai/annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Python65.9k6.6kUpdated 2 hours ago

attentiondeep-learningdeep-learning-tutorialganliterate-programmingloramachine-learningneural-networksoptimizerspytorchreinforcement-learningtransformertransformers

unslothai/unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.

Python53.6k4.5kUpdated just now

agentdeepseekdeepseek-r1fine-tuninggemmagemma3gpt-ossllamallama3llmllmsmistralopenaiqwenqwen3reinforcement-learningtext-to-speechttsunslothvoice-cloning

ray-project/ray

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python41.7k7.3kUpdated just now

data-sciencedeep-learningdeploymentdistributedhyperparameter-optimizationhyperparameter-searchlarge-language-modelsllmllm-inferencellm-servingmachine-learningoptimizationparallelpythonpytorchrayreinforcement-learningrllibservingtensorflow

eugeneyan/applied-ml

📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.

28.7k3.8kUpdated 9 hours ago

applied-data-scienceapplied-machine-learningcomputer-visiondata-discoverydata-engineeringdata-qualitydata-sciencedeep-learningmachine-learningnatural-language-processingproductionrecsysreinforcement-learningsearch

d2l-ai/d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python28.4k5.0kUpdated 4 hours ago

bookcomputer-visiondata-sciencedeep-learninggaussian-processeshyperparameter-optimizationjaxkagglekerasmachine-learningmxnetnatural-language-processingnotebookpythonpytorchrecommender-systemreinforcement-learningtensorflow

sgl-project/sglang

SGLang is a high-performance serving framework for large language models and multimodal models.

Python24.2k4.7kUpdated just now

attentionblackwellcudadeepseekdiffusionglmgpt-ossinferencellamallmminimaxmoeqwenqwen-imagereinforcement-learningtransformervlmwan

Unity-Technologies/ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

C#19.2k4.4kUpdated just now

deep-learningdeep-reinforcement-learningmachine-learningneural-networksreinforcement-learningunityunity3d

AI4Finance-Foundation/FinGPT

FinGPT: Open-Source Financial Large Language Models! Revolutionize 🔥 We release the trained model on HuggingFace.

Jupyter Notebook18.8k2.6kUpdated 1 hour ago

chatgptfinancefingptfintechlarge-language-modelsmachine-learningnlpprompt-engineeringpytorchreinforcement-learningrobo-advisorsentiment-analysistechnical-analysis

tensorflow/tensor2tensorArchived

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python17.0k3.7kUpdated 6 hours ago

deep-learningmachine-learningmachine-translationreinforcement-learningtpu

datawhalechina/leedl-tutorial

《李宏毅深度学习教程》（李宏毅老师推荐👍，苹果书🍎），PDF下载地址：https://github.com/datawhalechina/leedl-tutorial/releases

Jupyter Notebook16.4k3.1kUpdated 4 hours ago

bertchatgptcnndeep-learningdiffusionganleedl-tutorialmachine-learningnetwork-compressionpruningreinforcement-learningrnnself-attentiontransfer-learningtransformertutorial

ddbourgin/numpy-ml

Machine learning, in numpy

Python16.3k3.8kUpdated 11 hours ago

attentionbayesian-inferencegaussian-mixture-modelsgaussian-processesgood-turing-smoothinggradient-boostinghidden-markov-modelsknnlstmmachine-learningmfccneural-networksreinforcement-learningresnettopic-modelingvaewavenetwgan-gpword2vec

microsoft/agent-lightning

The absolute trainer to light up AI agents.

Python15.4k1.3kUpdated 1 hour ago

agentagentic-aillmmlopsreinforcement-learning

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."

MATLAB14.9k1.4kUpdated 1 hour ago

artificial-intelligencebookcoursesreinforcement-learningtutorials

ShangtongZhang/reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

Python14.6k5.0kUpdated 1 hour ago

artificial-intelligencereinforcement-learning

bulletphysics/bullet3

Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.

C++14.3k3.0kUpdated 1 hour ago

computer-animationgame-developmentkinematicspybulletreinforcement-learningroboticssimulationsimulatorvirtual-reality

datawhalechina/easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook13.8k2.2kUpdated 2 hours ago

a3cddpgdeep-reinforcement-learningdouble-dqndqndueling-dqneasy-rlimitation-learningpolicy-gradientppoq-learningreinforcement-learningsarsatd3

owainlewis/awesome-artificial-intelligence

A curated list of Artificial Intelligence (AI) courses, books, video lectures and papers.

13.1k2.1kUpdated just now

aiartificial-intelligencedeep-learningintelligent-machinesintelligent-systemsmachine-intelligencemachine-learningneural-networkreinforcement-learningstatistical-learningunsupervised-learning

DLR-RM/stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python12.8k2.1kUpdated 3 hours ago

baselinesgsdegymmachine-learningopenaipythonpytorchreinforcement-learningreinforcement-learning-algorithmsroboticssb3sdestable-baselinestoolbox

kmario23/deep-learning-drizzle

Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!

HTML12.8k3.0kUpdated 5 hours ago

artificial-intelligence-algorithmsartificial-neural-networksbayesian-statisticscomputer-visiondeep-learningdeep-neural-networksdeep-reinforcement-learningexplainable-aigeometric-deep-learninggraph-neural-networksmachine-learningmedical-imagingnatural-language-processingoptimizationpattern-recognitionprobabilistic-graphical-modelsprobabilityreinforcement-learningspeech-recognitionvisual-recognition

Farama-Foundation/Gymnasium

An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)

Python11.5k1.3kUpdated 4 hours ago

apigymreinforcement-learning

wandb/wandb

The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.

Python10.9k831Updated just now

aicollaborationdata-sciencedata-versioningdeep-learningexperiment-trackhyperparameter-optimizationhyperparameter-searchhyperparameter-tuningjaxkerasmachine-learningml-platformmlopsmodel-versioningpytorchreinforcement-learningreproducibilitytensorflow

aws/amazon-sagemaker-examples

Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.

Jupyter Notebook10.9k7.0kUpdated just now

awsdata-sciencedeep-learningexamplesinferencejupyter-notebookmachine-learningmlopsreinforcement-learningsagemakertraining

MorvanZhou/Reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

Python9.4k5.0kUpdated 2 days ago

a3cactor-criticasynchronous-advantage-actor-criticddpgdeep-deterministic-policy-gradientdeep-q-networkdouble-dqndqndueling-dqnmachine-learningpolicy-gradientppoprioritized-replayproximal-policy-optimizationq-learningreinforcement-learningsarsasarsa-lambdatensorflow-tutorialstutorial

Hvass-Labs/TensorFlow-Tutorials

TensorFlow Tutorials with YouTube Videos

Jupyter Notebook9.3k4.1kUpdated 1 day ago

deep-learningmachine-learningneural-networkpython-notebookreinforcement-learningtensorflowtutorialyoutube

vwxyzjn/cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python9.2k1.0kUpdated 1 hour ago

a2cactor-criticadvantage-actor-criticaleatarideep-learningdeep-reinforcement-learninggymmachine-learningphasic-policy-gradientppoproximal-policy-optimizationpythonpytorchreinforcement-learningwandb

OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)

Python9.1k888Updated 1 hour ago

large-language-modelsopenai-o1proximal-policy-optimizationraylibreinforcement-learningreinforcement-learning-from-human-feedbacktransformersvllm

OpenPipe/ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python9.0k765Updated 4 hours ago

agentagentic-aigrpollmsloraqwenqwen3reinforcement-learningrl

lazyprogrammer/machine_learning_examples

A collection of machine learning examples and tutorials.

Python8.8k6.4kUpdated 3 hours ago

data-sciencedeep-learningmachine-learningnatural-language-processingpythonreinforcement-learning

VowpalWabbit/vowpal_wabbit

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

C++8.7k1.9kUpdated 1 day ago

active-learningc-plus-pluscontextual-banditscpplearning-to-searchmachine-learningonline-learningreinforcement-learning

Page 1 of 34