Sven Mika

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

2 · Python

Repositories

sven1977/rllib_tutorials

Ray RLlib tutorial material

Jupyter Notebook · 12535 · Updated 3 years ago

sven1977/transformer_rlaif_end_to_end

Pre-train a small transformer with ray on your laptop from scratch on a translation task, then post-train with RLAIF.

Python · 0 · 0 · Updated 1 week ago

sven1977/RLlib_UE5_Demo

RLlib + Unreal Engine 5 demo

C++ · 2 · 2 · Updated 6 months ago

sven1977/ray (fork)

Python · 2 · 0 · Updated 4 months ago

sven1977/rl_debugging

No description provided.

Python · 1 · 0 · Updated 10 months ago

sven1977/rl_algorithms

No description provided.

Python · 1 · 0 · Updated 11 months ago

sven1977/rl_introduction

Python code accompanying the talk "Reinforcement Learning, An Introduction", Dr. Sven Mika (Duesseldorf, Germany, Aug 20th 2017)

Python · 0 · 0 · Updated 8 years ago

markov-decision-processes, mdp, python, q-learning, reinforcement-learning, reinforcement-learning-algorithms

sven1977/dreamer_v3

Implementation (TensorFlow/Keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023

Python · 3 · 0 · Updated 2 years ago

sven1977/attention_is_all_you_need

Implementation of the Transformer model described in "Attention Is All You Need" by Vaswani et al.

Python · 2 · 0 · Updated 3 years ago

sven1977/dreamerv3_danijar (fork)

Mastering Diverse Domains through World Models

0 · 0 · Updated 2 years ago

sven1977/huggingface_rllib

Load and upload RLlib models from and to the Hub.

Jupyter Notebook · 7 · 1 · Updated 3 years ago

sven1977/awesome-machine-learning (fork)

A curated list of awesome Machine Learning frameworks, libraries and software.

Python · 0 · 0 · Updated 4 years ago

sven1977/rl-experiments (fork)

Keeping track of RL experiments

0 · 0 · Updated 5 years ago

sven1977/shine

[s]erver-[h]osted [i]ntelligent [n]eural-net [e]nvironment

Python · 3 · 0 · Updated 8 years ago

artificial-intelligence, deep-learning, deep-reinforcement-learning, game-engine, machine-learning, neural-networks, pygame, python-3, reinforcement-learning, tensorflow

sven1977/perceptron_to_dnc

"From Perceptrons to Differentiable Neural Computers", by Dr. Sven Mika. Presentation at DUS ML meetup, Dec 2nd 2017

Jupyter Notebook · 1 · 0 · Updated 8 years ago

sven1977/gym (fork)

A toolkit for developing and comparing reinforcement learning algorithms.

Python · 0 · 0 · Updated 8 years ago
