GitHunt

Sven Mika

sven1977

Reinforcement Learning (RLlib) and LLMs

anyscale.com
San Francisco, CA; Düsseldorf, Germany

Languages

Python71%Jupyter Notebook21%C++7%

Repos

16

Stars

147

Forks

38

Top Language

Python

Loading contributions...

Top Repositories

Repositories

16
SV
sven1977/rllib_tutorials

Ray RLlib tutorial material

Jupyter Notebook12535Updated 3 years ago
SV
sven1977/transformer_rlaif_end_to_end

Pre-train a small transformer with ray on your laptop from scratch on a translation task, then post-train with RLAIF.

Python00Updated 1 week ago
SV
sven1977/RLlib_UE5_Demo

RLlib + Unreal Engine 5 demo

C++22Updated 6 months ago
SV
sven1977/rayFork

A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.

Python20Updated 4 months ago
SV
sven1977/rl_debugging

No description provided.

Python10Updated 10 months ago
SV
sven1977/rl_algorithms

No description provided.

Python10Updated 11 months ago
SV
sven1977/rl_introduction

python code accompanying the talk "Reinforcement Learning, An Introduction", Dr. Sven Mika (Duesseldorf, Germany Aug 20th 2017)

Python00Updated 8 years ago
markov-decision-processesmdppythonq-learningreinforcement-learningreinforcement-learning-algorithms
SV
sven1977/dreamer_v3

Implementation (TensorFlow/keras) of the DreamerV3 model-based RL algorithm by Hafner et al. 2023

Python30Updated 2 years ago
SV
sven1977/attention_is_all_you_need

Implementation of the Transformer Model described in "Attention is all you need" by Vaswani et al.

Python20Updated 3 years ago
SV
sven1977/dreamerv3_danijarFork

Mastering Diverse Domains through World Models

00Updated 2 years ago
SV
sven1977/huggingface_rllib

Load and upload RLlib models from and to the Hub.

Jupyter Notebook71Updated 3 years ago
SV
sven1977/awesome-machine-learningFork

A curated list of awesome Machine Learning frameworks, libraries and software.

Python00Updated 4 years ago
SV
sven1977/rl-experimentsFork

Keeping track of RL experiments

00Updated 5 years ago
SV
sven1977/shine

[s]erver-[h]osted [i]ntelligent [n]eural-net [e]nvironment

Python30Updated 8 years ago
artificial-intelligencedeep-learningdeep-reinforcement-learninggame-enginemachine-learningneural-networkspygamepython-3reinforcement-learningtensorflow
SV
sven1977/perceptron_to_dnc

"From Perceptrons to Differentiable Neural Computers", by Dr. Sven Mika. Presentation at DUS ML meetup Dec 2nd 2017

Jupyter Notebook10Updated 8 years ago
SV
sven1977/gymFork

A toolkit for developing and comparing reinforcement learning algorithms.

Python00Updated 8 years ago

Gists

Recent Activity

Sven Mika (sven1977) | GitHunt