Repositories
NeMo: a toolkit for conversational AI
Train neural networks up to 7x faster
LLM training code for Databricks foundation models
Arena-Hard benchmark
RewardBench: the first evaluation tool for reward models.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
scratch work
A modular RL library to fine-tune language models to human preferences
Essential guides and programming tools in my toolbox (with focus on ML Training)
A Data Streaming Library for Efficient Neural Network Training
Fast and flexible reference benchmarks
RLMeta is a lightweight, flexible framework for Distributed Reinforcement Learning Research.
Implementation of the Off-Belief Learning algorithm.
A reinforcement learning toolkit for compiler optimizations
A PyTorch implementation of the paper "Prediction, Consistency, Curvature: Representation Learning for Locally-Linear Control"
Probabilistic reasoning and statistical analysis in TensorFlow
Computation using data flow graphs for scalable machine learning