Repos
53
Stars
69
Forks
16
Top Language
Python
Loading contributions...
Top Repositories
An implementation of a phase vocoder that uses the Fast Lifting Wavelet Transform for pitch detection and TD-PSOLA for pitch correction
Read your tfrecord files from the command line
Scalable toolkit for efficient model reinforcement
Build RL environments for LLM training
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a high-performance serving framework for large language models and multimodal models.
Repositories
53Build RL environments for LLM training
A high-throughput and memory-efficient inference and serving engine for LLMs
Scalable toolkit for efficient model reinforcement
An implementation of a phase vocoder that uses the Fast Lifting Wavelet Transform for pitch detection and TD-PSOLA for pitch correction
SGLang is a high-performance serving framework for large language models and multimodal models.
Converts audio to MakeCode Arcade code!
Ongoing research training transformer models at scale
NeMo: a toolkit for conversational AI
Scalable toolkit for data curation
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
CI/CD templates for NeMo-FW libraries
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper GPUs, to provide better performance with lower memory utilization in both training and inference.
No description provided.
Scalable toolkit for efficient model alignment
A framework for few-shot evaluation of language models.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
No description provided.
NeMo Megatron launcher and tools
A machine learning compiler for GPUs, CPUs, and ML accelerators
No description provided.
wait for all `workflow_run` required workflows to be successful
Lingvo
Flax is a neural network library for JAX that is designed for flexibility.
A Fast, Extensible Progress Bar for Python and CLI
No description provided.
No description provided.
Pax is a Jax-based machine learning framework for training large scale models. Pax allows for advanced and fully configurable experimentation and parallelization, and has demonstrated industry leading model flop utilization rates.
Read your tfrecord files from the command line
JAX-Toolbox
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.