Yao Matrix
yao-matrix
A realistic idealist.
Repos: 66
Stars: 60
Forks: 58
Top Language: Python
Top Repositories
End-to-end speech recognition using TensorFlow
TF rnn ops w/ MKL-DNN kernel
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
A high-throughput and memory-efficient inference and serving engine for LLMs
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Repositories (66)
A high-throughput and memory-efficient inference and serving engine for LLMs
End-to-end speech recognition using TensorFlow
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
No description provided.
Offline optimization of your disaggregated Dynamo graph
A Datacenter Scale Distributed Inference Serving Framework
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Train transformer language models with reinforcement learning.
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Achieve state of the art inference performance with modern accelerators on Kubernetes
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
A pytorch quantization backend for optimum
APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention
No description provided.
LOMO: LOw-Memory Optimization
Accessible large language models via k-bit quantization for PyTorch.
Public repo for HF blog posts
TF rnn ops w/ MKL-DNN kernel
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
Notebooks using the Hugging Face libraries 🤗
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Tools for merging pretrained large language models.
Efficient Triton Kernels for LLM Training
Large Language Model Text Generation Inference
Inference server benchmarking tool
Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
LLM inference in C/C++
Everything about the Hugging Face (抱抱脸) localization volunteer collaboration team.
🤗 Optimum Intel: Accelerate inference with Intel optimization tools