GitHunt

Yao Matrix

yao-matrix

A realistic idealist.

Intel
Shanghai

Languages

Python 68%, Jupyter Notebook 23%, Shell 5%, C++ 5%

Repos

66

Stars

60

Forks

58

Top Language

Python

Top Repositories

yao-matrix/vllm (fork)

A high-throughput and memory-efficient inference and serving engine for LLMs

Python | 0 stars | 0 forks | Updated 19 hours ago

yao-matrix/deepSpeech2

End-to-end speech recognition using TensorFlow

Python | 49 stars | 52 forks | Updated 7 years ago
Topics: deepspeech2, mkl, tensorflow

yao-matrix/peft (fork)

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python | 0 stars | 0 forks | Updated 2 weeks ago

yao-matrix/mblog

No description provided.

Jupyter Notebook | 5 stars | 0 forks | Updated 1 year ago

yao-matrix/aiconfigurator (fork)

Offline optimization of your disaggregated Dynamo graph

0 stars | 0 forks | Updated 1 month ago

yao-matrix/dynamo (fork)

A Datacenter Scale Distributed Inference Serving Framework

0 stars | 0 forks | Updated 1 month ago

yao-matrix/diffusers (fork)

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python | 1 star | 1 fork | Updated 2 months ago

yao-matrix/trl (fork)

Train transformer language models with reinforcement learning.

Python | 0 stars | 0 forks | Updated 2 months ago

yao-matrix/accelerate (fork)

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python | 0 stars | 0 forks | Updated 2 months ago

yao-matrix/transformers (fork)

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python | 0 stars | 0 forks | Updated 1 month ago

yao-matrix/llm-d (fork)

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell | 0 stars | 0 forks | Updated 3 months ago

yao-matrix/DeepSpeed (fork)

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python | 0 stars | 0 forks | Updated 3 months ago

yao-matrix/optimum-quanto (fork)

A pytorch quantization backend for optimum

Python | 0 stars | 0 forks | Updated 3 months ago

yao-matrix/APOLLO (fork)

APOLLO: SGD-like Memory, AdamW-level Performance; MLSys'25 Outstanding Paper Honorable Mention

Python | 0 stars | 0 forks | Updated 3 months ago

yao-matrix/FP-Quant (fork)

No description provided.

Python | 0 stars | 0 forks | Updated 4 months ago

yao-matrix/LOMO (fork)

LOMO: LOw-Memory Optimization

0 stars | 0 forks | Updated 1 year ago

yao-matrix/bitsandbytes (fork)

Accessible large language models via k-bit quantization for PyTorch.

Python | 0 stars | 0 forks | Updated 5 months ago

yao-matrix/blog (fork)

Public repo for HF blog posts

Jupyter Notebook | 0 stars | 0 forks | Updated 6 months ago

yao-matrix/mkldnn_rnn

TF rnn ops w/ MKL-DNN kernel

C++ | 5 stars | 5 forks | Updated 8 years ago
Topics: lstm, mkl-dnn, rnn, tensorflow

yao-matrix/detectron2 (fork)

Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.

0 stars | 0 forks | Updated 9 months ago

yao-matrix/notebooks (fork)

Notebooks using the Hugging Face libraries 🤗

Jupyter Notebook | 0 stars | 0 forks | Updated 9 months ago

yao-matrix/GaLore (fork)

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Python | 0 stars | 0 forks | Updated 9 months ago

yao-matrix/mergekit (fork)

Tools for merging pretrained large language models.

Python | 0 stars | 0 forks | Updated 9 months ago

yao-matrix/Liger-Kernel (fork)

Efficient Triton Kernels for LLM Training

0 stars | 0 forks | Updated 11 months ago

yao-matrix/text-generation-inference (fork)

Large Language Model Text Generation Inference

0 stars | 0 forks | Updated 1 year ago

yao-matrix/inference-benchmarker (fork)

Inference server benchmarking tool

0 stars | 0 forks | Updated 1 year ago

yao-matrix/optimum-habana (fork)

Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

Python | 0 stars | 0 forks | Updated 1 year ago

yao-matrix/llama.cpp (fork)

LLM inference in C/C++

0 stars | 0 forks | Updated 1 year ago

yao-matrix/hfcn-translation (fork)

Everything newcomers need to know about the 抱抱脸 (Hugging Face) localization volunteer team.

Jupyter Notebook | 0 stars | 0 forks | Updated 1 year ago

yao-matrix/optimum-intel (fork)

🤗 Optimum Intel: Accelerate inference with Intel optimization tools

Jupyter Notebook | 0 stars | 0 forks | Updated 1 year ago
