Repositories
PyTorch native quantization and sparsity for training and inference
A PyTorch native platform for training generative AI models
FlashInfer: Kernel Library for LLM Serving
FBGEMM (Facebook General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Fast and Furious AMD Kernels
Distributed Compiler based on Triton for Parallel Systems
Ongoing research training transformer models at scale
A library for accelerating Transformer models on NVIDIA GPUs, including 8-bit floating point (FP8) precision on Hopper, Ada, and Blackwell GPUs, providing better performance and lower memory utilization in both training and inference.
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
AI Tensor Engine for ROCm
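Several of the libraries above center on low-precision formats such as FP8. As a minimal illustrative sketch (not any of these libraries' actual APIs), the snippet below shows the core scaling mechanics behind FP8-style quantization: values are rescaled so the largest magnitude maps to the format's maximum representable value (about 448 for E4M3), rounded to a coarse grid, and rescaled back on dequantization. The integer rounding here is a simplification closer to int8 than true FP8 mantissa rounding, but it demonstrates why a per-tensor scale factor is needed.

```python
# Illustrative sketch only; assumes nothing about torchao / Transformer
# Engine internals. E4M3_MAX is the largest finite FP8 E4M3 value.
E4M3_MAX = 448.0

def quantize(values):
    """Scale floats so max |v| hits E4M3_MAX, then round to a coarse grid.

    Note: true FP8 rounds to a 3-bit mantissa; integer rounding is a
    simplification used here to keep the example self-contained.
    """
    amax = max(abs(v) for v in values)
    scale = E4M3_MAX / amax if amax > 0 else 1.0
    q = [round(v * scale, 0) for v in values]
    return q, scale

def dequantize(q, scale):
    """Undo the scaling to recover approximate original values."""
    return [v / scale for v in q]

vals = [0.1, -1.5, 3.2]
q, s = quantize(vals)
deq = dequantize(q, s)
```

In real libraries the scale (often derived from a running "amax" history, so-called delayed scaling) is tracked per tensor so that activations, weights, and gradients each stay inside the narrow FP8 dynamic range.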