GitHunt

tongqiu

apinge

Languages

Python 47%, C++ 35%, MLIR 6%, Cuda 6%, LLVM 6%

Top Repositories

Repositories (63)
apinge/rocMLIR (fork)

No description provided.

MLIR · 0 · 0 · Updated 4 days ago
apinge/MeloTTS.cpp

A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.

C++ · 9514 · Updated 1 week ago
Topics: ai, openvino, openvino-toolkit, text-to-speech, tts
apinge/aiter (fork)

AI Tensor Engine for ROCm

Python · 0 · 0 · Updated 1 week ago
apinge/Awesome-GPU (fork)

Awesome resources for GPUs

1 · 0 · Updated 2 weeks ago
apinge/pyhip (fork)

A python interface for ROCM HIP language

C++ · 0 · 0 · Updated 2 weeks ago
apinge/gcnasm (fork)

amdgpu example code in hip/asm

0 · 0 · Updated 1 month ago
apinge/tritonbench (fork)

Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.

0 · 0 · Updated 1 month ago
apinge/LLVM-Code-Generation (fork)

LLVM Code Generation, published by Packt

C++ · 0 · 0 · Updated 1 month ago
apinge/dsl-lab (fork)

Playground for Domain-Specific Languages (DSL)

0 · 0 · Updated 1 month ago
apinge/triton-runner (fork)

Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.

0 · 0 · Updated 1 month ago
apinge/TritonStudyGroup (fork)

No description provided.

0 · 0 · Updated 1 month ago
apinge/HipKittens (fork)

Fast and Furious AMD Kernels

C++ · 0 · 0 · Updated 1 month ago
apinge/leetgpu-challenges (fork)

LeetGPU Challenges

0 · 0 · Updated 1 month ago
apinge/TPT (fork)

Triton kernel profile and debug tool among vllm & aiter for internal usage

Python · 0 · 0 · Updated 1 month ago
apinge/aster (fork)

ASTER 💫 : Assembly Tooling and Representations

C++ · 0 · 0 · Updated 1 month ago
apinge/openvino_ai_practice

Hands-on examples for optimizing and deploying AI models with OpenVINO.

Python · 4 · 2 · Updated 2 months ago
apinge/cutlass (fork)

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ · 0 · 0 · Updated 2 months ago
apinge/Alpha-MoE (fork)

No description provided.

Cuda · 0 · 0 · Updated 2 months ago
apinge/flashinfer (fork)

FlashInfer: Kernel Library for LLM Serving

0 · 0 · Updated 2 months ago
apinge/triton (fork)

Development repository for the Triton language and compiler

0 · 0 · Updated 2 months ago
apinge/sglang (fork)

SGLang is a fast serving framework for large language models and vision language models.

Python · 0 · 0 · Updated 3 months ago
apinge/llm-benchmark-tools (fork)

No description provided.

0 · 0 · Updated 3 months ago
apinge/flash-attention (fork)

Fast and memory-efficient exact attention

Python · 0 · 0 · Updated 3 months ago
apinge/vllm (fork)

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 0 · 0 · Updated 3 months ago
apinge/vllm-audio

No description provided.

Python · 0 · 0 · Updated 3 months ago
apinge/whisper-docker-file (fork)

No description provided.

Python · 0 · 0 · Updated 3 months ago
apinge/TensorRT-LLM (fork)

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in a performant way.

0 · 0 · Updated 3 months ago
apinge/llvm-project (fork)

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM · 0 · 0 · Updated 3 months ago
apinge/CUDA-Programming (fork)

Sample codes for my CUDA programming book

0 · 0 · Updated 5 months ago
apinge/LLM-Viewer (fork)

Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

0 · 0 · Updated 5 months ago
