Repositories
63 repositories
No description provided.
A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.
AI Tensor Engine for ROCm
Awesome resources for GPUs
A Python interface for the ROCm HIP language
AMDGPU example code in HIP/ASM
Tritonbench is a collection of PyTorch custom operators with example inputs to measure their performance.
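The idea behind Tritonbench, measuring a custom operator against example inputs, can be sketched in plain Python. The `bench` helper and the naive `dot` "operator" below are illustrative assumptions, not part of Tritonbench itself, which targets PyTorch custom operators on GPUs:

```python
import time

def bench(fn, *args, warmup=3, iters=20):
    """Return the median wall-clock time (seconds) of fn(*args)."""
    for _ in range(warmup):              # warm up caches/JITs before timing
        fn(*args)
    times = []
    for _ in range(iters):
        t0 = time.perf_counter()
        fn(*args)
        times.append(time.perf_counter() - t0)
    times.sort()
    return times[len(times) // 2]        # median is robust to outliers

# Stand-in "operator" with example inputs of growing size (hypothetical).
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

for n in (1_000, 10_000):
    v = list(range(n))
    print(f"n={n}: {bench(dot, v, v):.2e} s")
```

Warmup iterations and a median over many runs are the usual defenses against one-off timing noise; real operator benchmarks on GPUs additionally need device synchronization before reading the clock.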
LLVM Code Generation, published by Packt
Playground for Domain-Specific Languages (DSL)
Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.
No description provided.
Fast and Furious AMD Kernels
LeetGPU Challenges
Triton kernel profiling and debugging tool for vLLM and AITER, for internal use
ASTER 💫: Assembly Tooling and Representations
Hands-on examples for optimizing and deploying AI models with OpenVINO.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
No description provided.
FlashInfer: Kernel Library for LLM Serving
Development repository for the Triton language and compiler
SGLang is a fast serving framework for large language models and vision language models.
No description provided.
Fast and memory-efficient exact attention
A high-throughput and memory-efficient inference and serving engine for LLMs
No description provided.
No description provided.
TensorRT-LLM provides an easy-to-use Python API for defining Large Language Models (LLMs) and supports state-of-the-art optimizations for efficient inference on NVIDIA GPUs. It also includes components for building Python and C++ runtimes that orchestrate inference execution performantly.
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.
Sample code for my CUDA programming book
Analyze the inference of Large Language Models (LLMs): computation, storage, transmission, and the hardware roofline model, in a user-friendly interface.
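The roofline analysis mentioned above boils down to a few lines of arithmetic: a kernel's attainable throughput is bounded by min(peak compute, memory bandwidth × arithmetic intensity). A minimal sketch for a matmul, using illustrative hardware numbers (100 TFLOP/s peak, 2 TB/s bandwidth, assumed, not any specific GPU):

```python
def gemm_roofline(m, n, k, bytes_per_elem, peak_flops, mem_bw):
    """Roofline bound for an (m x k) @ (k x n) matmul.

    FLOPs: 2*m*n*k (one multiply + one add per term).
    Bytes: read A and B, write C, each once (ideal caching assumed).
    Returns (arithmetic intensity in FLOP/byte, attainable FLOP/s).
    """
    flops = 2 * m * n * k
    bytes_moved = bytes_per_elem * (m * k + k * n + m * n)
    intensity = flops / bytes_moved                   # FLOPs per byte
    attainable = min(peak_flops, mem_bw * intensity)  # roofline bound
    return intensity, attainable

# fp16 GEMM (2 bytes/element) on assumed hardware: 100 TFLOP/s, 2 TB/s.
ai, bound = gemm_roofline(4096, 4096, 4096, 2, 100e12, 2e12)
print(f"intensity = {ai:.0f} FLOP/B, bound = {bound / 1e12:.0f} TFLOP/s")
```

A large square GEMM like this lands well past the roofline's knee, so it is compute-bound; the decode phase of LLM inference, dominated by matrix-vector products with intensity near 1 FLOP/byte, is the classic memory-bound counterexample.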