44 results for “topic:ai-infra”
OpenSandbox is a general-purpose sandbox platform for AI applications, offering multi-language SDKs, unified sandbox APIs, and Docker/Kubernetes runtimes for scenarios like Coding Agents, GUI Agents, Agent Evaluation, AI Code Execution, and RL Training.
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
A full-stack AI Red Teaming platform securing AI ecosystems via AI Infra scan, MCP scan, Agent skills scan, and LLM jailbreak evaluation.
The context backend for AI agents. Durable agent memory you can trust. Build, version, and retrieve grounded context from a context graph.
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
AI fundamentals: GPU architecture, CUDA programming, large-model basics, and AI Agent knowledge
High-performance distributed multi-tier cache system. Built in Rust.
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
HPC tutorials covering collective communication (MPI, NCCL), CUDA programming, SIMD vectorization, RDMA communication, and more
Implement a PyTorch-like deep learning library in C++ from scratch, step by step
Transform your pythonic research code into an artifact that engineers can deploy easily.
This is a landscape of the infrastructure that powers the generative AI ecosystem
Cloud Native ML/DL Platform
A High-Performance LLM Inference Engine with vLLM-Style Continuous Batching
Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.
LLM inference via Triton (flexible & modular): focused on kernel optimization using CUBIN binaries, starting from the gpt-oss model
Agent Sandbox is an E2B-compatible, enterprise-grade, AI-first, cloud-native runtime environment for AI Agents. It lets agents securely run untrusted LLM-generated code, browser use, computer use, shell commands, and more, with stateful, long-running, multi-session, multi-tenant execution.
💥 Make peer-to-peer work globally
KsanaDiT: High-Performance DiT (Diffusion Transformer) Inference Framework for Video & Image Generation
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
ElasticMM: Elastic and Efficient MLLM Serving System
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.
vgpu.rs is a fractional-GPU and vGPU-hypervisor implementation written in Rust
An OpenCL backend for Triton, using mlir-translate to emit OpenCL source code
This repository contains a list of various service-specific Azure Landing Zone implementation options.
Memory Management Service: a long-term memory solution for AI
A distributed cluster orchestrator for AI/ML batch workloads. Orchestrates containers via a custom Rust runtime.
The coordination protocol for autonomous AI agents across networks. Summoner lets you compose, run, and coordinate agents over a WAN with a Python SDK and Rust server.
The Lisa programming language.