"topic:llm-proxy" — Search

75 results for “topic:llm-proxy”

Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic.

Rust6.0k360Updated 3 hours ago

ai-gatewayai-gateway-supportenvoyenvoyproxygatewaygenerative-aillm-gatewayllm-inferencellm-proxyllm-routingllmopsllmsopenaipromptproxyproxy-serverrouting

theopenco/llmgateway

Route, manage, and analyze your LLM requests across multiple providers with a unified API interface.

TypeScript1.0k108Updated just now

aiai-gatewayanalyticsanthropicapi-key-managementclaudecodexenterpriseguardrailsinferencellmllm-gatewayllm-proxyllmsobservabilityopenaiopencoderate-limitingtypescript

doramirdor/NadirClaw

Open-source LLM router & AI cost optimizer. Routes simple prompts to cheap/local models, complex ones to premium — automatically. Drop-in OpenAI-compatible proxy for Claude Code, Codex, Cursor, OpenClaw. Saves 40-70% on AI API costs. Self-hosted, no middleman.

Python35545Updated 2 days ago

aiai-cost-reductionai-routerclaude-codecodexcost-optimizationgeminillmllm-gatewayllm-proxyllm-routermodel-routingollamaopenaiopenai-proxyopenclawprompt-routingproxypythonself-hosted

nghyane/llm-mux

AI Gateway: Claude Pro, Copilot, Gemini subscriptions → OpenAI/Anthropic/Gemini APIs. No API keys needed.

Go33292Updated 1 month ago

ai-coding-assistantantigravityapi-multiplexerclaudeclaude-procopilotgeminigithub-copilotllm-gatewayllm-proxymulti-provideropenai-compatibleprotocol-translatorsubscription-to-api

starbaser/ccproxy

Build mods for Claude Code: Hook any request, modify any response, /model "with-your-custom-model", intelligent model routing using your logic or ours

Python18921Updated 11 hours ago

aiai-gatewayai-proxyai-toolsanthropicclaudeclaude-aiclaude-apiclaude-codeclaude-maxclaudecodegeminigemini-clilitellmllmllm-gatewayllm-proxyllmopsopenaiopenrouter

thushan/olla

High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model discovery across local and remote inference backends.

Go17420Updated 1 week ago

aiamdgolangintelllama-cppllamacppllm-inferencellm-proxyllm-routerllm-routinglmstudiolocal-aimlxnvidiaollamaproxyself-hostedself-hosted-aisglangvllm

peva3/SmarterRouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profiling, and automatic failover for local AI labs.

Python985Updated 6 days ago

ai-cacheai-gatewaydockerfastapigpu-monitoringllmllm-proxyllm-routerlocal-llmmodel-servingollamaollama-apiopenai-proxyself-hostedself-hosted-aisemantic-cache

Nayjest/lm-proxy

OpenAI-compatible HTTP LLM proxy / gateway for multi-provider inference (Google, Anthropic, OpenAI, PyTorch). Lightweight, extensible Python/FastAPI—use as library or standalone service.

Python9512Updated 1 month ago

aianthropicapi-proxyfastapigoogle-ailanguage-modelsllmllm-apillm-gatewayllm-inferencellm-proxyopenaiopenai-apiproxyproxy-serverpyton

greynewell/infermux

Route inference across LLM providers. Track cost per request.

Go897Updated 1 month ago

ai-gatewayai-infrastructureanthropicapi-gatewaycost-trackinggolanginferenceinference-routingllmllm-proxyllm-routerload-balancingmist-stackmodel-routingmodel-servingmulti-modelobservabilityopenaiprovider-abstractiontoken-tracking

LeenHawk/gproxy

gproxy is a Rust-based multi-channel LLM proxy that exposes OpenAI / Claude / Gemini-style APIs through a unified gateway, with a built-in admin console, user/key management, and request/usage auditing.

Rust6810Updated 21 hours ago

claudegeminigptllm-proxy

omarluq/cc-relay

⚡️ Blazing fast LLMs API Gateway written in Go

Go648Updated 3 hours ago

anthropicbedrockclaudeclaude-aiclaude-apiclaude-codegeminigemini-apiglm-4-7kimi-k2llm-apillm-gatllm-gateway-systemllm-proxymistral-aiollamaopenaiopenai-apivertex-aizai

PromptSail/prompt_sail

Open Source LLM proxy that transparently captures and logs all interactions with LLM API

HTML635Updated 9 months ago

llmllm-promptingllm-proxyprompt-engineeringprompt-managementprompt-managerprompt-tool

romgX/openrelay

几百个免费 AI 模型配额，一键接入本地项目。| Hundreds of free AI model quotas, one-click access to local projects.

TypeScript618Updated 23 hours ago

aiai-proxyaidercerebrasclaudeclaude-codecopilotcursordeveloper-toolsfree-aifree-apigroqkirollm-proxymodel-routeropenaiopenclawproxywindsurf

ferro-labs/ai-gateway

Open-source AI Gateway written in Go, one API for OpenAI, Anthropic, Bedrock, Azure, and 100+ LLMs. Built-in caching, guardrails, retries, and cost optimization. Run as a proxy or embed as a library.

Go317Updated 1 hour ago

ai-gatewayai-infrastructureanthropicazurebedrockdeepinfradeepseekdockergatewayllmllm-gatewayllm-proxyllmopsmcpmoonshot-aiqwensemantic-cache

fabiojbg/LLMApiGateway

A personal LLM gateway with fault-tolerant capabilities for calls to LLM models from any provider with OpenAI-compatible APIs. Advanced features like retry, model sequencing, and body parameter injection are also available. Especially useful to work with AI coders like Cline and RooCode and providers like OpenRouter.

Python233Updated 1 month ago

cline-aifault-tolerancegatewayllmllm-gatewayllm-proxyopenai-apiopenrouterproxyroocode

mulkymalikuldhrs/jsputer-proxy

A unified AI proxy server for free access to multiple LLM providers through Puter.js SDK - No expensive API keys needed!

JavaScript217Updated 1 month ago

ai-apiai-proxyanthropic-compatiblechatbotchatgptclaudedeepseekdeveloper-toolsexpressfree-aifree-llmgrokjavascriptllmllm-proxylocal-aimistralnodejsopenai-compatibleputer-js

Alorse/llm-proxy

Allows any BYOK AI editor or extension, such as Cursor or Continue, to connect to any openai-compatible LLM by aliasing it as a different model

TypeScript181Updated 8 months ago

aicursor-extensionllmllm-proxyvscode-extension

zxcloli666/AI-Worker-Proxy

OpenAI-compatible AI proxy: Anthropic Claude, Google Gemini, GPT-5, Cloudflare AI. Free hosting, automatic failover, token rotation. Deploy in 1 minute.

TypeScript1829Updated 1 week ago

ai-failoverai-gatewayai-load-balancerai-proxyanthropic-claudeapi-aggregatorapi-gatewaychatgpt-apicloudflare-workersedge-computingfree-proxygoogle-geminigpt-5llm-proxymulti-provideropenai-apiopenai-compatibleproxy-serverserverlesstoken-rotation

wa91h/local-ai-toolkit

A self-hosted AI toolkit running locally via Docker Compose, bundling an LLM gateway, workflow automation, and a chat UI — all backed by a shared PostgreSQL database.

Shell162Updated 3 weeks ago

aiai-agentdockerdocker-composelitellmllmllm-gatewayllm-proxylocal-llmn8nollamaopenwebuiself-hostedworkflow

tokligence/tokligence-gateway

Go LLM gateway — one interface for Claude Code, Codex, Gemini CLI, Anthropic, OpenAI, Qwen, and vLLM.

Go155Updated 4 weeks ago

ai-gatewayllm-proxymodel-routeropenai-compatible-proxy-servertoken-tracking

matdev83/llm-interactive-proxy

Connect any LLM-powered client app, such as a coding agent, to any supported inference backend/model.

Python151Updated 1 week ago

anthropic-apiclaude-aiclaude-apiclaude-codeclaude-proxydeepseekgemini-apigemini-cligoogle-vertex-apigrok-apilitellmlitellm-ai-gatewayllmllm-agentic-aillm-proxyopenai-apiopenrouterproxyqwen-coderqwen3

vibheksoni/UniClaudeProxy

Use any LLM with Claude Code — proxy that translates Anthropic API to OpenAI, Gemini, DeepSeek, Ollama, and more. Full tool calling, streaming, ReAct XML fallback, hot-reload config.

Python103Updated 1 month ago

ai-toolsanthropicapi-proxyclaudeclaude-codedeepseekfastapigeminillm-proxyollamaopenaireact-xmlssestreamingtool-calling

lynxai-team/goinfer

Local LLM proxy, DevOps friendly

Go92Updated 1 month ago

inferenceinference-apiinference-serverlanguage-model-apillama-apillama-cppllama-serverllamacppllmllm-proxyllm-routerlocal-ailocal-llmlocal-llm-integrationlocal-lmlocalllmopenai-apiopenaiapi

kiku-jw/reliapi

Small reliability layer for HTTP APIs and LLM calls. Idempotent HTTP/LLM proxy with retries, cache, circuit breaker and predictable AI costs.

Python80Updated 1 week ago

api-gatewayapi-reliabilitybudget-controlcachingcircuit-breakerfastapihttp-proxyidempotencyllm-gatewayllm-proxyprometheuspythonredisreliabilityretryself-hosted

quilrai/LLMWatcher

llm gateway that runs on your desktop

Rust70Updated 1 month ago

agent-gatewayagent-proxyclaude-code-monitoringcodex-monitorllm-gatewayllm-proxyllm-visibilityrate-limitingtoken-counter

Inebrio/Routerly

Self-hosted LLM gateway that routes requests across AI providers (OpenAI, Anthropic, Gemini, Mistral, Ollama) using intelligent multi-policy scoring — including an LLM-native routing policy. Drop-in compatible: just swap the base URL. No database required, built-in cost tracking, budget enforcement and multi-tenant isolation.

TypeScript70Updated 1 day ago

ai-gatewayai-routeranthropicbudget-enforcementcost-trackingllm-gatewayllm-proxyllm-routingmulti-tenantopenai-proxyself-hosted

zamorofthat/elida

Session-aware reverse proxy for AI agents: OWASP LLM Top 10 security policies for SOC and SRE teams

Go71Updated 1 day ago

ai-border-controllerai-gatewayai-governanceai-securityai-session-layerllm-proxyowaspreverse-proxysession-management

xiaoliuzhuan/model-relay-desktop

Local desktop relay for Trae and Cursor, routing OpenAI and Anthropic models through switchable proxy config groups.

Python60Updated 2 weeks ago

anthropicclaudecursordesktop-appllm-proxymodel-relaynuxtopenaiproxytauritrae

NodeNestor/CodeGate

Drop-in AI gateway. Route any coding agent through any LLM provider. Auto-failover, Anthropic<->OpenAI format conversion, 15 PII guardrails, multi-tenancy, multiple subscription support.

TypeScript60Updated 1 week ago

ai-proxyanthropicclaudecoding-agentfailoverformat-conversionllm-proxymulti-provideropenaipii-detectionprivacyrate-limiting

promptshieldhq/promptshield-proxy

A free, open-source LLM security proxy. Drop it between your app and any LLM provider to get rate limiting, audit logging, token tracking, and Prometheus metrics with no code changes to your app.

Go60Updated 18 hours ago

ai-agentllmllm-agentllm-proxy

Page 1 of 3