309 results for “topic:ai-infrastructure”
Open-source context retrieval layer for AI agents
Local persistent memory store for LLM applications, including Claude Desktop, GitHub Copilot, Codex, Antigravity, and others.
🦞 LLM Token Compression & Reduction Tool — cut AI agent token costs by up to 97% with 6-layer deterministic context compression for AI agent workspaces. No LLM required. Prompt compression, context-window optimization, and cost reduction for any LLM pipeline.
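The entry above describes deterministic (LLM-free) context compression. A toy sketch of one such pass, collapsing whitespace and dropping duplicate lines — purely illustrative, not this project's actual pipeline or API:

```python
import re

def compress(text: str) -> str:
    """One illustrative deterministic pass: collapse runs of
    whitespace and drop empty or duplicate lines."""
    seen, out = set(), []
    for line in text.splitlines():
        line = re.sub(r"\s+", " ", line).strip()
        if line and line not in seen:
            seen.add(line)
            out.append(line)
    return "\n".join(out)

print(compress("alpha  beta\nalpha  beta\n\ngamma"))  # prints: alpha beta \n gamma
```

A real tool would layer several such passes (dedup, stop-token stripping, structural summarization), each cheap and reproducible because no model call is involved.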
Semantica 🧠 — A framework for building semantic layers, context graphs, and decision intelligence systems with explainability and provenance.
Plug-and-play memory for LLMs in 3 lines of code. Add persistent, intelligent, human-like memory and recall to any model in minutes.
Grov automatically captures context from your private AI sessions and syncs it to a shared team memory. It auto-injects relevant memories across developers and future sessions, saving tokens and time spent on tasks.
Route inference across LLM providers. Track cost per request.
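Routing across providers while tracking cost per request is a generic pattern. A minimal sketch with hypothetical provider names and made-up prices (not any real provider's pricing or this project's API):

```python
from dataclasses import dataclass, field

# Hypothetical per-1K-token prices; real pricing varies by provider and model.
PRICES = {"provider_a": 0.0005, "provider_b": 0.0020}

@dataclass
class Router:
    """Toy router: pick the cheapest provider, log cost per request."""
    ledger: list = field(default_factory=list)

    def route(self, prompt: str, tokens: int) -> str:
        provider = min(PRICES, key=PRICES.get)  # cheapest-first policy
        cost = PRICES[provider] * tokens / 1000
        self.ledger.append({"provider": provider, "tokens": tokens, "cost": cost})
        return provider

r = Router()
r.route("hello", 2000)
print(r.ledger[0]["provider"], round(r.ledger[0]["cost"], 4))  # prints: provider_a 0.001
```

Production routers add latency- and quality-aware policies on top of the cost table, but the per-request ledger is the core of cost attribution.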
Stop rewriting integrations. Open-source protocol suite that standardizes LLM, Vector, Graph, and Embedding interfaces across LangChain, LlamaIndex, AutoGen, CrewAI, Semantic Kernel, MCP — and any provider.
Distributed data mesh for real-time access, migration, and replication across diverse databases — built for AI, security, and scale.
A Rust runtime that unifies relational tables, graph relationships, and vector embeddings in a single tensor-based storage layer with distributed consensus and semantic search
NPU-powered on-device AI mobile applications using Melange.
AI Infrastructure Engineer Learning Track - Production ML infrastructure curriculum (2-4 years experience)
Stop paying for AI APIs during development. LocalCloud runs everything locally: GPT-level models, databases, and more, all free.
A curated list of awesome tools, frameworks, platforms, and resources for building scalable and efficient AI infrastructure, including distributed training, model serving, MLOps, and deployment.
Predictive memory layer for AI agents. MongoDB + Qdrant + Neo4j with multi-tier caching, custom schema support, and GraphQL. 91% accuracy on Stanford STARK; <100 ms on-device retrieval.
CX Linux — AI-powered Linux OS. Natural language system administration for Ubuntu & Debian. The AI layer for Linux infrastructure.
TME: Structured memory engine for LLM agents to plan, rollback, and reason across multi-step tasks.
Zero-code LLM security & observability proxy. Real-time prompt injection detection, PII scanning, and cost control for OpenAI-compatible APIs. Built in Rust.
GPU-aware inference mesh for large-scale AI serving
Production-ready AI infrastructure: RAG with smart reindexing, persistent memory, browser automation, and MCP integration. Stop rebuilding tools for every AI project.
Kubernetes operator for GPU-accelerated LLM inference - air-gapped, edge-native, production-ready
AI SRE tools for RCA, incident response, cost saving, infra management, DevOps, and more.
ARF is an agentic reliability intelligence platform that separates decision intelligence (OSS) from governed execution (Enterprise), enabling autonomous operations with deterministic safety guarantees.
UniRobot is an embodied-intelligence software framework that integrates the robot brain (data, models, model training) with the robot body (perception, model inference, control).
Open-source AI Gateway written in Go: one API for OpenAI, Anthropic, Bedrock, Azure, and 100+ LLMs. Built-in caching, guardrails, retries, and cost optimization. Run as a proxy or embed as a library.
AI Infrastructure Junior Engineer Learning Track - Comprehensive curriculum for entry-level ML infrastructure engineers (0-2 years experience)
A production-grade LLM gateway that abstracts multiple model providers, implements intelligent routing, caching, retries, and observability to deliver reliable, cost-aware LLM access.
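Caching and retries are the two mechanisms such gateways combine most often. A minimal sketch of that combination under stated assumptions (in-memory cache, exponential backoff; names are hypothetical, not this gateway's API):

```python
import time

def with_retries_and_cache(call, retries=3, backoff=0.1):
    """Wrap a provider call with an in-memory cache and a simple retry loop."""
    cache = {}
    def wrapped(prompt):
        if prompt in cache:            # serve repeated prompts without a provider call
            return cache[prompt]
        for attempt in range(retries):
            try:
                result = call(prompt)
                cache[prompt] = result
                return result
            except Exception:
                if attempt == retries - 1:
                    raise              # exhausted retries: surface the error
                time.sleep(backoff * 2 ** attempt)  # exponential backoff
    return wrapped

# Demo: a flaky backend that fails once, then succeeds.
calls = {"n": 0}
def flaky(prompt):
    calls["n"] += 1
    if calls["n"] == 1:
        raise RuntimeError("transient error")
    return "ok:" + prompt

ask = with_retries_and_cache(flaky, backoff=0)
print(ask("q"), ask("q"), calls["n"])  # prints: ok:q ok:q 2  (second call hits the cache)
```

A production gateway would key the cache on model + parameters as well as the prompt, bound its size, and retry only on retryable status codes.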
Lightfast surfaces every decision your team makes across your tools — searchable, cited, and ready for people and agents.
Orchestrator for shared AI agents and documentation synchronization across multi-repo organizations.
This repository contains a list of various service-specific Azure Landing Zone implementation options.