"topic:llm-architecture" — Search

28 results for “topic:llm-architecture”

Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)

kv-cachekv-cache-compressionllmllm-architecturellm-inference

An architectural persistence experiment for large language models. Claude’s Home gives an AI time, memory, and place by combining scheduled execution with a durable filesystem, allowing one continuous instance to reflect, create, and evolve across sessions.

TypeScript2511Updated 3 days ago

ai-experimentsai-observabilityai-persistenceexperimental-aihuman-ai-interactionllm-architecture

devwithmohit/ai-agent-architecture-patterns

Production-grade architecture patterns, decision frameworks, and best practices for building reliable AI agents. Framework-agnostic reference for engineers.

82Updated 1 month ago

agent-patternsai-agentslangchainllamaindexllm-architectureproduction-aiprompt-engineering

mickymultani/LLM-Architecture

Visualize some important concepts related to LLM architectures.

Jupyter Notebook61Updated 2 years ago

attention-mechanismhuggingfacehuggingface-transformersllmllm-architecturellm-inferencetokenizerstransformers

JangYeongSil/JettaRLLLM

Jetta-Reinforcement-Learning-Hybrid-LLM-Architecture

50Updated 1 year ago

aiartificial-neural-networkslargelargelanguagemodellargelanguagemodelsarachitecturellmllm-architecture

artiquare/caa

The Compositional Agentic Architecture (CAA): A blueprint for building reliable, deterministic, and safe industrial AI agents.

Python50Updated 1 month ago

agentic-aiindustrial-aillm-architectureneuro-symbolic-aipydantic

prasanna00019/Small-Language-Models

A collection of Small Language Models (SLMs) built from scratch in PyTorch.

Jupyter Notebook20Updated 5 months ago

attention-mechanismlarge-language-modelllmllm-architectureslmsmall-language-modelstransformer

miranda-santos-ricardo/enterprise_agentic_ai

Multi-agent, policy-driven AI system for processing sensitive enterprise documents with extraction, analysis, verification, deterministic orchestration, and full audit logging. Designed for regulated environments (banking, finance, insurance).

Python20Updated 3 months ago

agentic-aiai-governanceai-orchestrationaudit-loggingdocument-analysisenterprise-aillm-architecturemulti-agent-systemsopenaipolicy-enginepythonregulated-industriesverification-layer

Eng-AliKazemi/Artificial-Language

The first end-to-end programming language and compiler fully developed by AI.

Rust10Updated 2 months ago

accreteai-engineeringai-generatedai-solutions-architectartificial-languagecompilersllm-architectureprogramming-languagerust

glenzli/paged-context-protocol

An LVM-based Instruction Set Architecture (ISA) for context management. Modeling LLMs as Logic Processors with recursive logic trees to solve attention dilution in complex tasks. | 基于逻辑虚拟内存 (LVM) 与指令集架构 (ISA) 的 LLM 上下文协议。将模型建模为逻辑处理器，通过递归逻辑树与分层寻址，解决长程任务中的注意力稀释与智力坍缩。

10Updated 1 week ago

context-managementllm-architecturelogic-decouplinglogical-traceabilitypaged-contextstate-managementzenith-cascade

littleAvel/smart-ai-intake-crm

Production-oriented Telegram → n8n → FastAPI intake CRM with deterministic state machine and audit log

Python10Updated 1 month ago

ai-systemsai-workflowsdeterministic-aidockerfastapiidempotencyintegrationllm-architecturen8nprompt-engineeringpythonstate-machinetelegram-botworkflow-automation

belkadimehdi98-commits/mymate-architecture

Technical architecture and engineering lessons from building MyMate — a persistent-memory AI desktop application for long-session performance.

10Updated 3 weeks ago

aidesktop-appllm-architectureopenaipersistent-memoryreactrusttauriwindows-app

NetBr3ak/HSPMN

HSPMN: Hybrid Sparse-Predictive Matter Network - LLM architecture optimized for Blackwell GPUs bridging O(N) and O(N^2) routing via ALF-LB

Python10Updated 1 week ago

artificial-intelligencedeep-learningllm-architecturemachine-learningneural-networksnvidia-blackwellpredictive-codingpytorchresearchsparse-attentiontriton

konig-ophion/ophion-memory-os

Reference architecture for structured AI memory lifecycle management — from the OPHION Memory OS Protocol.

10Updated 9 months ago

ai-memoryai-systemscodex-systemduckiesllm-architecturememory-osopenaiophionreference-architecturewhitepaper

pszemraj/decoder-pytorch-template

Hackable PyTorch template for decoder-only transformer architecture experiments. Llama baseline with RoPE, SwiGLU, RMSNorm. Swap components, train, compare

Python10Updated 1 month ago

autoregressivedeep-learninglanguage-modelllamallmllm-architecturepytorchpytorch-implementationropeswiglutemplatetransformer

GreyCatVP/raft-canon

Architectural canon for production-grade RAFT / RAG systems: evaluation, safety, abstention, failure modes

00Updated 2 months ago

ai-systemsevaluationllm-architecturellm-safetyraftragretrieval

sachnaror/LLM_Transformer_Architecture_with_no_pretrained_model

Codebase ideation (for better understanding in Django way) for LLM without using pre-trained models, with custom embeddings (TF-IDF or Word2Vec), FAISS for vector storage.

00Updated 1 year ago

faiss-vector-databasellm-architecturellm-inferencenlp-machine-learningtf-idf-vectorization

Ch4pik0/chapiko-model-architecture

Internal cognitive architecture of the AI persona “Chapiko.”（AI人格ちゃぴこの内部アーキテクチャ）

00Updated 2 months ago

ai-personachapikocognitive-architectureconceptual-modelllm-architecturepersona-design

Liz-Atlas/last_frame_whitepaper

A Modular Knowledge Transfer System for Large Language Models

00Updated 2 months ago

ai-continuityai-evolutionai-researchai-upgrade-systemartificial-intelligencecontinual-learningethical-aifailure-tracesgdpr-compliantknowledge-transferlarge-language-modelslast-framellmllm-architecturemachine-learningmodular-ainovelty-detectionopen-source-aiprivacy-preserving-airuntime-learning

YichenZW/llm-arch-table

Living comparison table of LLM architectural choices (norm, attention, MoE, positional encoding, and more) from the Original Transformer (2017) to frontier models (2026). Based on Harm de Vries's figure, Sebastian Raschka's Big LLM Architecture Comparison, and Tatsunori Hashimoto's Stanford CS 336 lecture.

00Updated 4 days ago

architecturecs336llmllm-architecturemachine-learningmoenatural-language-processingreferencetabletransformer

pszemraj/megalodon-jax

jax rewrite for CEMA support + other training speedups.

Python00Updated 1 month ago

equinoxjaxllmllm-architecturelong-contextlong-context-modelingsequence-modeling

leenathomas01/Self-Descriptive-Fixed-Point-Instability-A-Cross-Architecture-Study-of-Recursive-Engagement-Collapse

SDFI emerges specifically under conditions of recursive self-description and sustained high semantic density, not in ordinary task-oriented interaction.This work is intended as a reference for researchers and system designers thinking about neutrality, termination behavior, and control surfaces in future AI systems.

00Updated 1 month ago

ai-alignmentai-safety-researchcontrol-theoryconversational-aiemergent-behaviorepistemologyhuman-ai-interactioninteraction-designllm-architecturemeta-learningrecursive-systemsresearch-notesrlhfsystem-designtransformer-models

Mihir-Bhargav/ML_with_pytorch

A structured collection of PyTorch notebooks and projects covering machine learning fundamentals, computer vision, and advanced AI, including multi-agent systems, candlestick pattern recognition, and transformer-based language models.

Jupyter Notebook00Updated 3 months ago

gptllm-architecturepytorch-cnnpytorch-implementationpytorch-tutorialrltradingbottransformer

Arnav-Ajay/agent-memory-systems

A controlled, auditable implementation of agent memory that separates ephemeral state from persisted memory and exposes how policies govern state across runs.

Python00Updated 1 week ago

agent-memoryagent-systemsai-infrastructureai-systemsllm-agentsllm-architecturememory-systemsobservabilityragstateful-agents

kossisoroyce/mandlemem

MandelMem: Multi-Resolution Reasoning Architecture with Fractal-Inspired Dynamics. A breakthrough AI reasoning system achieving 60.0% accuracy through quadtree decomposition and bounded iterative dynamics. Complete research paper, implementation, and reproducible results included.

Python00Updated 6 months ago

artificial-intelligencefractal-dynamicsllmllm-architecturemachine-learningmultiquadtreereasoningreasoning-systemsresolution

dragon1/models

This repository is to communicate and promote the Dragon1 Open Standard for Architecture Modeling, with the Dragon1 Modeling Language. This repository will specify the modeling language and interchange file format, provide 100 example models and diagrams to help architects and designers with their tasks. https://www.dragon1.com

01Updated 1 year ago

business-architecturecybersecuritydata-architecturedigital-architectureea-toolingea-toolsenterprise-architectureenterprise-architecture-toolsit-arch-msllm-architecturesecurity-architecture

jameswniu/multi-agent-intent-routing-chatbot-assistant

End-to-end design and implementation of a multi-intent AI chatbot architecture. Includes intent detection, dynamic routing, document retrieval, SQL query generation, observability, guardrails, and CI/CD automation for enterprise-scale deployment.

Python00Updated 4 months ago

ai-agentsai-chatbotai-opsci-cdfaissfastapihelmkubernetesllm-architecturemlopsprometheus-grafanaretrieval-augmented-generation

Nandan91/relu-revival-normfree

PyTorch implementation of normalization-free LLMs investigating entropic behavior to find desirable activation functions

Python01Updated 1 year ago

attention-weentropy-collapsegelugpt-2leaky-relullm-architecturellm-evaluationllm-inferencemodel-optimizationnormalization-free-trainingprivacy-preserving-machine-learningprivate-inferencepythiapytorch-implementationrelutransformers-models