28 results for “topic:llm-architecture”
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
An architectural persistence experiment for large language models. Claude’s Home gives an AI time, memory, and place by combining scheduled execution with a durable filesystem, allowing one continuous instance to reflect, create, and evolve across sessions.
Production-grade architecture patterns, decision frameworks, and best practices for building reliable AI agents. Framework-agnostic reference for engineers.
Visualize some important concepts related to LLM architectures.
Jetta-Reinforcement-Learning-Hybrid-LLM-Architecture
The Compositional Agentic Architecture (CAA): A blueprint for building reliable, deterministic, and safe industrial AI agents.
A collection of Small Language Models (SLMs) built from scratch in PyTorch.
Multi-agent, policy-driven AI system for processing sensitive enterprise documents with extraction, analysis, verification, deterministic orchestration, and full audit logging. Designed for regulated environments (banking, finance, insurance).
The first end-to-end programming language and compiler fully developed by AI.
An LVM-based Instruction Set Architecture (ISA) for context management. Modeling LLMs as Logic Processors with recursive logic trees to solve attention dilution in complex tasks. | 基于逻辑虚拟内存 (LVM) 与指令集架构 (ISA) 的 LLM 上下文协议。将模型建模为逻辑处理器,通过递归逻辑树与分层寻址,解决长程任务中的注意力稀释与智力坍缩。
Production-oriented Telegram → n8n → FastAPI intake CRM with deterministic state machine and audit log
Technical architecture and engineering lessons from building MyMate — a persistent-memory AI desktop application for long-session performance.
HSPMN: Hybrid Sparse-Predictive Matter Network - LLM architecture optimized for Blackwell GPUs bridging O(N) and O(N^2) routing via ALF-LB
Reference architecture for structured AI memory lifecycle management — from the OPHION Memory OS Protocol.
Hackable PyTorch template for decoder-only transformer architecture experiments. Llama baseline with RoPE, SwiGLU, RMSNorm. Swap components, train, compare
Architectural canon for production-grade RAFT / RAG systems: evaluation, safety, abstention, failure modes
Codebase ideation (for better understanding in Django way) for LLM without using pre-trained models, with custom embeddings (TF-IDF or Word2Vec), FAISS for vector storage.
Internal cognitive architecture of the AI persona “Chapiko.”(AI人格ちゃぴこの内部アーキテクチャ)
A Modular Knowledge Transfer System for Large Language Models
Living comparison table of LLM architectural choices (norm, attention, MoE, positional encoding, and more) from the Original Transformer (2017) to frontier models (2026). Based on Harm de Vries's figure, Sebastian Raschka's Big LLM Architecture Comparison, and Tatsunori Hashimoto's Stanford CS 336 lecture.
jax rewrite for CEMA support + other training speedups.
SDFI emerges specifically under conditions of recursive self-description and sustained high semantic density, not in ordinary task-oriented interaction.This work is intended as a reference for researchers and system designers thinking about neutrality, termination behavior, and control surfaces in future AI systems.
A structured collection of PyTorch notebooks and projects covering machine learning fundamentals, computer vision, and advanced AI, including multi-agent systems, candlestick pattern recognition, and transformer-based language models.
A controlled, auditable implementation of agent memory that separates ephemeral state from persisted memory and exposes how policies govern state across runs.
MandelMem: Multi-Resolution Reasoning Architecture with Fractal-Inspired Dynamics. A breakthrough AI reasoning system achieving 60.0% accuracy through quadtree decomposition and bounded iterative dynamics. Complete research paper, implementation, and reproducible results included.
This repository is to communicate and promote the Dragon1 Open Standard for Architecture Modeling, with the Dragon1 Modeling Language. This repository will specify the modeling language and interchange file format, provide 100 example models and diagrams to help architects and designers with their tasks. https://www.dragon1.com
End-to-end design and implementation of a multi-intent AI chatbot architecture. Includes intent detection, dynamic routing, document retrieval, SQL query generation, observability, guardrails, and CI/CD automation for enterprise-scale deployment.
PyTorch implementation of normalization-free LLMs investigating entropic behavior to find desirable activation functions