KI
Kimosabey/live-nexus-ai
Real-time WebRTC AI Agent with Generative UI.
LiveNexus AI
Real-Time Hybrid Audio Intelligence Platform
LiveNexus AI is a real-time speech intelligence system. It demonstrates how to achieve <200ms Latency Transcription on standard CPU hardware by combining WebRTC Transport (LiveKit) with optimized Edge Inference (Faster-Whisper + VAD).
๐ Quick Start
Get the system running in 2 steps:
# 1. Start Frontend (UI)
npm install && npm run dev
# 2. Start AI Worker (Deep Learning)
cd ai-worker && docker build -t worker . && docker run --env-file ../.env.local workerDetailed Setup: See GETTING_STARTED.md for API Keys.
๐ธ Demo & Architecture
System Architecture

WebRTC Client -> LiveKit Cloud -> Python Worker (VAD + Whisper)
Real-Time Pipeline

Audio In -> VAD Filter -> Inference -> DataChannel Out
Features Overview

Double-Buffer UI, CPU Optimization, and Binary Transport
Deep Dive: See ARCHITECTURE.md for the VAD Gating logic.
โจ Key Features
- โก <200ms Latency: Optimized quantized models run faster than cloud APIs.
- ๐ VAD Gating:
webrtcvaddrops 70% of silence packets, saving massive CPU. - ๐ง Resource Intelligence: Automatically downgrades model size if CPU > 80%.
- ๐ Zero-Stutter UI: "Double-Buffer" rendering strategy for smooth text updates.
๐ Documentation
| Document | Description |
|---|---|
| System Architecture | Hybrid Cloud/Edge design and VAD Pipeline. |
| Getting Started | Connect to LiveKit and run Docker worker. |
| Failure Scenarios | Handling High CPU and Network Jitter. |
| Interview Q&A | "Why not OpenAI API?" and "WebSockets vs DataChannels". |
๐ง Tech Stack
| Component | Technology | Role |
|---|---|---|
| Transport | LiveKit (WebRTC) | SFU & Signaling. |
| Inference | Faster-Whisper | Quantized Speech-to-Text. |
| Filter | WebRTCVAD | Voice Activity Detection. |
| Frontend | Next.js 14 | Real-time UI. |
๐ค Author
Harshan Aiyappa
Senior Full-Stack Hybrid Engineer
GitHub Profile
๐ License
This project is licensed under the MIT License - see the LICENSE file for details.
On this page
Languages
TypeScript57.7%Python32.8%CSS5.3%JavaScript2.3%PowerShell1.1%Dockerfile0.9%
Contributors
MIT License
Created January 13, 2026
Updated February 1, 2026
