71 results for “topic:realtime-audio”
Realtime audio analysis in Python, using PyAudio and Numpy to extract and visualize FFT features from streaming audio.
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
Join the community on Discord for more discussions around Neutone! https://discord.gg/VHSMzb8Wqp
:musical_note: strongly-timed musical programming language
Guide on how to set-up Linux and Docker for real-time applications using the Ubuntu realtime-kernel/PREEMPT_RT patch with a focus on robotics with ROS and ROS 2
Example applications that use the OpenTok iOS SDK
FreeSWITCH module to stream audio to websocket and receive response
Rust Agent Development Kit (ADK-Rust): Build AI agents in Rust with modular components for models, tools, memory, realtime voice, and more. ADK-Rust is a flexible framework for developing AI agents with simplicity and power. Model-agnostic, deployment-agnostic, optimized for frontier AI models. Includes support for real-time voice agents.
ppooll (formerly lloopp) is an audio & video performance environment written in max/MSP.
🔥 Modular shader engine designed for simplicity and speed
Realtime Safe OSC packet serialization and dispatch
A realtime scripted modular audio engine for video games and musical applications.
An opensource harmonizer implementation leveraging the DISTRHO Plugin Framework.
STatic (LLVM) Object file Analysis Tool
Xiaozhi websocket protocol implemented by Golang, setup your own xiaozhi-server by routing requests to OpenAI Realtime API protocol such as Stepfun API
PersonaPlex on Apple Silicon: an MLX port of NVIDIA’s full-duplex speech-to-speech model with realtime local/web modes and offline WAV inference.
Syntax sugar of OpenTok iOS SDK with Audio/Video communication including screen sharing
Coffee Chat Voice Assistant is a voice-driven ordering system powered by Azure OpenAI GPT-4o Realtime API, simulating the experience of ordering coffee with a café barista. It supports natural conversations, live order updates, and real-time transcription, showcasing the power of AI for seamless customer interactions.
🦀 Rust powered LLM, Whisper, Embedding inference, backed by 🤗 candle from HuggingFace
TEN VAD low-latency voice activity detection for real-time streaming, integrated with livekit-agents
C# bindings for Jackd
Google Gemini live voice to text realtime stream in the browser
A python library to control a csound process
Next-gen (SO)und (PRO)cessing: golang native, upscaler, downscaler, transcoder, neural networks, full mem & on-the-fly & streaming.
eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and Semantic Search with a chat that uses gpt-40-realtime audio
Video Chat room support tens of thousands of people simultaneously online video chat, online karaoke dance, video dating.
Jacob Collier-like harmonizer, because I'm jealous and I want a choir for myself too
A comprehensive sample app built by OpenTok Accelerator Packs
Pluggable real-time audio conversation framework for .NET. Local VAD, STT, TTS, and LLM
A high-level real-time audio library for playback, generation and recording, focusing on ease of use and performance. Based on miniaudio.