11 results for “topic:ai-audio-generation”
Soundstorm is an AI-powered audio manipulation application designed to provide a rich yet simplified experience for sound designers, algorithmic composers, and experimental audio enthusiasts. It covers sample pack creation, algorithmic composition, AI text-to-audio, and on-screen ChatGPT.
An audiobook sound effect generator that transforms SRT files into immersive audio experiences. It parses SRT files, uses ChatGPT to create sound effect prompts, generates sounds via the ElevenLabs API, and syncs the audio on an MP3 timeline.
AI Audio Framework 🎵
Production-ready voice agents and speech pipelines (STT → LLM/Agent → TTS): voice receptionists, telephony, call recording, and tool/function calling. Built with Twilio, OpenAI Whisper, ElevenLabs, Vapi/Retell, FastAPI, WebSockets, and ffmpeg; designed for deployment, monitoring, and real-world reliability.
VoxForge Pro is a premium, offline audiobook generator powered by Kokoro-82M & Chatterbox TTS. Transform PDFs and text into professional audio using 47 lifelike voices across 6 languages. Features include voice cloning, smart OCR for scanned documents, and multi-speaker narration support.
AI-based music mood generation and remix system using MusicGen
Bedtime stories and soothing nursery rhymes, featuring AI-powered narration and a beautiful dark theme optimized for nighttime use.
Streamlining Text-to-Speech Tasks Using Google Colab
This project demonstrates real-time audio processing using Python. It captures audio from a microphone, converts the speech to text, and then synthesizes the text back to speech using a different voice. This can be useful for applications such as voice changers and real-time translation.
SoundScroll is an AI audiobook generator.
CLI TTS script