274 results for “topic:tts-api”
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.
A simple VITS HTTP API, developed by extending Moegoe with additional features.
免费的在线文本转语音API
A simple FastAPI Server to run XTTSv2
A feature-rich portal to chat with GPT-4, Claude, Gemini, Mistral, & OpenAI Assistant APIs via a lightweight Node.js web app; supports customizable multimodality for voice, images, & files.
TTS-文本转语音/文本转语音前端,兼容OpenAI、EdgeTTS等接口
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.
Self-host the ultra-lightweight Kitten TTS model with this enhanced API server with an intuitive Web UI, large text processing for audiobooks, and GPU acceleration.
NoneBot DeepSeek 插件。接入 DeepSeek 模型,提供智能对话与问答功能
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.
AivisSpeech Engine: AI Voice Imitation System - Text to Speech Engine
🗣️ ZAI/GLM TTS to OpenAI Speech API, 免费的语音合成API,支持克隆音色,基于智谱TTS
🌻 VITS ONNX TTS server designed for fast inference 🔥
Streaming TTS based on Piper with optional RK3588 NPU support
Open-Audio TTS: A robust web app leveraging OpenAI's powerful Text-to-Speech (TTS) models to generate natural-sounding audio from text. Built with modern web technologies for an intuitive user experience, including customizable voice and speech speed settings, and the ability to download audio files directly.
Simple Python script to interact with the TikTok TTS Voices.
A Non-Official ElevenLabs RESTful API Client for dotnet
CapCut TTS rapper API - CapCut API
An AI-powered chatbot integrated with Telegram, using OpenAI GPT-3.5 Turbo, language embeddings, and FAISS for similarity search to provide more contextually relevant responses to user queries
OpenAI API-compatible text-to-speech server using Microsoft VibeVoice-Realtime-0.5B. Docker or Python venv support, multiple voices with OpenAI aliases, CUDA-optimized.
any4any是一个企业级多模态AI平台,提供完整的智能交互解决方案。集成了大语言模型对话、数字人系统、智能SQL查询、语音处理、知识库系统等核心功能,支持OpenAI兼容API接口,可无缝集成到各类AI应用中。
not official API for Microsoft speech synthesis from Microsoft Edge web browser read aloud
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
Text To Speech Multilingual Support (+20 Language)
Twitch Streamer GPT is a NodeJS-based Twitch enhancement tool, offering interactive stream experiences with AI-powered automated responses, voice command activations, and advanced modules. It's easy to set up and suited for users of all tech levels.
Your speech assistant. Communicate with text-to-speech in games, on voice chat, on stream or simply on your speakers!
Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.
A Chrome extension for high-quality Text-to-Speech APIs like Google's WaveNet / OpenAI TTS API. Contributions Welcome!