27 results for “topic:openai-tts”
React / Vanilla JS Text to Speech with highlighting the words and sentences that are being spoken using audio files, text to speech API, and web speech synthesis API
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a Javascript framework based on Vue.js.
Implementation of OpenAI's Text-To-Speech in Unity. Synthesize any text and play it via any AudioSource.
Use OpenAI TTS(Text to Speech) API with Gradio
ESP32-based voice device for chatting with multiple custom AI bots. Recording questions with I2S microphone, transcribing via ElevenLabs or Deepgram STT, creating response with Groq or Open AI LLM. TTS audio output with custom AI voices via I2S & speaker. Supporting ongoing dialogues, calling bots ‘by name’, real-time web search via keyword.
MCP Server for Text to Speech
Code snippets showing how to record I2S audio and store as .wav file on ESP32 with SD card, how to transcribe pre-recorded audio via Deepgram SpeechToText (STT) API, how to generate audio from text via TextToSpeech (TTS) API from OpenAI a/o SpeechGen a/o Google TTS. Triggering ESP32 actions via Voice.
Implementation of OpenAI's Realtime API in Unity. Easily integrate low-latency, multi-modal conversations via push-to-talk or text.
Try out the OpenAI Text to Speech API in your browser.
This repository features a Gradio interface designed to leverage the OpenAI Text-To-Speech (TTS) API. The interface lets users create speech from provided text using different models and voice options.
Stream GPT response to TTS directly using Flask
OpenAI API powered Twitch chat bot that allows users to ask questions and receive answers via TTS.
The ai-tts.rs project provides a command-line tool for generating spoken audio from text files using OpenAI's text-to-speech (TTS) API.
Use LLMs to generate short ad-hoc English lessons based on any fiction work (won the Clarifai's NextGen GPT AI Hackathon).
Unofficial web client for OpenAI Text-to-Speech API with multilingual interface and voice options
This Python script processes a video file, generates a compelling description, creates a voiceover script in the style of David Attenborough, and synthesizes the voiceover using OpenAI's Text-to-Speech API.
An AI Voice Assistant that can read emails, WhatsApp messages, clipboard data, and captures webcam images and screenshots for contextual understanding.
This project allows you to create short videos based on a given topic. Version 2 (improve transform image and real clip) can be found at: Colab notebook. The reason why version 2 is not in this GitHub is that my laptop cannot afford the Nvidia-smi utility, which is required to install xformers for stable diffusion.
A powerful and easy-to-use Python library for generating natural-sounding speech using OpenAI's text-to-speech capabilities.
Convert text into natural, human-like speech using OpenAI's cutting-edge Text-to-Speech API.
Poem blog
This Next.js application provides a user interface for interacting with OpenAI's audio APIs, including text-to-speech and speech-to-text. Features include history tracking, audio playback, and download functionality.
An intelligent voice assistant for hospitals that interacts with patients in real-time over phone calls that listens, understands intent, books appointments, and responds instantly - all via a real-time WebSocket pipeline.
CLI for OpenAI's tts API
Openai tts example
A multilingual AI voice agent that speaks, listens, and understands Bangla & English. Automates calls, confirms orders, and schedules appointments.
A tool to convert a Wikipedia page into a podcast.