104 results for “topic:whisper-api”
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Project that allows one to use a microphone with OpenAI Whisper.
A 100% private AI voice transcription app that converts speech to text in 100+ languages. Built with Compose Multiplatform for Android & iOS using Whisper AI - no cloud uploads, all processing happens on-device for complete privacy.
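⚡ A high-performance asynchronous API for automatic speech recognition (ASR) and translation. There is no need to purchase the Whisper API: inference runs on a locally hosted Whisper model, with support for multi-GPU concurrency and a design aimed at distributed deployment. It also includes built-in crawlers for social media platforms such as TikTok and Douyin, enabling seamless media processing across multiple social platforms and providing a powerful, scalable solution for automated processing of media content data.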
Transcribe and translate audio to text using Whisper and DeepL.
Private voice keyboard, AI chat, images, webcam, recordings, voice control with >= 4 GiB of VRAM.
AI-powered eSports logo generator (for academic purposes during the Tenerife GG event)
openai-whisper-talk is a sample voice conversation application powered by OpenAI technologies such as Whisper, Completions, Embeddings, and the latest Text-to-Speech. The application is built using Nuxt, a JavaScript framework based on Vue.js.
whisper.cpp bindings for python
A sample speech transcription app implementing the OpenAI Speech to Text API based on Whisper, an automatic speech recognition (ASR) system, built using Next 13, the React framework
A feature-rich Python-based Telegram bot for OpenAI API & Perplexity API
Unleash the power of AI with QueryWhisperer! Get instant answers to your questions about YouTube videos.
macOS menu bar app providing a local HTTP server compatible with the OpenAI Whisper API for fast and private audio transcription.
YouTube video summarization app built using open-source LLMs and frameworks such as Llama 2, Haystack, Whisper, and Streamlit. The app runs smoothly on CPU because the Llama 2 model is in GGUF format and loaded through Llama.cpp.
A Unity client library for the OpenAI Whisper transcription and translation API.
Live translation tool that uses OpenAI's Whisper model for real-time audio transcription and translation into the language of your choice, using your own (BYOK) OpenAI API key.
Drop-in replacement for OpenAI's Whisper API that exposes the same API but runs locally
A working Speech to Speech AI assistant that can interact with you, manage your system, and more!
A web app that lets users create AI prompts using voice input.
Discord bot that downloads and transcribes Twitter Space audio files
This repository provides a Flask app that processes voice messages recorded through Twilio or Twilio Studio, transcribes them using OpenAI's Whisper ASR, generates responses with GPT-3.5, and sends the replies as SMS using Twilio.
YASS.ai - Team Orange's entry to the Flow AI Hackathon 2023
A web UI for transcribing subtitles with OpenAI's Whisper API.
🎧 Submind is a modern PyQt6 app for generating subtitles (SRT) using Whisper AI. Supports drag & drop, batch processing, and auto translation in a sleek UI.
Simple User Interface: Enter text and generate speech with a single click.
Code in VS Code, using your voice, fmedia, WhisperAI and ChatGPT
AI-powered math chatbot with Gemini, Whisper, file upload, and voice input
A simple UI tool, written in Python, for recording audio from a microphone and automatically transcribing the recording using OpenAI's Whisper model via OpenAI's API.
Swift Package wrapping WhisperCore and whisper.xcframeworks for on-device speech-to-text transcription on iOS.
Speech2Text