203 results for “topic:audio-transcription”
Cutting-edge AI technology for automated audio transcription. A nice GUI for OpenAI's Whisper and pyannote (speaker identification).
Pybind11 bindings for Whisper.cpp
The main repo for Stage Whisper — a free, secure, and easy-to-use transcription app for journalists, powered by OpenAI's Whisper automatic speech recognition (ASR) machine learning models.
A static site demonstrating real-time audio transcription via Amazon Transcribe over a WebSocket.
Record audio or transcribe files using CTranslate2 and Whisper!
WhisperClip simplifies your life by automatically transcribing audio recordings and saving the text directly to your clipboard. With just a click of a button, you can effortlessly convert spoken words into written text, ready to be pasted wherever you need it. This application harnesses the power of OpenAI’s Whisper for free.
Free speech to text
AI-powered transcription for audio & video with Whisper — self-hosted, fast, and open-source.
Efficient LLM inference on Slurm clusters.
Transcribe any audio or video file. Edit and view your transcripts in a standalone HTML editor.
Uses the powerful WhisperS2T and CTranslate2 libraries to batch transcribe multiple files.
GUI for Faster‑Whisper‑XXL transcription tool: download YouTube audio, transcribe local files, manage models, and export multiple formats with themes and auto yt‑dlp updates.
Transcription and annotation interface for recorded audio or video files
Streamlit Audio Transcription with OpenAI's Whisper: an interactive Streamlit app demonstrating real-time audio transcription using OpenAI's Whisper.
Podcast/YouTube video → Transcript!
Generate subtitles for long movies / podcasts with OpenAI Whisper API.
Dockerized Whisper C++ speech-to-text API for easy deployment and rapid integration. Offering the latest stable and nightly builds for efficient audio transcription.
Speakscribe is a web application that lets users transcribe audio using OpenAI and interact with a chatbot. The web application is written in Python using NiceGUI.
Audio Transcription using whisper.cpp
The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.
OpenSceneSense is a Python library that harnesses AI for advanced video analysis, offering customizable frame and audio insights for dynamic applications in media, education, and content moderation.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
Free software transcription and description application for local and online multimedia content. Moved to Codeberg
SILK codec bindings for Node.js
Generate text captions for audio files and YouTube videos using OpenAI Whisper on Google Colab. Supports multiple languages.
AI Quran transcription with accurate Ayah and Surah matching and audio timestamps
High-performance Google Colab Notebook for fast & accurate audio transcription/translation using OpenAI Whisper. Accelerated on TPUs with PyTorch/XLA. Features an interactive UI for model selection, multi-language support, and long-form audio processing.
Ear training game using machine learning models in the browser
A portal that offers a transcription chain for uploading and processing multiple audio files using ASR, OCTRA, MAUS and EMU-webApp.
🎬 A tool with a UI that transcribes audio files into subtitles in SRT format using OpenAI's Whisper and runs completely on your local machine.