1,882 results for “topic:transcription”
Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.
AI wearables. Put it on, speak, transcribe, automatically
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
No description provided.
Transcribe any audio to text, translate and edit subtitles 100% locally with a web UI. Powered by whisper models!
Self-hosted AI audio transcription
🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI
Instant, controllable, local pre-trained AI models in Rust
A python package to build AI-powered real-time audio applications
an editor for spoken-word audio with automatic transcription
Cutting edge AI technology for automated audio transcription. A nice GUI for OpenAIs Whisper and pyannote (speaker identification)
Simple GUI for ByteDance's Piano Transcription with Pedals
OBS plugin for local speech recognition and captioning using AI
「硬地骇客 - 两个月 $12000 ARR 实践之路」是由 硬地骇客 团队编著,本书是关于 Podwise 产品历程的忠实记录:内容包含 灵感 - 构建 - 发布 - 增长 - 复盘 五个章节。如果你觉得一个人读不够过瘾,欢迎加入「硬地骇客」官方知识星球与专家们一起讨论!Podwise 的故事才刚刚开始,我们也将在星球持续分享我们的认知,成功可能无法复制,但失败一定可以借鉴。现在就点击下方链接加入吧!
VOICE → WORDS
视频音频生成字幕,生成srt文件。无需申请第三方API,本地实现音频转文本。基于Transformer的视频字幕生成框架。A GUI tool for generating subtitle from videos and generating srt files.
The open-source iOS app that's making quality voice transcription more accessible on mobile devices.
🎙️ AI Dictation App - Open Source and Local-first ⚡ Type 3x faster, no keyboard needed. 🆓 Powered by open source models, works offline, fast and accurate.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
turnkey self-hosted offline transcription and diarization service with llm summary
Generate subtitles, summaries, and chapters from videos in seconds
Optimized Whisper models for streaming and on-device use
Generate transcripts for audio and video content with a user friendly UI, powered by Open AI's Whisper with automatic translations and download videos automatically with yt-dlp integration
🎤 The easiest way to transcribe audio in Swift
A command-line application to convert images, PDFs, and audio files to text using Apple's APIs
On-device streaming speech-to-text engine powered by deep learning
Easily take an entire YouTube playlist and turn it into high quality transcripts using Whisper.
Whisper-Flow is a framework designed to enable real-time transcription of audio content using OpenAI’s Whisper model. Rather than processing entire files after upload (“batch mode”), Whisper-Flow accepts a continuous stream of audio chunks and produces incremental transcripts immediately.
Command line interface for the built-in speech recognition and transcription capabilities in macOS.