225 results for “topic:voice-to-text”
Transcribe and translate voice into LRC file using Whisper and LLMs (GPT, Claude, et,al). 使用whisper和LLM(GPT,Claude等)来转录、翻译你的音频为字幕文件。
Voice-to-text with push-to-talk for Wayland compositors
On-device speech-to-text engine powered by deep learning
A dynamic, scalable AI chatbot built with Django REST framework, supporting custom training from PDFs, documents, websites, and YouTube videos. Leveraging OpenAI's GPT-3.5, Pinecone, FAISS, and Celery for seamless integration and performance.
VOXD is a speech-to-text, voice-typing, dictation software for linux distributions. It is an open-source, free of charge, USER-FRIENDLY software, for as many linux distros as possible.
:iphone: :runner: :apple: Fitness application that’s used to keep track of your physical fitness data, daily calorie count, invite friends to work out together and ultimately get healthy.
Voice to text, one key to input.
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
Privacy-First Voice-to-Text with AI Enhancement for macOS
Chrome Web Speech API
Voice-to-text CLI for terminal users
GUI for Faster‑Whisper‑XXL transcription tool: download YouTube audio, transcribe local files, manage models, and export multiple formats with themes and auto yt‑dlp updates.
Codo-File is a code editor that primarily supports JavaScript and Python, with partial Dart support. Additionally, it features a real-time website editor where you can create your own website in the browser using HTML, CSS, and JavaScript. The project also includes an image-to-text feature and a voice-to-text feature .
This package can be used to connect Telegram bot to AI engines such as OpenAI ChatGPT, Dall-E, Midjourney, Stable Diffusion, etc.
一个简洁且优秀的描述是:这是一款在任何网页上实现无缝语音转文字的 Chrome 扩展,使用先进的 ASR API。
Kotlin Multiplatform Mobile Translator App
Free ChatGPT voice interaction and integration into python workflows.
macOS voice productivity app — built-in dictation, AI rewrite, and translation. Powered by local Whisper + LLM.
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files
A simple iOS App that can convert speech/voice into text. Only English voice is supported for now. Used Swift 5, AVKit and Speech.
Telegram bot with ASR
ANDROID APP to AUTO GENERATE SUBTITLE FILE and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any audio/video files using 2 ACTIVITIES
Use ChatGPT in your own voice to place a phone call on your behalf, just by prompting it.
A high-performance C++ application that captures Chrome Live Caption text in real time for accessibility, transcripts, and AI-driven analysis. Designed for job interviews, content creators, and language learners.
Welcome to LLM-Utility-Cookbook! Here you'll find tools to make LLMs easy: voice to text, text to voice, document scan to text, prompt management, and more. Jump in, make your work easier
Your privacy-first voice-to-text tool. Local Whisper transcription with optional LLM enhancement so your audio never leaves your computer.
Customizable Web Component that adds speech-to-text dictation capabilities to site text fields
It is a voice bot based on LLM.
Push-to-talk voice dictation for macOS. 100% local, free, open source. Apple Silicon MLX. No cloud, no subscription.
Privacy-first voice-to-text for macOS and Windows. Local Whisper (Metal/CUDA) or Groq cloud, with LLM post-processing. Built with Rust + Tauri 2.