"topic:paraformer" — Search

25 results for “topic:paraformer”

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python · 15.2k · 1.6k · Updated just now

audio-visual-speech-recognition, conformer, dfsmn, paraformer, pretrained-model, punctuation, pytorch, rnnt, speaker-diarization, speech-recognition, speechgpt, speechllm, vad, voice-activity-detection, whisper

RapidAI/RapidASR

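📣 A commercial-grade open-source automatic speech recognition library: ready to use out of the box, supported on all platforms, with mixed Chinese-English recognition. A cross-platform implementation of ASR inference, based on ONNXRuntime and FunASR. We provide a set of easier APIs to call ASR models.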

C++ · 601 · 70 · Updated 1 day ago

asr, paddlespeech, paraformer, wenet

manyeyes/ManySpeech

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio Denoising, and Enhancement, Support models such as paraformer, sensevoice, fireredasr, zipformer, moonshine, wenet, whisper, fsmn-vad, silero-vad, CT Transformer punc, Spleeter, Uvr5, etc, apply ONNX models in various scenarios.

C# · 71 · 12 · Updated 7 hours ago

asr, ct-transformer-punc, fireredasr, fsmn-vad, maui, moonshine, onnx, onnxruntime, paraformer, sensevoice, silero-vad, speech-recognition, speech-to-text, spleeter, uvr5, wenet, whisper, zipformer

GetcharZp/go-speech

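go-speech is a lightweight speech library built on Golang + ONNX that supports TTS (text-to-speech) and ASR (speech-to-text). It integrates MeloTTS, Piper, DAMO Academy's Paraformer-architecture models, and Whisper models.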

Go · 46 · 7 · Updated 1 week ago

asr, golang, melotts, paraformer, piper-tts, tts, whisper

yuekaizhang/minutes

Podcast Summarizer with LLM Technology

Python · 30 · 6 · Updated 1 month ago

chatglm, langchain, llm, paraformer, whisper

TeaPoly/CE-OptimizedLoss

Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Pooling Loss.

Python · 24 · 6 · Updated 2 months ago

asr, mwer, paraformer, pytorch, smp, speech-recognition

lukeewin/FunASR_API

This is a speaker-diarization-based speech recognition API implemented using FunASR.

HTML · 23 · 7 · Updated 1 week ago

asr, funasr, paraformer

lukeewin/desktop_subtitle

Realtime ASR for Desktop Subtitle

Java · 23 · 2 · Updated 3 days ago

funasr, paraformer, realtime-asr

aidayang/FunASR-OneClick

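A real-time speech recognition build of FunASR that recognizes both microphone input and audio played on the computer; voice-typing software for PCs.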

13 · 1 · Updated 3 months ago

audio-visual-speech-recognition, conformer, dfsmn, funasr, paraformer, pretrained-models, punctuation, pytorch, rnnt, speaker-diarization, speech-recognition, speechgpt, speechllm, vad, voice-activity-detection, whisper

Garcke/QW-InterviewAssitsant

A real-time voice AI interview assistant based on qwen-max-latest (LLM) + paraformer-realtime-v2 (ASR).

JavaScript · 12 · 2 · Updated 3 days ago

fastapi, javascript, paraformer, python, qwen, websocket

lissettecarlr/AutomaticSpeechRecognition

Various Python wrappers for speech-to-text (paraformer, whisper_online, whisper_offline, funasr), built to serve the kuon repository.

Python · 8 · 0 · Updated 3 months ago

ai, asr, audio, audio-processing, deepl, paraformer, python, speech-to-text, text, whisper

XDcobra/react-native-sherpa-onnx

React Native TurboModule for Sherpa-ONNX offline on-device Speech Processing (STT/TTS/Diarization/VAD) completely offline on the device. Support for Android & iOS

TypeScript · 6 · 1 · Updated 1 day ago

android, ctc, data-privacy, diarization, ios, onnx, paraformer, react-native, sherpa-onnx, source-separation, speech-enhancement, speech-to-text, sst, text-to-speech, tts, turbomodule, vad, voice-activity-detection, whisper, zipformer

luke-lin-vmc/paraformer-zh-ovep-python-static

Python pipeline to run Paraformer ASR on Intel CPU/GPU/NPU thru ONNX Runtime + OpenVINO Execution Provider

Python · 3 · 2 · Updated 1 week ago

onnxruntime, openvino, paraformer

XDcobra/react-native-sherpa-onnx-stt

Offline Speech-to-Text for React Native using sherpa-onnx Supports Zipformer, Paraformer, NeMo CTC, Whisper & more.

C · 2 · 0 · Updated 2 weeks ago

android, ctc, ios, mobile, offline, onnx, paraformer, parakeet, react-native, sensevoice, sherpa-onnx, speech-to-text, stt, wenetspeech, whisper, whisper-cpp, zipformer

ljyou001/echotype

Real-time voice-to-text transcription with support for multiple AI models and integration with external AI services

C# · 1 · 0 · Updated 3 weeks ago

clawbot, funasr, offline, openclaw, paraformer, productivity, python, qwen3, speech-to-text, transformer, voice-input, voice-recognition, windows

YannJY02/AutoTranscribe

🎙 Automated offline video transcription for macOS — FunASR + speaker diarization + language detection (zh/en/mixed). Zero cloud costs, 100% local.

Swift · 1 · 0 · Updated 1 week ago

asr, automation, chinese, english, funasr, launchagent, macos, offline, paraformer, sensevoice, speaker-diarization, speech-recognition, speech-to-text, transcription, video-transcription

starsdaisuki/StarSummary

One-click video/audio transcription + AI summarization | CLI / Web UI / Telegram Bot, three in one

Python · 1 · 0 · Updated 1 week ago

asr, gradio, paraformer, python, speech-to-text, telegram-bot, transcription, yt-dlp

moziarnj07-sys/doubaoime-asr

🎤 Enable voice recognition for the Doubao input method using Python; ideal for learning and research with a focus on audio processing.

Python · 1 · 0 · Updated 1 hour ago

asrt, audio-visual-speech-recognition, chinese-speech-recognition, cnn, ctc, dfsmn, keras, paraformer, pretrained-model, punctuation, python, pytorch, speaker-diarization, speech-recognition, speechgpt, speechllm, tensorflow, vad, voice-activity-detection, whisper

sanamid/Fun-ASR

No description provided.

Python · 0 · 0 · Updated 2 hours ago

asyncio, audio, audio-language-model, audio-visual-speech-recognition, funasr-client, multimodal-large-language-models, paraformer, pretrained-models, punctuation, python, speaker-diarization, speech-recognition, speechllm, voice-activity-detection, websocket, whisper

kalab12321/realtime-subtitle

🎙️ Enable real-time speech-to-text and translation on macOS with multiple ASR backends and smooth overlay interface for seamless communication.

Python · 0 · 0 · Updated 2 hours ago

chrome, funasr, obs, obs-studio, paraformer, private-work, realtime, realtime-asr, stt, subtitles, translator

goodguy11320-web/AutoTranscribe

🎙 Automate offline, local transcription of audio/video on macOS with speaker diarization and no cloud costs.

Python · 0 · 0 · Updated 1 hour ago

asr, automation, chinese, english, funasr, launchagent, macos, offline, paraformer, sensevoice, speaker-diarization, speech-recognition, speech-to-text, transcription, video-transcription

Frida7771/GoSpeech

speech processing tool using Go

Go · 0 · 0 · Updated 1 month ago

asr, go, golang, melotts, onnxruntime, paraformer, tts

bseceenn/Fun-CosyVoice3-0.5B-2512-Deploy

🎤 Deploy a simplified voice synthesis service with Fun-CosyVoice3-0.5B-2512, featuring real-time audio output and advanced performance optimizations.

Python · 0 · 0 · Updated 1 hour ago

algorithm-engineering, audio-visual-speech-recognition, chinese, conformer, deep-learning, fine-grained, gpt-4o, japanese, machine-learning, paraformer, pretrained-model, recommender-system, rnnt, speechgpt, text-to-speech, tianchi-competition, vad, voice-activity-detection, whisper

12alz/fun-with-clip-path

🎨 Explore clip-path techniques in HTML and CSS to create interactive menus and dynamic shapes without JavaScript for responsive design.

CSS · 0 · 0 · Updated 1 hour ago

airtable, algorithm-engineering, audio-visual-speech-recognition, claude-code, data-engineering, data-quality, dfsmn, hiring-without-whiteboards, mlops, paraformer, recommendation-algorithms, rnnt, speaker-diarization, speech-recognition, speechgpt, speechllm, tech, utility

lancetodjk14/react-native-sherpa-onnx-stt

🎤 Enable offline speech recognition in React Native using sherpa-onnx, supporting various model architectures for reliable performance.

C · 0 · 0 · Updated 1 hour ago

ctc, mobile, offline, paraformer, parakeet, react-native, source-separation, speech-enhancement, speech-to-text, sst, text-to-speech, tts, vad, voice-activity-detection, zipformer