"topic:speaker-identification" — Search

158 results for “topic:speaker-identification”

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Jupyter Notebook14.4k1.7kUpdated 6 hours ago

androidasrdeep-learningdeep-neural-networksdeepspeechgoogle-speech-to-textioskaldiofflineprivacypythonraspberry-pispeaker-identificationspeaker-verificationspeech-recognitionspeech-to-textspeech-to-text-androidsttvoice-recognitionvosk

FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

Swift1.6k203Updated 5 hours ago

aneasraudioautomatic-speech-recognitionavfoundationcoremliosmacosnvidiaparakeetreal-timespeaker-diarizationspeaker-embeddingspeaker-identificationspeaker-recognitionspeech-to-textswiftvadvoice-activity-detection

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Python1.2k270Updated 6 hours ago

artificial-intelligenceasraudioaudio-processingcnnconvolutional-neural-networksdeep-learningdigital-signal-processingfilteringneural-networkspythonpytorchsignal-processingspeaker-identificationspeaker-recognitionspeaker-verificationspeech-processingspeech-recognitiontimitwaveform

HarryVolek/PyTorch_Speaker_Verification

PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.

Python597164Updated 1 week ago

pytorchspeaker-identificationspeaker-verification

google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

Python44238Updated 2 days ago

source-separationspeaker-diarizationspeaker-identificationspeaker-recognitionspeaker-verification

speechbrain/speechbrain.github.io

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

HTML37431Updated 1 week ago

beamformingdeep-learningdeeplearninglibrispeechneural-networkneural-networksspeaker-identificationspeaker-recognitionspeaker-verificationspeechspeech-analysisspeech-apispeech-emotion-recognitionspeech-processingspeech-recognitionspeech-recognizerspeech-separationspeech-to-textspeechrecognitiontimit

Atul-Anand-Jha/Speaker-Identification-Python

Speaker Identification System (upto 100% accuracy); built using Python 2.7 and python_speech_features library

Python21373Updated 6 days ago

python-2speaker-identificationspeaker-recognition

jymsuper/SpeakerRecognition_tutorial

Simple d-vector based Speaker Recognition (verification and identification) using Pytorch

Python21246Updated 4 months ago

deep-learningpytorchspeaker-identificationspeaker-recognitionspeaker-verification

altunenes/parakeet-rs

very fast speech-to-text, diarization, streaming (even in CPU) with NVIDIA Parakeet in Rust

Rust20728Updated 3 hours ago

asrautomatic-speech-recognitiononnxparakeetspeaker-diarizationspeaker-identificationspeechspeech-recognitionspeech-to-text

Speaker-Identification/You-Only-Speak-Once

Deep Learning - one shot learning for speaker recognition using Filter Banks

Jupyter Notebook17141Updated 3 weeks ago

audiodeep-learningdeep-speakerneural-networkone-shot-learningsiamese-networksspeaker-identificationspeaker-recognitionspeechtriplet-lossvoice-authentication

oscarknagg/voicemap

Identifying people from small audio fragments

Python17171Updated 2 months ago

convolutional-neural-networksmachine-learningspeaker-identificationspeaker-recognition

kaistmm/Audio-Mamba-AuM

Official Implementation of the work "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning"

Python16720Updated 1 month ago

audioaudio-classificationaudio-mambadeep-learningmambapytorchrepresentation-learningspeaker-identificationspeech-classificationstate-space-model

Warma10032/easytts

打造最简单的TTS前端集合，最简单的有声小说制作工作流。基于正则规则对小说进行分句，基于RoBERTa对小说中的对话进行说话人识别，从而实现一键式生成多人有声小说。多说话人的语音合成，高质量的有声小说制作。

Python14728Updated 5 days ago

aiaudio-generationnlppyqtspeaker-identificationtts

jefflai108/pytorch-kaldi-neural-speaker-embeddings

A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.

Perl13634Updated 1 year ago

kaldilearnable-dictionary-encodingpytorchspeaker-identificationspeaker-recognitionspeaker-verificationspeech-processing

SiavashShams/ssamba

[SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model

Python13412Updated 1 week ago

audioaudio-classificationdeep-learningemotion-recognitionkeyword-spottingmambarepresentation-learningself-supervised-learningspeaker-identificationstate-space-model

Anwarvic/Speaker-Recognition

This repo contains my attempt to create a Speaker Recognition and Verification system using SideKit-1.3.1

Python11433Updated 4 months ago

gmmgmm-ubmi-vectoridentity-vectoridentity-verificationsidekitspeaker-identificationspeaker-recognitionspeaker-verificationubm

Appen/UHV-OTS-SpeechArchived

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Forth10620Updated 1 month ago

accent-detectionaudio-segmentationgender-classificationspeaker-diarizationspeaker-identificationspeech-annotationspeech-processingspeech-recognitionspeech-seperationspeech-transcriptionsynthetic-speech-detectiontopic-detection

FAKEBOB-adversarial-attack/FAKEBOB

Source code for paper "Who is real Bob? Adversarial Attacks on Speaker Recognition Systems" (IEEE S&P 2021)

Python10528Updated 4 months ago

adversarial-attacksclose-set-speaker-identificationgmm-ubmivectorivector-pldaopen-set-speaker-identificationspeaker-identificationspeaker-recognition-systemsspeaker-verification

funcwj/ge2e-speaker-verification

Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"

Python10325Updated 8 months ago

pytorchspeaker-identificationspeaker-verification

cvqluu/GE2E-Loss

Pytorch implementation of Generalized End-to-End Loss for speaker verification

Python8816Updated 1 month ago

d-vectorsge2epytorchspeaker-diarizationspeaker-identificationspeaker-recognitionspeaker-verification

nezhar/speech-condenser

A tool for summarizing dialogues from videos or audio

Python8410Updated 4 weeks ago

asrspeach-recognitionspeaker-diarizationspeaker-identificationsummarization

cyrta/voxceleb

mirror of VoxCeleb dataset - a large-scale speaker identification dataset

Shell7420Updated 2 months ago

corpusdatasetspeakerspeaker-identificationspeaker-recognitionspeaker-verificationspeech

Wadaboa/titanet

Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO

Jupyter Notebook6813Updated 1 month ago

d-vectorsml4cvnvidiaspeaker-embeddingsspeaker-identificationspeaker-recognitionspeaker-verificationtitanetunibo

mjpyeon/wavenet-classifier

Keras Implementation of Deepmind's WaveNet for Supervised Learning Tasks

Python6412Updated 1 year ago

deep-learningdeep-neural-networksdeepmindspeaker-identificationspeaker-recognitionspeaker-verificationspeech-analysisspeech-apispeech-emotion-recognitionsupervised-learningwavenet-keras

jingzhunxue/TargetDiarization

Multi-speaker separation, identification, diarization ALL-IN-ONE. It can isolate the target speaker from a conversation audio and do ASR.

Python639Updated 4 days ago

speaker-diarizationspeaker-identificationspeaker-separationspeech-to-textvoice-activity-detection

CouncilDataProject/speakerbox

Speakerbox: Fine-tune Audio Transformers for speaker identification.

Python596Updated 6 months ago

audio-classificationspeaker-idspeaker-identificationtransformers

mialrr/Speaker-Recognition

声纹识别(Voiceprint Recognition, VPR)，也称为说话人识别(Speaker Recognition)，有两类，即说话人辨认(Speaker Identification)和说话人确认(Speaker Verification)

Python5710Updated 6 months ago

pythonspeaker-identificationspeaker-recognitionspeaker-verificationvoiceprint-recognitionvpr

SuperKogito/Voice-based-speaker-identification

:sound: :boy: :girl: :woman: :man: Speaker identification using voice MFCCs and GMM

Python5415Updated 11 months ago

gaussian-mixture-modelsgmmmachine-learningmel-frequenciesmel-frequency-cepstral-coefficientsmfccscikit-learnscikit-learn-pythonsignalspeaker-identificationspeaker-recognitionspeechvocalvoice

KrishnaDN/Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding

Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch

Python4911Updated 2 months ago

attention-modelspeaker-identificationspeaker-recognitionspeech

jojojaeger/whisper-streamlit

this master thesis project is based on OpenAI Whisper with the goal to transcibe interviews

Python4819Updated 2 months ago

asrclinical-researchspeaker-identificationwhisper

Page 1 of 6