"topic:active-speaker-detection" — Search

5 results for “topic:active-speaker-detection”

TaoRuijie/TalkNet-ASD

ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'

Python455100Updated 3 hours ago

active-speaker-detectionaudio-visualawesome-asdmultimedia

SRA2/SPELL

Learning Long-Term Spatial-Temporal Graphs for Active Speaker Detection (ECCV 2022)

Python689Updated 3 months ago

active-speaker-detectionava-dataseteccveccv2022graph-learningpytorch-geometric

joactr/AnnoTheia

AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexibility to replace modules for different languages.

Python271Updated 2 months ago

active-speaker-detectiondata-annotationfine-tuninglanguagesspeech-technologies

Overcautious/ADENet

Accepted by TMM 2022

Python192Updated 1 month ago

active-speaker-detectionaudio-visualmultimodelspeech-enhancement

will-rice/pe-av-syncnet

SyncNet based on Meta's Perception Encoder Audio-Visual (PE-AV)

Python01Updated 1 month ago

active-speaker-detectionaudio-visualaudio-visual-synccomputer-visiondeep-learningdeepfake-detectionliplip-synclip-sync-detectionlipsyncmultimodal-learningpytorchrepresentation-learningspeech-processingsyncnettime-delay-estimationtime-synchronizationvideo-processing