Yuchen Hu

YUCHEN005

Ph.D. student at NTU; research focuses on speech, LLMs, and multimodal learning.

Nanyang Technological University
Singapore

Languages

Python 86%, SCSS 7%, HTML 7%

Top Repositories

Repositories: 18

YUCHEN005/UNA-GAN

Code for paper "Unsupervised Noise adaptation using Data Simulation"

Python · 140 · Updated 1 week ago

YUCHEN005/STAR-Adapt

Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"

Python · 2423 · Updated 3 weeks ago

YUCHEN005/RobustGER

Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"

Python · 1395 · Updated 1 month ago

YUCHEN005/GenTranslate

Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"

Python · 1989 · Updated 2 months ago

YUCHEN005/UniVPM

Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"

Python · 281 · Updated 3 months ago

YUCHEN005/NASE

Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"

Python · 883 · Updated 3 months ago

YUCHEN005/DPSL-ASR

Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"

Python · 434 · Updated 4 months ago

YUCHEN005/Gradient-Remedy

Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"

Python · 201 · Updated 4 months ago

YUCHEN005/Unified-Enhance-Separation

Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"

Python · 447 · Updated 4 months ago

YUCHEN005/RATS-Channel-A-Speech-Data

A public repository for RATS Channel-A Speech Data, a paid noisy speech dataset distributed by LDC. It releases the dataset's Log-Mel Fbank features and several raw waveform listening samples.

160 · Updated 4 months ago

YUCHEN005/yuchen005.github.io (Fork)

AcadHomepage: A Modern and Responsive Academic Personal Homepage

SCSS · 00 · Updated 10 months ago

YUCHEN005/TTS_finetune

No description provided.

Python · 33 · Updated 11 months ago

YUCHEN005/UNO-TTS-demos

No description provided.

40 · Updated 1 year ago

YUCHEN005/RIO-TTS-demos

No description provided.

40 · Updated 1 year ago

YUCHEN005/MIR-GAN

Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition"

Python · 161 · Updated 1 year ago

YUCHEN005/GILA

Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"

Python · 190 · Updated 1 year ago

YUCHEN005/Hypo2Trans (Fork)

Single-blind supplementary materials for NeurIPS 2023 submission

00 · Updated 1 year ago

YUCHEN005/UNA-GAN-Demo

No description provided.

HTML · 20 · Updated 2 years ago
