Yuchen Hu
YUCHEN005
Ph.D. student at NTU. Research focus: speech, LLMs, and multimodal learning.
Repositories
Code for paper "Unsupervised Noise Adaptation using Data Simulation"
Code for paper "Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models"
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"
Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"
Code for paper "Noise-aware Speech Enhancement using Diffusion Probabilistic Model"
Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"
Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
A public repository for the RATS Channel-A Speech Data, a noisy speech dataset available from LDC for a fee. Here we release its log-Mel filterbank (Fbank) features and several raw waveform listening samples.
AcadHomepage: A Modern and Responsive Academic Personal Homepage
Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition"
Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"
Single-blind supplementary materials for NeurIPS 2023 submission