BohaoSu

Subohao

Postdoctoral Researcher @ CMU-LTI WAVLab | Ph.D. @ NTHU-EE

Pittsburgh, PA

Languages

Python 50% | HTML 25% | JavaScript 25%

Repos

23

Stars

3

Forks

1

Top Language

Python


Top Repositories

Zero_Shot_SER_LLM_synthetic

espnet

End-to-End Speech Processing Toolkit

versa

Versatile Evaluation of Speech and Audio

seamless_interaction

Foundation Models and Data for Human-Human and Human-AI interactions.

ICASSP26-Emotion-Reasoning

ICASSP26

Repositories

23

Subohao/espnet (Fork)

End-to-End Speech Processing Toolkit

Python | 0 stars | 0 forks | Updated 3 months ago

Subohao/Zero_Shot_SER_LLM_synthetic

No description provided.

Python | 3 stars | 1 fork | Updated 1 year ago

Subohao/versa (Fork)

Versatile Evaluation of Speech and Audio

Python | 0 stars | 0 forks | Updated 4 months ago

Subohao/seamless_interaction (Fork)

Foundation Models and Data for Human-Human and Human-AI interactions.

0 stars | 0 forks | Updated 7 months ago

Subohao/Subohao

No description provided.

0 stars | 0 forks | Updated 6 months ago

Subohao/ICASSP26-Emotion-Reasoning

ICASSP26

HTML | 0 stars | 0 forks | Updated 6 months ago

Subohao/SALMONN (Fork)

SALMONN family: A suite of advanced multi-modal LLMs

0 stars | 0 forks | Updated 8 months ago

Subohao/borrissu.github.io

Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

JavaScript | 0 stars | 0 forks | Updated 9 months ago

Subohao/audio-flamingo (Fork)

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

0 stars | 0 forks | Updated 7 months ago

Subohao/audiocraft (Fork)

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

0 stars | 0 forks | Updated 1 year ago

Subohao/pyannote-audio (Fork)

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

0 stars | 0 forks | Updated 8 months ago

Subohao/dia (Fork)

A TTS model capable of generating ultra-realistic dialogue in one pass.

0 stars | 0 forks | Updated 8 months ago

Subohao/AudioLDM (Fork)

AudioLDM: Generate speech, sound effects, music and beyond, with text.

0 stars | 0 forks | Updated 8 months ago

Subohao/tango (Fork)

A family of diffusion models for text-to-audio generation.

0 stars | 0 forks | Updated 1 year ago

Subohao/LLaMA-Factory (Fork)

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

0 stars | 0 forks | Updated 8 months ago

Subohao/shinjiwlab.github.io (Fork)

visiting scholar at CMU-LTI WAVLab

JavaScript | 0 stars | 0 forks | Updated 9 months ago

Subohao/olmes (Fork)

Reproducible, flexible LLM evaluations

0 stars | 0 forks | Updated 10 months ago

Subohao/Zero_Shot_SER_LLM_synthetic_audio-samples (Fork)

audio sample demo

HTML | 0 stars | 0 forks | Updated 1 year ago

Subohao/hands-on-codes (Fork)

No description provided.

0 stars | 0 forks | Updated 3 years ago

Subohao/dual-fisheye-video-stitching (Fork)

Dual fisheye video stitching

0 stars | 0 forks | Updated 7 years ago

Subohao/BIIC_Meeting

No description provided.

0 stars | 0 forks | Updated 7 years ago

Subohao/BIIClab

No description provided.

Python | 0 stars | 0 forks | Updated 9 years ago

Subohao/homework0 (Fork)

No description provided.

0 stars | 0 forks | Updated 9 years ago

