Repos
46
Stars
45
Forks
4
Top Language
Python
Loading contributions...
Top Repositories
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.
2022WHU计算机系统综合设计 基于RISCV的五级流水线CPU Five stage CPU implement based on RISC-V
武汉大学抬头信纸
WHU 计算机网络课程设计大作业
The PyTorch implementation of Multi-head Latent Attention.
An unofficial implementation of STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-supervised Learning Models.
Repositories
462022WHU计算机系统综合设计 基于RISCV的五级流水线CPU Five stage CPU implement based on RISC-V
An implement of STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency of Zhong-Qiu Wang et al.
武汉大学抬头信纸
No description provided.
Open-Source Frontier Voice AI
No description provided.
A high-throughput and memory-efficient inference and serving engine for LLMs
Code for GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training
WHU 计算机网络课程设计大作业
A framework for efficient model inference with omni-modality models
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
Distributed YouTube Audio Downloading
The PyTorch implementation of Multi-head Latent Attention.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
An unofficial implementation of STaR: Distilling Speech Temporal Relation for Lightweight Speech Self-supervised Learning Models.
Structured CoT and Step-wise Reinforcement Learning for Multimodal Geometry Problem Solving
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
MM-EUREKA: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning
Tensors and Dynamic neural networks in Python with strong GPU acceleration
No description provided.
A jounery to real multimodel R1 ! We are doing on large-scale experiment
A fork to add multimodal model training to open-r1
一个用python写的编译器
No description provided.
No description provided.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
No description provided.
Self-Supervised Speech Pre-training and Representation Learning Toolkit