Repos
40
Stars
0
Forks
0
Top Language
Python
Loading contributions...
Repositories
40Robust Speech Recognition via Large-Scale Weak Supervision
OpenAI Whisper demo on Axera
No description provided.
Reverse engineering the rk3588 npu
Efficient Inference of Transformer models
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
Ubuntu for Rockchip RK35XX Devices
Improved Rockchip Linux
No description provided.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
TF implementation of our CVPR 2021 paper: OSTeC: One-Shot Texture Completion
DECA: Detailed Expression Capture and Animation (SIGGRAPH 2021)
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
A generative speech model for daily dialogue.
Linux based GDDR6/GDDR6X VRAM temperature reader for NVIDIA RTX 3000/4000 series GPUs.
Get up and running with Llama 2, Mistral, and other large language models locally.
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
LCD screen ST7302 for arduino
python版本网易云音乐ncm文件格式转换
Vocal Remover using Deep Neural Networks
Ultimate repo for embedded devices
Promise based HTTP client for the browser and node.js
No description provided.
No description provided.
A native go client for HDFS
Java Client for TiKV
BusyBox mirror
Mirror of Apache Zeppelin