2,447 results for “topic:diffusion-models”
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
A collection of resources and papers on Diffusion Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Production ready toolkit to run AI locally
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
Official repository for LTX-Video
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
A curated list of recent diffusion models for video generation, editing, and various other applications.
Taming Stable Diffusion for Lip Sync!
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
The open source research environment for AI researchers to seamlessly train, evaluate, and scale models from local hardware to GPU clusters.
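"PyTorch Practical Tutorial" (2nd edition): whether you are starting from zero, applying PyTorch to CV, NLP, or LLM projects, or moving on to advanced engineering and deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become capable deep learning engineers.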
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
MAGI-1: Autoregressive Video Generation at Scale
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
Diffusion model papers, survey, and taxonomy
LTX-Video Support for ComfyUI
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
A unified inference and post-training framework for accelerated video generation.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
A general fine-tuning kit geared toward image/video/audio diffusion models.
Implementation of papers in 100 lines of code.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
[CSUR] A Survey on Video Diffusion Models
Lumina-T2X is a unified framework for Text to Any Modality Generation
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022