104 results for “topic:world-model”
Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model
Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.
Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related websites.
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
Build, evaluate and train General Multi-Agent Assistance with ease
Helios: Real Real-Time Long Video Generation Model
Fast and Universal 3D reconstruction model for versatile tasks
[NeurIPS 2024] A Generalizable World Model for Autonomous Driving
Official code of Motus: A Unified Latent Action World Model
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
The official code of Yume
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactive Video Generation"
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
No description provided.
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
[ICLR 2026] Astra : General Interactive World Model with Autoregressive Denoising"
A skill-based platform for ROS v.2 with knowledge representating, planning and reasoning
DeepVerse: 4D Autoregressive Video Generation as a World Model
Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"
[ICLR 2025 Oral] PyTorch code for the paper "Open-World Reinforcement Learning over Long Short-Term Imagination"
Reliable, minimal and scalable library for evaluating and conducting world model research