Repos
101
Stars
12.3k
Forks
865
Top Language
Python
Loading contributions...
Top Repositories
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
Understanding R1-Zero-Like Training: A Critical Perspective
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Repositories
101Understanding R1-Zero-Like Training: A Critical Perspective
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".
[ArXiv 2025] Imperceptible Jailbreaking against Large Language Models
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
[CVPR 2026] TeamHOI: Learning a Unified Policy for Cooperative Human-Object Interactions with Any Team Size
Zero Bubble Pipeline Parallelism
[ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)
[TMLR 2025] On Memorization in Diffusion Models
[NeurIPS 2023] Efficient Diffusion Policy
Rethinking the Trust Region in LLM Reinforcement Learning
Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)
No description provided.
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
[NeurIPS 2024] The official implementation of paper: Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs.
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
MetaFormer Baselines for Vision (TPAMI 2024)
PoolFormer: MetaFormer Is Actually What You Need for Vision (CVPR 2022 Oral)
[CVPR 2025] TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing
No description provided.
🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
Code for "Variational Reasoning for Language Models"
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
Reinforcing General Reasoning without Verifiers
No description provided.
The official PyTorch implementation of Fast Diffusion Model