Shoufa Chen
ShoufaChen
Ph.D. student, The University of Hong Kong
Languages
Loading contributions...
Top Repositories
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
Pixel-Space Generative Models
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
Repositories
42Pixel-Space Generative Models
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
https://www.shoufachen.com/Awesome-Diffusion-Transformers/
[ICLR'22 Oral] Implementation of "CycleMLP: A MLP-like Architecture for Dense Prediction"
A PyTorch native platform for training generative AI models
No description provided.
[ICCV'21] Implementation of "Watch Only Once: An End-to-End Video Action Detection Framework"
PyTorch tutorials.
clone/download codes from https://anonymous.4open.science/
Marrying Grounding DINO with Segment Anything & Stable Diffusion & BLIP - Automatically Detect , Segment and Generate Anything with Image and Text Inputs
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
The Mini Sora project aims to explore the implementation path and future development direction of Sora.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask
⚡ Building applications with LLMs through composability ⚡
Language Quantized AutoEncoders
End-to-End Object Detection with Transformers
A curated list of prompt-based paper in computer vision and vision-language learning.
Many Class Activation Map methods implemented in Pytorch for CNNs and Vision Transformers. Examples for classification, object detection, segmentation, embedding networks and more. Including Grad-CAM, Grad-CAM++, Score-CAM, Ablation-CAM and XGrad-CAM
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
Face Recognition Using Python and MySQL
Implementation of DiffDock: Diffusion Steps, Twists, and Turns for Molecular Docking
A powerful and flexible machine learning platform for drug discovery
:globe_with_meridians: Jekyll is a blog-aware static site generator in Ruby
Multi-GPU CUDA stress test
Waymo Open Dataset
No description provided.