Ronghang Hu

ronghanghu

multimodal @ xAI

xAI

Palo Alto, CA

http://ronghanghu.com/

Organizations

Languages

Python52%Jupyter Notebook19%C++19%Cuda5%SourcePawn5%

Loading contributions...

Top Repositories

seg_every_thing

Code release for Hu et al., Learning to Segment Every Thing. in CVPR, 2018.

Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017

tensorflow_compact_bilinear_pooling

Compact Bilinear Pooling in TensorFlow

speaker_follower

Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.

natural-language-object-retrieval

Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016

112Jupyter Notebook

Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019

Repositories

136

ronghanghu/text_objseg

Code release for Hu et al. Segmentation from Natural Language Expressions. in ECCV, 2016

Jupyter Notebook8431Updated 8 years ago

ronghanghu/speaker_follower

Code release for Fried et al., Speaker-Follower Models for Vision-and-Language Navigation. in NeurIPS, 2018.

C++13731Updated 3 years ago

ronghanghu/notebooklm-pyFork

Unofficial Python API for Google NotebookLM

00Updated 6 days ago

ronghanghu/verlFork

verl: Volcano Engine Reinforcement Learning for LLMs

00Updated 5 months ago

ronghanghu/cc_torch

No description provided.

Jupyter Notebook42Updated 5 months ago

ronghanghu/torch_generic_nms

No description provided.

Cuda31Updated 5 months ago

ronghanghu/flash-linear-attentionFork

🚀 Efficient implementations of state-of-the-art linear attention models

00Updated 3 months ago

ronghanghu/lmms-evalFork

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

00Updated 3 months ago

ronghanghu/seg_every_thing

Code release for Hu et al., Learning to Segment Every Thing. in CVPR, 2018.

Python42372Updated 7 years ago

ronghanghu/n2nmn

Code release for Hu et al. Learning to Reason: End-to-End Module Networks for Visual Question Answering. in ICCV, 2017

SourcePawn27257Updated 5 years ago

ronghanghu/segment-anythingFork

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

20Updated 2 years ago

ronghanghu/Rex-OmniFork

Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)

10Updated 5 months ago

ronghanghu/caffeFork

Caffe: a fast open framework for deep learning.

C++86Updated 9 years ago

ronghanghu/tensorflow_compact_bilinear_pooling

Compact Bilinear Pooling in TensorFlow

Python14145Updated 6 years ago

ronghanghu/vit_10b_fsdp_example

See details in https://github.com/pytorch/xla/blob/r1.12/torch_xla/distributed/fsdp/README.md

Python256Updated 3 years ago

ronghanghu/serveFork

Serve, optimize and scale PyTorch models in production

00Updated 7 months ago

ronghanghu/perception_modelsFork

State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!

00Updated 10 months ago

ronghanghu/sam2Fork

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook10Updated 1 year ago

ronghanghu/TrackEvalFork

HOTA (and other) evaluation metrics for Multi-Object Tracking (MOT).

Python10Updated 1 year ago

ronghanghu/lcgn

Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019

Python9218Updated 6 years ago

Code release for Hu et al. Modeling Relationships in Referential Expressions with Compositional Modular Networks. in CVPR, 2017

Python6718Updated 7 years ago

ronghanghu/detectron2_vitdet

No description provided.

Python30Updated 2 years ago

ronghanghu/natural-language-object-retrieval

Code release for Hu et al. Natural Language Object Retrieval, in CVPR, 2016

Jupyter Notebook11253Updated 9 years ago

ronghanghu/snmn

Code release for Hu et al., Explainable Neural Computation via Stack Neural Module Networks. in ECCV, 2018

Python716Updated 6 years ago

ronghanghu/vqa-maskrcnn-benchmark-m4c

Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_feature.py

Python132Updated 6 years ago

ronghanghu/SanguoshaEX

Sanguosha EX: An Open Source PC Game Based on Popular Desktop Game "Sanguosha"

C++52Updated 11 years ago

ronghanghu/tpu_profiling

Profiling analyses and comparisons between PyTorch/XLA and JAX

Python10Updated 3 years ago

ronghanghu/xlaFork

Enabling PyTorch on Google TPU

C++21Updated 3 years ago

ronghanghu/gqa_single_hop_baseline

A simple but well-performing "single-hop" visual attention model for the GQA dataset

Python201Updated 6 years ago

ronghanghu/ptxla_scaling_examples

A list of examples for model scaling in PyTorch/XLA

20Updated 3 years ago

Gists

Recent Activity