Top Repositories
Ideas and thoughts about the fascinating Vision-and-Language Navigation
Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)
Repositories
14Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigation
Ideas and thoughts about the fascinating Vision-and-Language Navigation
Code and Data of the CVPR 2022 paper: Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation
Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)
Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation
🎓 Hugo Academic Theme 创建一个学术网站. Easily create a beautiful academic résumé or educational website using Hugo, GitHub, and Netlify.
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
The DB Group Website
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.
A graph plotter that works with any pdf containing one (or more) graphs :chart_with_upwards_trend:
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments