27 results for “topic:visual-dialog”
Starter code in PyTorch for the Visual Dialog challenge
PyTorch code for Learning Cooperative Visual Dialog Agents using Deep Reinforcement Learning
Code for CVPR'19 "Recursive Visual Attention in Visual Dialog"
Conversational AI Reading Materials
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
✨ Official PyTorch Implementation for EMNLP'19 Paper, "Dual Attention Networks for Visual Reference Resolution in Visual Dialog"
PyTorch code for Reasoning Visual Dialogs with Structural and Partial Observations
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?
Recent Advances in Visual Dialog
This repository contains the Pytorch implementation for our SCAI (EMNLP-2018) submission "A Knowledge-Grounded Multimodal Search-Based Conversational Agent"
Visual Dialog: Light-weight Transformer for Many Inputs (ECCV 2020)
Paper, dataset and code list for multimodal dialogue.
This framework provides out-of-the-box implementations of Referential Games variants in order to study the emergence of artificial languages using deep learning, relying on PyTorch (https://www.pytorch.org).
:speech_balloon: Official PyTorch Implementation for CVPR'23 Paper, "The Dialog Must Go On: Improving Visual Dialog via Generative Self-Training"
Visual Dialog
Community Regularization of Visually Grounded Dialog https://arxiv.org/abs/1808.04359
A curated publication list on visual dialog
🌈 PyTorch Implementation for EMNLP'21 Findings "Reasoning Visual Dialog with Sparse Graph Learning and Knowledge Transfer"
A list of research papers on knowledge-enhanced multimodal learning
Code for ACMMM'20 ✨"Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue"
Visual dialog agents with pre-trained vision-and-language encoders.
Code for reproducing results in our paper SeqDialN: Sequential Visual Dialog Networks in Joint Visual-Linguistic Representation Space.
Summary of Visual Dialogue Papers
This repository contains code used for data collection on AMT in our ACL'20 paper History for Visual Dialog: Do we really need it?
Probabilistic framework for solving Visual Dialog
[EMNLP 22] Extending Phrase Grounding with Pronouns in Visual Dialogues.
Code repository for the final year project focusing on improving visual dialog by reducing modality biases.