Satwik Kottur
satwikkottur
Research Scientist
Languages
Repos
29
Stars
76
Forks
29
Top Language
Python
Top Repositories
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
Learning visually grounded word embeddings using Abstract scenes
MCMC for posterior distribution sampling
A movie recommender system based on Collaborative Filtering and Topic Modeling (LDA)
Fluid simulation - Water and Fire interaction
[Arxiv2022] Egocentric Video-Language Pretraining
Repositories
29
Repository to generate CLEVR-Dialog: A diagnostic dataset for Visual Dialog
Learning visually grounded word embeddings using Abstract scenes
[Arxiv2022] Egocentric Video-Language Pretraining
A movie recommender system based on Collaborative Filtering and Topic Modeling (LDA)
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
MCMC for posterior distribution sampling
Code for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
A collection of useful .gitignore templates
We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".
No description provided.
Code for the paper Multimodal Transformer Networks for End-to-End Video-Grounded Dialogue Systems (ACL19)
With the aim of building next-generation virtual assistants that can handle multimodal inputs and perform multimodal actions, we introduce two new datasets (both in the virtual shopping domain), the annotation schema, the core technical tasks, and the baseline models. The code for the baselines and the datasets will be open-sourced.
Fluid simulation - Water and Fire interaction
Personal Webpage
[CVPR 2017] Torch code for Visual Dialog
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
Starter code in PyTorch for the Visual Dialog challenge
Implementation of EMNLP 2017 Paper "Natural Language Does Not Emerge 'Naturally' in Multi-Agent Dialog" using PyTorch and ParlAI
Code samples to help you get started with the Amazon Mechanical Turk Requester API
No description provided.
No description provided.
Recurrent Neural Network library for Torch7's nn
Implementation of the image-sentence embedding method described in "Unifying Visual-Semantic Embeddings with Multimodal Neural Language Models"
The second version of the interface for Abstract Scenes research project.
Open source software library for numerical computation using data flow graphs.
Code samples for my book "Neural Networks and Deep Learning"
Android App to do sparse reconstruction
Kaggle's competition for using Google's word2vec package for sentiment analysis
Fall 2014 Course project for Computer Vision course