Harsh Agrawal

I am a second year Ph.D student at Georgia Tech advised by Dhruv Batra. I also closely collaborate with Devi Parikh and Alexander Schwing. My research lies at the intersection of computer vision and natural language processing. In my free time, I also help maintain and manage an AI hosting platform called EvalAI (part of CloudCV project) which aims to make AI research more reproducible. Before this, I spent a couple of years as a Research Engineer at Snap Research where I was responsible for building large-scale infrastructure for visual recognition, search and developed algorithms for low-shot instance detection.


Jul 2020
One paper accepted in ECCV 2020!
Jun 2020
We were runner-up in the TextVQA Challenge 2020.
May 2020
Interning at NVIDIA with Gal Chechik.
Dec 2019
Gave a lecture "On what's possible today?" in Dr. Parikh's Computer Vision course
Nov 2019
Gave a lecture on "Meta Learning" in Dr. Batra's Deep Learning course.
Jul 2019
Two papers accepted in ICCV 2019!
Jun 2019
We were runner-up in the TextVQA Challenge 2019.
May 2019
Interning at Facebook AI Research with Marcus Rohrbach.
Apr 2019
Received the College of Computing CS7001 Research Award.
Feb 2019
CloudCV selected as a mentoring organization for Google Summer of Code 5th year in a row!
Jan 2019
Received the Snap Fellowship!


Spatially Aware Multimodal Transformers for TextVQA
Yash Kant, Dhruv Batra, Peter Anderson, Alexander Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal
European Conference on Computer Vision (ECCV) 2020
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Jyoti Aneja*, Harsh Agrawal*, Dhruv Batra, Alexander Schwing
International Conference on Computer Vision (ICCV) 2019
nocaps: novel object captioning at scale
Harsh Agrawal*, Karan Desai*, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson
International Conference on Computer Vision (ICCV) 2019
Sort Story: Sorting Jumbled Images and Captions into Stories
Harsh Agrawal*, Arjun Chandrasekaran*, Dhruv Batra, Devi Parikh, Mohit Bansal
Empirical Methods in Natural Language Processing (EMNLP) 2016
Human Attention in Visual Question Answering: Do Humans and Deep Networks Look at the Same Regions?
Abhishek Das*, Harsh Agrawal*, C. Lawrence Zitnick, Devi Parikh, Dhruv Batra
Computer Vision and Image Understanding (CVIU) 2017
Emperical Methods in Natural Language Processing (EMNLP) 2016
ICML 2016 Workshop on Visualization for Deep Learning (Best Student Paper)
Object-Proposal Evaluation Protocol is 'Gameable'
Neelima Chavali*, Harsh Agrawal*, Aroma Mahendru*, Dhruv Batra
Conference on Computer Vision and Patter Recognition (CVPR) 2016 (Spotlight)
CloudCV: Large Scale Distributed Computer Vision as a Cloud Service
Harsh Agrawal, Clint Solomon Mathialagan, Yash Goyal, Neelima Chavali, Prakriti Banik, Akrit Mohapatra, Ahmed Osman, Dhruv Batra
Book Chapter: Mobile Cloud Visual Media Computing, 265-290
EvalAI: Towards Better Evaluation Systems for AI Agents
Deshraj Yadav, Rishabh Jain, Harsh Agrawal, Prithvijit Chattopadhyay, Taranjeet Singh, Akash Jain, Shiv Baran Singh, Stefan Lee, Dhruv Batra
Fabrik: An Online Collaborative Neural Network Editor
Utsav Garg, Viraj Prabhu, Deshraj Yadav, Ram Ramrakhya, Harsh Agrawal, Dhruv Batra