VQA swMATH ID: 36506 Software Authors: Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Dhruv Batra, Devi Parikh Description: VQA: Visual Question Answering. VQA is a new dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer. 265,016 images (COCO and abstract scenes). At least 3 questions (5.4 questions on average) per image. 10 ground truth answers per question. 3 plausible (but likely incorrect) answers per question. Automatic evaluation metric. Homepage: https://visualqa.org Source Code: https://github.com/GT-Vision-Lab/VQA Keywords: arXiv_cs.CL; Computer Vision; Pattern Recognition; arXiv_cs.CV; VQA; Visual Question Answering Related Software: CLEVR; YOLO; ImageNet; Grad-CAM; Adam; Flickr30K; CLEVR dataset; CIDEr; Caffe; ViLBERT; PyTorch; Im2Text; MS-COCO; BERT; DenseCap; GloVe; Faster R-CNN; Python; DeepProbLog; AQuA Cited in: 6 Documents Standard Articles 1 Publication describing the Software Year VQA: Visual Question Answering Aishwarya Agrawal, Jiasen Lu, Stanislaw Antol, Margaret Mitchell, C. Lawrence Zitnick, Dhruv Batra, Devi Parikh 2015 all top 5 Cited by 19 Authors 1 Bengio, Yoshua 1 Bensch, Suna 1 Chandar, Sarath 1 Chen, Wei 1 Cho, Kyunghyun 1 Doran, Derek 1 Ganjdanesh, Alireza 1 Gülçehre, Çağlar 1 Hellström, Thomas 1 Higuera, Nelson 1 Huang, Heng 1 Oetsch, Johannes 1 Ostovar, Ahmad 1 Pritz, Michael 1 Ras, Gabrielle 1 Szeliski, Richard 1 van Gerven, Marcel A. J. 1 Xie, Ning 1 Zhang, Jipeng Cited in 5 Serials 1 Acta Informatica 1 Neural Computation 1 The Journal of Artificial Intelligence Research (JAIR) 1 Theory and Practice of Logic Programming 1 Texts in Computer Science Cited in 2 Fields 5 Computer science (68-XX) 1 Biology and other natural sciences (92-XX) Citations by Year