Description

Recently, algorithms for object recognition and related tasks have become sufficiently proficient that new vision tasks can now be pursued. In this paper, we build a system capable of answering open-ended text-based questions about images, which is known as Visual Question Answering (VQA). Our approach’s key insight is that we can predict the form of the answer from the question. We formulate our solution in a Bayesian framework. When our approach is combined with a discriminative model, the combined model achieves state-of-the-art results on four benchmark datasets for open-ended VQA: DAQUAR, COCO-QA, The VQA Dataset, and Visual7W.

Date of creation, presentation, or exhibit

6-2016

Comments

© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

Document Type

Conference Paper

Department, Program, or Center

Chester F. Carlson Center for Imaging Science (COS)

Recommended Citation

K. Kafle and C. Kanan, "Answer-Type Prediction for Visual Question Answering," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, 2016, pp. 4976-4984. doi: 10.1109/CVPR.2016.538

Campus

RIT – Main Campus

Download

COinS

Presentations and other scholarship

Answer-Type Prediction for Visual Question Answering

Description

Date of creation, presentation, or exhibit

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Search

Browse

Author Corner

RIT Links

Presentations and other scholarship

Answer-Type Prediction for Visual Question Answering

Authors

Description

Date of creation, presentation, or exhibit

Comments

Document Type

Department, Program, or Center

Recommended Citation

Campus

Share

Search

Browse

Author Corner

RIT Links