Visual Question Answering using Convolutional Neural Networks

Main Article Content

K. P. Moholkar, et. al.

Abstract

The ability of a computer system to be able to understand surroundings and elements and to think like a human being to process the information has always been the major point of focus in the field of Computer Science. One of the ways to achieve this artificial intelligence is Visual Question Answering. Visual Question Answering (VQA) is a trained system which can answer the questions associated to a given image in Natural Language. VQA is a generalized system which can be used in any image-based scenario with adequate training on the relevant data. This is achieved with the help of Neural Networks, particularly Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN). In this study, we have compared different approaches of VQA, out of which we are exploring CNN based model. With the continued progress in the field of Computer Vision and Question answering system, Visual Question Answering is becoming the essential system which can handle multiple scenarios with their respective data.

Downloads

Download data is not yet available.

Metrics

Metrics Loading ...

Article Details

How to Cite
et. al., K. P. M. . (2021). Visual Question Answering using Convolutional Neural Networks. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 12(15), 170–175. Retrieved from https://turcomat.org/index.php/turkbilmat/article/view/1602
Section
Research Articles