PyTorch implementation of the paper VQA: Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf).
$ git clone https://github.com/tbmoon/basic_vqa.git
$ cd basic_vqa/utils
$ ./preprocess.sh
$ cd ..
$ python3 train.py
- Comparison Result
| Model | Metric | Dataset | Accuracy | Source |
|---|---|---|---|---|
| Paper Model | Open-Ended | VQA v2 | 54.08 | VQA Challenge |
| My Model | Multiple Choice | VQA v2 | 54.72 | |
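
For context on the Accuracy column: the VQA paper scores a predicted answer by human consensus, as min(number of humans who gave that answer / 3, 1). The snippet below is a minimal sketch of that formula only. The name `vqa_accuracy` and the sample answers are hypothetical and not taken from this repository, the official evaluation applies more thorough answer normalization, and whether `train.py` reports exactly this metric is not stated here.

```python
from collections import Counter
from typing import Sequence


def vqa_accuracy(predicted: str, human_answers: Sequence[str]) -> float:
    """Consensus accuracy from the VQA paper: min(#matching humans / 3, 1).

    Hypothetical helper for illustration; only lowercasing and whitespace
    stripping are applied before comparing answers.
    """
    counts = Counter(a.strip().lower() for a in human_answers)
    return min(counts[predicted.strip().lower()] / 3.0, 1.0)


# Made-up example: ten human answers collected for one question.
answers = ["yes"] * 2 + ["no"] * 8

print(vqa_accuracy("yes", answers))    # two humans agree, so partial credit
print(vqa_accuracy("no", answers))     # three or more humans agree, full credit
print(vqa_accuracy("maybe", answers))  # nobody agrees, no credit
```

Averaging this score over all questions gives a dataset-level accuracy comparable in spirit to the numbers in the table.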
- Loss and accuracy on the VQA v2 dataset
- Paper implementation
  - Paper: VQA: Visual Question Answering
  - URL: https://arxiv.org/pdf/1505.00468.pdf
- PyTorch tutorial
- Preprocessing
  - TensorFlow implementation of N2NMN
  - GitHub: https://github.com/ronghanghu/n2nmn