NLP1_IR

NLP1 2017-18 UvA Project

Image features: https://aashishv.stackstorage.com/s/sEdlxbDqaPHeNxi

Projects

Please find the project report requirements here.

Links

The Unreasonable Effectiveness of Recurrent Neural Networks
Understanding LSTM Networks
PyTorch Tutorials
PyTorch Data Loading Tutorial

Get Images

For training your models you can treat the image features as a black box, but for testing and analysis you will want to look at the actual images. The VQA dataset uses the 2014 version of MSCOCO. Many tasks, such as image captioning and object detection, use this dataset, so it might be worthwhile to download it. However, it is quite big (13GB train, 6GB validation, 6GB test). For this project you only need the training set, since all datapoints come from this split. An image can be retrieved via its file name: the image id equals the trailing digits of the image file name (without the leading zeros).

Besides downloading the dataset, we provide a second option. The MSCOCO annotations come with a flickr url where each image can be found online. The file imgid2imginfo.json contains the flickr url (and more image information) for the MSCOCO training and validation datasets. The file can be used as follows:

import json

# load the image info file
with open('data/imgid2imginfo.json', 'r') as file:
    imgid2info = json.load(file)

print(imgid2info['265781'])
# PRINTS (formatted here for readability):
#{'coco_url': 'http://images.cocodataset.org/train2014/COCO_train2014_000000265781.jpg',
# 'date_captured': '2013-11-14 21:01:15',
# 'file_name': 'COCO_train2014_000000265781.jpg',
# 'flickr_url': 'http://farm6.staticflickr.com/5199/5882642496_9e58939526_z.jpg',
# 'height': 424,
# 'id': 265781,
# 'license': 1,
# 'width': 640}
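
The mapping between image ids and file names described above can be sketched as follows. This is a minimal sketch, not part of the project code: the helper names (`image_id_to_file_name`, `file_name_to_image_id`, `download_image`) and the `data/images` target directory are hypothetical, and the 12-digit zero padding is inferred from the example file name `COCO_train2014_000000265781.jpg`. Falling back to `coco_url` when no `flickr_url` is present is also an assumption.

```python
import os
import re
import urllib.request


def image_id_to_file_name(image_id, split="train2014"):
    """Build the MSCOCO 2014 file name for an image id.

    Assumes the id is zero-padded to 12 digits, as in the example entry.
    """
    return "COCO_{}_{:012d}.jpg".format(split, int(image_id))


def file_name_to_image_id(file_name):
    """Recover the image id from the trailing digits of a file name."""
    match = re.search(r"(\d+)\.jpg$", file_name)
    if match is None:
        raise ValueError("unexpected file name: {}".format(file_name))
    return int(match.group(1))  # int() drops the leading zeros


def download_image(info, target_dir="data/images"):
    """Fetch one image using an entry from imgid2imginfo.json.

    Hypothetical helper: prefers 'flickr_url' and falls back to 'coco_url'.
    """
    url = info.get("flickr_url") or info["coco_url"]
    os.makedirs(target_dir, exist_ok=True)
    path = os.path.join(target_dir, info["file_name"])
    urllib.request.urlretrieve(url, path)
    return path


print(image_id_to_file_name(265781))
print(file_name_to_image_id("COCO_train2014_000000265781.jpg"))
```

With the info file loaded as shown earlier, `download_image(imgid2info['265781'])` would save that single image locally, which may be enough for inspecting a handful of examples without downloading the full training set.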
