Created by Jonghwan Mun, Minsu Cho, and Bohyung Han at the POSTECH Computer Vision Lab (cvlab).
For details, please refer to the arXiv preprint or visit our project page.
If you use this code in a publication, please cite our paper with the following BibTeX entry:
@inproceedings{mun2017textguided,
  title={Text-guided Attention Model for Image Captioning},
  author={Mun, Jonghwan and Cho, Minsu and Han, Bohyung},
  booktitle={AAAI},
  year={2017}
}
The following Torch packages are required:
- torch (https://github.com/torch/distro)
- cutorch (luarocks install cutorch)
- cunn (luarocks install cunn)
- cudnn (https://github.com/soumith/cudnn.torch)
- display (https://github.com/szym/display)
- cv (https://github.com/VisionLabs/torch-opencv)
- hdf5 (luarocks install hdf5)
- image (luarocks install image)
- loadcaffe (https://github.com/szagoruyko/loadcaffe)

The following Python packages are required:
- json
- h5py
- cPickle
- numpy

If you use Anaconda, most of the Python dependencies are likely already installed; a rough installation sketch is given below.
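As a convenience, here is a rough installation sketch. It assumes Torch has already been installed from the distro repository linked above and that luarocks and pip are on your PATH; adjust it to your environment.

# Torch packages installable directly through luarocks:
luarocks install cutorch
luarocks install cunn
luarocks install hdf5
luarocks install image
# cudnn, display, cv (torch-opencv), and loadcaffe: follow the install
# instructions on their GitHub pages linked above.

# Python packages (json and cPickle are part of the Python 2 standard
# library, so only the remaining two need to be installed):
pip install numpy h5py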
Download the pretrained model, then run the provided script:

bash get_pretrained_model.sh
bash running_script.sh
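Since this implementation is based on neuraltalk2, captioning your own images with the downloaded model presumably follows a similar command-line interface; the script name, checkpoint path, and flags below are assumptions carried over from neuraltalk2 and may differ in this repository.

# Hypothetical invocation in the style of neuraltalk2's eval.lua;
# replace the placeholders with your own paths.
th eval.lua -model <downloaded_checkpoint.t7> -image_folder <your_image_dir> -num_images 10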
This software is made available for research purposes only. See the LICENSE file for details.
This work was funded by Samsung Electronics Co. (DMC R&D Center).
We also thank Andrej Karpathy, as this implementation is based on his code (https://github.com/karpathy/neuraltalk2).