This repository hosts the contributor source files for the vgg-19 model. ModelHub integrates these files into an engine and controlled runtime environment. A unified API allows for out-of-the-box reproducible implementations of published models. For more information, please visit www.modelhub.ai or contact us [email protected].
id | f7201cb5-37af-476b-86e0-4c8120d8235e |
application_area | ImageNet |
task | Classification |
task_extended | ImageNet classification |
data_type | Image/Photo |
data_source | http://www.image-net.org/ |
title | Very Deep Convolutional Networks for Large-Scale Image Recognition |
source | arXiv |
url | https://arxiv.org/abs/1409.1556 |
year | 2015 |
authors | Karen Simonyan, Andrew Zisserman |
abstract | In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision. |
google_scholar | https://scholar.google.com/scholar?cites=15993525775437884507&as_sdt=40000005&sciodt=0,22&hl=en |
bibtex | @article{DBLP:journals/corr/SimonyanZ14a, author = {Karen Simonyan and Andrew Zisserman}, title = {Very Deep Convolutional Networks for Large-Scale Image Recognition}, journal = {CoRR}, volume = {abs/1409.1556}, year = {2014}, url = {http://arxiv.org/abs/1409.1556}, archivePrefix = {arXiv}, eprint = {1409.1556}, timestamp = {Mon, 13 Aug 2018 16:46:51 +0200}, biburl = {https://dblp.org/rec/bib/journals/corr/SimonyanZ14a}, bibsource = {dblp computer science bibliography, https://dblp.org}} |
description | VGG-19 is a convolutional neural network trained on more than a million images from the ImageNet database. It is 19 layers deep and can classify images into 1000 object categories. It took part in the ImageNet ILSVRC-2014 challenge, where it secured the first and the second places in the localisation and classification tasks respectively. |
provenance | https://mxnet.apache.org/model_zoo/index.html |
architecture | Convolutional Neural Network (CNN) |
learning_type | Supervised learning |
format | .json |
I/O | model I/O can be viewed here |
license | model license can be viewed here |
To run this model and view others in the collection, view the instructions on ModelHub.
To contribute models, visit the ModelHub docs.