Skip to content

Latest commit

 

History

History
25 lines (15 loc) · 1.05 KB

README.md

File metadata and controls

25 lines (15 loc) · 1.05 KB

Neural Document Classification

Document image classification with neural networks on a subset of the RVL-CDIP dataset [1].

Getting Started

The classification problem is tackled with two different approaches:

  • Visual approach over the image pixels with dense only and convolutional neural networks: skeleton.ipynb
  • Textual approach over the recognized image words with bag-of-words and word embedding models: skeleton_ocr.ipynb

It is recommended to begin with the visual approach as it includes more details about the computing environment setup and the dataset.

For a better experience, execute the notebooks within a Google Colab environment.

Authors

References

[1] A. W. Harley, A. Ufkes, K. G. Derpanis, "Evaluation of Deep Convolutional Nets for Document Image Classification and Retrieval," in ICDAR, 2015