Key Information Extraction from Scanned Invoices

Key information extraction (KIE) from invoice documents with a Graph Convolutional Network.

Models

  • Background subtraction: U2Net
  • Image alignment: computer vision techniques, cv2
  • Text detection: CRAFT and an in-house text-detection model
  • Text recognition: VietOCR and an in-house text-recognition model
  • KIE: Graph Convolution
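As a rough illustration of how the stages listed above could be chained, here is a minimal sketch. It assumes the stages run in the order listed and that each stage consumes the previous stage's output. `run_pipeline`, `passthrough`, and the stage names are hypothetical and are not taken from this repository; the real models would replace the placeholders.

```python
from typing import Any, Callable, List, Tuple

# A stage is a (name, function) pair; the function maps one stage's output to the next.
Stage = Tuple[str, Callable[[Any], Any]]


def run_pipeline(image: Any, stages: List[Stage]) -> Tuple[Any, List[str]]:
    """Feed the output of each stage into the next and record which stages ran."""
    result = image
    completed: List[str] = []
    for name, stage in stages:
        result = stage(result)
        completed.append(name)
    return result, completed


def passthrough(data: Any) -> Any:
    """Placeholder stage that returns its input unchanged (hypothetical stand-in for a model)."""
    return data


# Hypothetical stage order, mirroring the model list; swap in real models here.
STAGES: List[Stage] = [
    ("background_subtraction", passthrough),  # e.g. U2Net
    ("image_alignment", passthrough),         # e.g. cv2-based alignment
    ("text_detection", passthrough),          # e.g. CRAFT
    ("text_recognition", passthrough),        # e.g. VietOCR
    ("kie", passthrough),                     # graph convolution model
]
```

Calling `run_pipeline(image, STAGES)` returns the final result together with the list of stage names that were executed, which can help when debugging a single stage.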

Currently, I don't have an invoice-direction classifier model. You can develop one yourself to rotate the image back when it is turned sideways or upside down.
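If such a classifier existed, the correction step itself would be simple. The sketch below assumes a hypothetical classifier that outputs how far the image has been rotated clockwise (0, 90, 180, or 270 degrees); `correct_orientation` is an illustrative helper, not part of this repository.

```python
import numpy as np


def correct_orientation(image: np.ndarray, rotation_deg: int) -> np.ndarray:
    """Undo a clockwise rotation of 0, 90, 180, or 270 degrees.

    `rotation_deg` is assumed to come from a (hypothetical) direction classifier.
    """
    if rotation_deg not in (0, 90, 180, 270):
        raise ValueError(f"unsupported rotation: {rotation_deg}")
    # np.rot90 rotates counter-clockwise k times, which undoes
    # a clockwise rotation of 90 * k degrees.
    return np.rot90(image, k=rotation_deg // 90)
```

For example, if the classifier predicts 180 for an upside-down scan, `correct_orientation(image, 180)` returns the upright image. This works for both grayscale (H, W) and color (H, W, C) arrays, because `np.rot90` rotates in the plane of the first two axes by default.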

Pretrained model

Data

Pipeline

TODO

Command

  • Create a virtual environment using conda or virtualenv, install the dependencies, then start the API and the web GUI:
# with virtualenv
virtualenv -p python3 invoice_env
# activate environment
source invoice_env/bin/activate
# install prerequisite libraries
pip install -r requirements.txt
# 1st command, run API
make serve
# 2nd command, run web-gui with streamlit
make runapp

Then access the local server at 0.0.0.0:7778 (or localhost:7778 in a browser).
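If the page does not load, it can help to check whether anything is listening on that port. The following is a small sketch using only the Python standard library; `is_port_open` is a hypothetical helper, and only the port number 7778 comes from the instructions above.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within `timeout` seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or host unreachable.
        return False


if __name__ == "__main__":
    print("server reachable:", is_port_open("127.0.0.1", 7778))
```

A result of `False` usually means the corresponding `make` target is not running yet or has exited with an error.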

Preview

TODO

  • Add preprocess data script

Reference