OCR on Product Photos #17

cod3monk · 2022-11-03T05:13:30Z

Goal: make items easier findable, without having to manually describe them in detail.

Idea:

Run OCR on all photos and extract text
Store extracts with photo
Extend item search to include photo OCR-extracts

Things to consider:

Allow this process to be done image-by-image, so that it can be improved in future
Primary goal would be to run this process in batch, possibly outside of django, but if the implementation is capable of doing the same live on newly uploaded photos this would be a nice feature
Test cases to determine quality of extracts would be good to have, e.g. compare automatic extract to manual extractions
Consider comparing multiple OCR systems
Also extract and store EANs or other barcodes present in photos

danieloeh · 2022-11-13T19:52:42Z

So far, i have implemented a basic prototype of this feature which lets you run OCR on all images via python manage.py ocr. It uses pytesseract for the OCR. The result is shown below the description of each image in the "Update Item" view.

Feature branch: https://github.com/danieloeh/inventory_management/tree/feature/ocr

cod3monk mentioned this issue Nov 17, 2022

First draft of OCR command #18

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR on Product Photos #17

OCR on Product Photos #17

cod3monk commented Nov 3, 2022 •

edited

Loading

danieloeh commented Nov 13, 2022

OCR on Product Photos #17

OCR on Product Photos #17

Comments

cod3monk commented Nov 3, 2022 • edited Loading

danieloeh commented Nov 13, 2022

cod3monk commented Nov 3, 2022 •

edited

Loading