Finetune smaller model on computer screenshots ? #113

apirrone · 2024-05-26T18:18:12Z

Hi!

First, great work. From what I tested, it seems to work really well, congrats!

I have a use case where I need to perform OCR, layout analysis, etc. on computer screenshots. For such images, surya actually works really well, but I wonder how a smaller model trained only on such images would perform. In my use case, each screenshot would need to be fully processed quite fast (ideally in under 2 seconds) and without taking too much memory or CPU/GPU.

Maybe I am wrong, but the problem seems simpler than training a general model that works on any kind of document, as surya does. Do you think a small model could do the job?

Thanks!

metatrot · 2024-06-20T05:38:57Z

I'm also looking for a solution for a screenshot use case. Most OCR tools seem geared toward photos, handwriting, or PDFs, and they don't do well on ordinary GUI text.

yechens · 2025-01-08T05:32:15Z

Perhaps you could try Baidu's PP-OCR models (PaddleOCR), which are fast, accurate, lightweight, and easy to fine-tune on your own dataset.
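Whichever model ends up being tried, it may help to check it against the "under 2 seconds per screenshot" target from the original post before investing in fine-tuning. Below is a minimal timing sketch, not tied to any particular OCR library: `run_ocr` is a hypothetical callable standing in for the model under test, and `measure_latency` and `within_budget` are illustrative names, not part of surya or PaddleOCR. It only measures wall-clock time; memory and CPU/GPU usage would need separate tooling.

```python
import statistics
import time
from typing import Callable, Iterable, List


def measure_latency(run_ocr: Callable[[object], object],
                    screenshots: Iterable[object],
                    warmup: int = 1) -> List[float]:
    """Return per-screenshot wall-clock latencies in seconds.

    `run_ocr` is a placeholder for the real OCR call (an assumption,
    not an API from any specific library).
    """
    items = list(screenshots)
    # Warm-up runs absorb one-time costs such as model loading.
    for item in items[:warmup]:
        run_ocr(item)
    latencies = []
    for item in items:
        start = time.perf_counter()
        run_ocr(item)
        latencies.append(time.perf_counter() - start)
    return latencies


def within_budget(latencies: List[float], budget_s: float = 2.0) -> bool:
    """True if at least one run was timed and the slowest stayed under budget."""
    return bool(latencies) and max(latencies) < budget_s


if __name__ == "__main__":
    # Stand-in for a real model; replace with the OCR call being evaluated.
    def fake_ocr(image):
        time.sleep(0.01)
        return []

    lat = measure_latency(fake_ocr, ["a.png", "b.png", "c.png"])
    print(f"median {statistics.median(lat):.3f}s, max {max(lat):.3f}s")
    print("within 2 s budget:", within_budget(lat))
```

Running this on a representative batch of real screenshots, on the target hardware, gives a like-for-like comparison between the general model and any smaller or fine-tuned candidate.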
