Skip to content

Releases: VikParuchuri/surya

Revert thread changes

22 Oct 20:33
c3cb6c7
Compare
Choose a tag to compare

There were some issues with threading on certain devices. Will re-release after fixing.

Layout and text detection speedup

22 Oct 16:34
62c82a8
Compare
Choose a tag to compare

Overlap postprocessing with inference.

  • 20% text detection speedup
  • 30% layout speedup

Fix table recognition bug

18 Oct 17:21
bd9a2f7
Compare
Choose a tag to compare

There was an issue with columns not being detected properly

Fix bug with MPS and PyTorch 2.5

18 Oct 15:53
0013f92
Compare
Choose a tag to compare

This caused the table rec and OCR models to crash on MPS. Bug is now fixed.

Misc bugfixes

18 Oct 13:19
865306f
Compare
Choose a tag to compare
  • Fix issue with loading from folders
  • Bump pdftext version
  • Fix transformers warning

v0.6.3

14 Oct 21:48
641de07
Compare
Choose a tag to compare

Bump minimum python version to 3.10, update other packages.

Refactor cell assignment

14 Oct 18:09
4a76a39
Compare
Choose a tag to compare
  • Move cell assignment logic into a separate library I'm creating, tabled
  • Improve cell extraction from PDFs

Minor bugfixes

08 Oct 16:34
986677b
Compare
Choose a tag to compare
  • Small bugfix after the table recognition release

Table recognition model release!

08 Oct 16:11
a87dede
Compare
Choose a tag to compare
  • Add a new table recognition model that detects rows/columns and cells
  • Add benchmarks for accuracy and speed (seems to be very accurate wrt to current state of the art open model)
  • Improve memory efficiency of layout and text detection (hopefully no more memory leaks)
  • Improve resolution handling for layout/text detection/ocr, which should improve accuracy quite a bit

OCR v2

16 Aug 17:40
8d5affa
Compare
Choose a tag to compare

A new version of the OCR model with a custom architecture.

  • 20% faster
  • Automatic language detection, with support for optional language hints
  • Better accuracy on old/noisy documents
  • Basic english handwriting support (to be improved soon)