-
Notifications
You must be signed in to change notification settings - Fork 7
Workflow Guide font style annotation
These processors can determine the font style (e.g. italic, bold, underlined) and font family text recognition results.
Note: ocrd-tesserocr-fontshape
can either use existing segmentation or
segment on-demand. It can detect the following font styles:
fontSize
fontFamily
bold
italic
underlined
monospace
serif
Note: ocrd-tesserocr-fontshape
needs the old, pre-LSTM models to work at
all. You can use the pre-installed osd
(which is purely rule-based), but
there might be better alternatives for your language and script. You can still
get the old models from Tesseract's Github repo at the last
revision
before the LSTM
models
replaced them, usually under the same name. (Thus, deu.traineddata
used to be
a rule-based model but now is an LSTM model. deu-frak.traineddata
is still
only available as rule-based model and was complemented by the new LSTM models
frk.traineddata
and script/Fraktur.traineddata
.) If you do need one of the
models that was replaced completely, then you should at least rename the old
one (e.g. to deu3.traineddata
).
Processor | Parameter | Remarks | Call |
---|---|---|---|
ocrd-tesserocr-fontshape | -P model osd -P padding 2 |
Download other pre-LSTM models [from GitHub](https://github.com/tesseract-ocr/tessdata/commit/3cf1e2df1fe1d1da29295c9ef0983796c7958b7d) | ocrd-tesserocr-fontshape -I OCR-D-OCR -O OCR-D-OCR-FONT |
E.g.
- which parameters do you use with what values?
- which parameters are insufficiently documented?
- which aspects of a processor should be parameterizable but are not?
E.g. which processors worked best with what material? -- feel free to post sample images here, too.
Welcome to the OCR-D wiki, a companion to the OCR-D website.
Articles and tutorials
- Running OCR-D on macOS
- Running OCR-D in Windows 10 with Windows Subsystem for Linux
- Running OCR-D on POWER8 (IBM pSeries)
- Running browse-ocrd in a Docker container
- OCR-D Installation on NVIDIA Jetson Nano and Xavier
- Mapping PAGE to ALTO
- Comparison of OCR formats (outdated)
- A Practicioner's View on Binarization
- How to use the bulk-add command to generate workspaces from existing files
- Evaluation of (intermediary) steps of an OCR workflow
- A quickstart guide to ocrd workspace
- Introduction to parameters in OCR-D
- Introduction to OCR-D processors
- Introduction to OCR-D workflows
- Visualizing (intermediate) OCR-D-results
- Guide to updating ocrd workspace calls for 2.15.0+
- Introduction to Docker in OCR-D
- How to import Abbyy-generated ALTO
- How to create ALTO for DFG Viewer
- How to create searchable fulltext data for DFG Viewer
- Setup native CUDA Toolkit for Qurator tools on Ubuntu 18.04
- OCR-D Code Review Guidelines
- OCR-D Recommendations for Using CI in Your Repository
Expert section on OCR-D- workflows
Particular workflow steps
Workflow Guide
- Workflow Guide: preprocessing
- Workflow Guide: binarization
- Workflow Guide: cropping
- Workflow Guide: denoising
- Workflow Guide: deskewing
- Workflow Guide: dewarping
- Workflow Guide: region-segmentation
- Workflow Guide: clipping
- Workflow Guide: line-segmentation
- Workflow Guide: resegmentation
- Workflow Guide: olr-evaluation
- Workflow Guide: text-recognition
- Workflow Guide: text-alignment
- Workflow Guide: post-correction
- Workflow Guide: ocr-evaluation
- Workflow Guide: adaptation-of-coordinates
- Workflow Guide: format-conversion
- Workflow Guide: generic transformations
- Workflow Guide: dummy processing
- Workflow Guide: archiving
- Workflow Guide: recommended workflows