v3.0.0a2
Pre-release
Pre-release
Changed:
- 🔥
OcrdPage
as proxy ofPcGtsType
instead of alias; also containsetree
andmapping
now - 🔥
Processor.zip_input_files
now can throwocrd.NonUniqueInputFile
andocrd.MissingInputFile
(the latter only ifOCRD_MISSING_INPUT=ABORT
) - 🔥
Processor.zip_input_files
does not by default userequire_first
anymore
(so the first file in any input file tuple per page can beNone
as well) - 🔥 no more
Workspace.overwrite_mode
, merely delegate toOCRD_EXISTING_OUTPUT=OVERWRITE
- 🎨 improve on docs result for
ocrd_utils.config
Added:
- 👉
OCRD_DOWNLOAD_INPUT
for whether input files should be downloaded before processing - 👉
OCRD_MISSING_INPUT
for how to handle missing input files (SKIP
orABORT
) - 👉
OCRD_MISSING_OUTPUT
for how to handle processing failures (SKIP
orABORT
orCOPY
)
the latter behaves like ocrd-dummy for the failed page(s) - 👉
OCRD_EXISTING_OUTPUT
for how to handle existing output files (SKIP
orABORT
orOVERWRITE
) - new CLI option
--debug
as short-hand forABORT
choices above Processor.logger
set up by constructor already (for re-use by processor implementors)default
-expand and validateocrd_tool.json
inProcessor
constructor, log invalidities- handle JSON
deprecation
inocrd_tool.json
by reporting warnings