-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
empty OCR #412
Comments
It says HTTP 403 to me. |
Wrong permissions after scp?! ... Please try again. |
OCR-D-OCR...xml is missing in zip archive, therefor I post it here:
|
That says it all. We are chasing the same bug (regression) that haunts us everywhere now, see OCR-D/ocrd_tesserocr#201. (Last I checked, I could not reproduce though.) |
This has the same invalid physical
@jbarth-ubhd Did you produce this with an ocrd/all Docker image? |
With this ocrd.sif from docker ocrd/all maximum : 8687316992 2024-02-21 15:30:33 +0100 ocrd.sif |
I was using roughly the same version, I think. I have no experience with singularity but i was using the maximum image from a few days ago. |
@bertsky Just out of curiosity: What is wrong with that part of the XML? |
that the original image is referencing the derived image (from deskewing). It's essentially what happens if the METS is broken in the way your snippet shows. I can reproduce this now – even without |
I can now say that it's a caching issue. If I run with The default in the Docker builds is now Line 51 in 5af34e7
|
Did add this to my singularity ocrd.env, helps. |
This is OCR-D/core#1195 |
It required a new core v2.63.3 to appear on PyPI, then a rebuild of ocrd/core and then of ocrd/all:* before this was actually fixed. |
with this workflow
there is no text in OCR-D-OCR*.xml
All files (see run.sh for workflow and ocrd.log for log):
https://digi.ub.uni-heidelberg.de/diglitData/v/christliche_kunstblaetter1862--08--empty-ocr.zip
The text was updated successfully, but these errors were encountered: