port processor to core v3 #130

kba · 2024-08-23T16:22:34Z

With this PR, eynollah supports OCR-D/core#1240. It also simplifies the processor a lot.

I'll update the ocrd-tool.json with the changed/added flags here as well.

Draft: please don't merge until v3 stable is released.

bertsky

Thanks – LGTM!

Have not tested yet, though.

Current main also looks very promising – will give it a try myself

bertsky · 2024-08-24T23:05:29Z

qurator/eynollah/processor.py

+            image_filename=page.imageFilename,
+            image_pil=page_image


Note that this filename might not be where the image returned by workspace.image_from_page came from. It could well be a derived image generated by some previous processor (just not a cropped, deskewed or binarized one, because that would have changed its coordinate system).

It's still a bit hazy to me when image_filename is actually used. Ideally, image_pil should take precedence and image_filename should only be used by the plotter/writer, at least in the "single image mode" we're using.

This is one of the aspects I hope to improve a bit with https://github.com/qurator-spk/eynollah/tree/refactoring-2024-08/

Perhaps we can also re-use session across Eynollah invocations in addition to models?

In theory, yes, but with standalone eynollah being focused on batch processing now, I am honestly not sure how/where sessions are defined for the non-dir_in option - @vahidrezanezhad can you tell us?

bertsky · 2024-08-24T23:12:25Z

qurator/eynollah/processor.py

+        # if not('://' in page.imageFilename):
+        #     image_filename = next(self.workspace.mets.find_files(local_filename=page.imageFilename)).local_filename
+        # else:
+        #     # could be a URL with file:// or truly remote
+        #     image_filename = self.workspace.download_file(next(self.workspace.mets.find_files(url=page.imageFilename))).local_filename


Suggested change

-        # if not('://' in page.imageFilename):
-        #     image_filename = next(self.workspace.mets.find_files(local_filename=page.imageFilename)).local_filename
-        # else:
-        #     # could be a URL with file:// or truly remote
-        #     image_filename = self.workspace.download_file(next(self.workspace.mets.find_files(url=page.imageFilename))).local_filename

This whole effort was to ensure we can pass a working local filename, as was needed by Eynollah. The OCR-D approach is Workspace.image_from_page / Workspace.image_from_segment, which searches for the right original or derived image, downloads it if necessary and loads it into memory.

I don't recall what the new behaviour of Eynollah is. If both an image filename and an image object are passed, who wins?

Assuming it's the memory object: this can be removed. (But then I wonder why we still pass the image filename at all...)

Well, currently we have

    if image_pil:
        self._imgs = self._cache_images(image_pil=image_pil)
    else:
        self._imgs = self._cache_images(image_filename=image_filename)
    [...]
    def _cache_images(self, image_filename=None, image_pil=None):
        ret = {}
        if image_filename:
            ret['img'] = cv2.imread(image_filename)
            self.dpi = check_dpi(image_filename)
        else:
            ret['img'] = pil2cv(image_pil)
            self.dpi = check_dpi(image_pil)

image_filename is then (or at least should be) only used passively, to generate filenames of plotted debug images as well as for PAGE serialization.

So I think image_pil should win, but for now we need both. As I said above, this is one of the things I would love to untangle in the refactoring.
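To make the precedence rule discussed in this thread concrete, here is a minimal standalone sketch. It is not Eynollah's code: `ImageSource`, `resolve_image` and the `loader` callback are hypothetical names introduced only for illustration. It assumes the in-memory image is what gets processed whenever it is given, while the filename is carried along purely for naming outputs.

```python
from dataclasses import dataclass
from typing import Any, Callable, Optional


@dataclass
class ImageSource:
    """Hypothetical container, not part of Eynollah."""
    pixels: Any          # the data that is actually processed
    origin: str          # "memory" or "file"
    name: Optional[str]  # only used for naming debug plots / PAGE output


def resolve_image(
    image_filename: Optional[str] = None,
    image_pil: Any = None,
    loader: Optional[Callable[[str], Any]] = None,
) -> ImageSource:
    """Prefer the in-memory image; fall back to loading from the filename."""
    if image_pil is not None:
        # The memory object wins; the filename is kept as a passive label.
        return ImageSource(pixels=image_pil, origin="memory", name=image_filename)
    if image_filename is None:
        raise ValueError("need image_pil or image_filename")
    if loader is None:
        raise ValueError("need a loader to read image_filename")
    return ImageSource(pixels=loader(image_filename), origin="file", name=image_filename)
```

With both arguments passed, `resolve_image(image_filename="page.png", image_pil=img)` returns the in-memory image and keeps `"page.png"` only as a label, so a stale or derived filename cannot change what is processed.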

bertsky · 2024-09-02T11:13:53Z

BTW, I just tested under (METS Server and) OCRD_MAX_PARALLEL_PAGES=2 – it works, but you need lots of GPU memory, otherwise GPU OOM happens. (It does work with CUDA_VISIBLE_DEVICES=, but of course the CPU utilization grows, so that might stall the system.)

I'm not sure if this warrants adding max_workers = 1 to EynollahProcessor ...
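For illustration, the trade-off can be sketched with the standard library alone. This is not how core v3 implements page parallelism; `effective_workers`, `run_pages` and the `processor_cap` argument are hypothetical stand-ins for "a processor-level cap such as max_workers = 1 overrides whatever the user requested via OCRD_MAX_PARALLEL_PAGES".

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Optional


def effective_workers(requested: int, processor_cap: Optional[int] = None) -> int:
    """Combine the user's request with an optional per-processor cap."""
    if processor_cap is None:
        return max(1, requested)
    return max(1, min(requested, processor_cap))


def run_pages(
    pages: List[str],
    process_page: Callable[[str], str],
    requested: int,
    processor_cap: Optional[int] = None,
) -> List[str]:
    """Process pages with at most effective_workers() pages in flight."""
    workers = effective_workers(requested, processor_cap)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves the input order of the pages in its results.
        return list(pool.map(process_page, pages))
```

In this sketch, `requested` would come from something like `int(os.environ.get("OCRD_MAX_PARALLEL_PAGES", "1"))`. A cap of 1 keeps only one page on the GPU at a time, at the cost of giving up the parallel speed-up for users who do have enough GPU memory.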

kba added 6 commits August 23, 2024 18:22

port processor to core v3

0a3f525

class Eynollah: add typing, consistent interface in CLI and OCR-D CLI

4a13781

ocrd-tool: add "allow_enhancement" parameter

9ce02a5

update processor to the latest change in bertsky/core#14

0d83db7

ocrd interface: add light_mode parameter

87adc4b

ocrd interface: add textline_light

39b16e5

kba force-pushed the v3-api branch from cc0e8e3 to 39b16e5 Compare August 24, 2024 16:04

kba and others added 8 commits August 24, 2024 18:05

ocrd interface: add right_to_left

ddcc019

ocrd interface: add ignore_page_extraction

d7caeb2

adapt to ocrd>=2.54 url vs local_filename

8dfecb7

# Conflicts: # qurator/eynollah/processor.py

adapt to OcrdFile.local_filename now :Path

3381e5a

# Conflicts: # qurator/eynollah/processor.py

fix namespace pkg setup

49c1a8f

non-legacy namespace package

c37d95d

# Conflicts: # setup.py

processor: reuse loaded models across pages, use derived images

61bcb43

# Conflicts: # qurator/eynollah/processor.py

check_dpi: fix Pillow type detection

d98fa2a

kba mentioned this pull request Aug 24, 2024

Revert "Merge pull request #97 from qurator-spk/420-namespace-package" #108

Draft

kba force-pushed the v3-api branch from cddbce2 to d98fa2a Compare August 24, 2024 17:19

bertsky approved these changes Aug 24, 2024

kba and others added 10 commits August 26, 2024 10:39

processor.py: Simplify import

ecd202e

Co-authored-by: Robert Sachunsky <[email protected]>

procesor.py: simplify imports further

d26079d

processor: no more DPI info lost

7b92620

Co-authored-by: Robert Sachunsky <[email protected]>

require ocrd >= 3.0.0b1

aef46a4

setuptools: fix (packages.find.where prevented finding namespace qurator)

dfc4ac2

undo customizing metadata_filename (not correct with namespace pkg support in core)

1e90257

adapt tool json to v3

17eafc1

Merge pull request #134 from bertsky/v3-api

9b274dc

OCR-D v3 API: fixes

Merge branch 'main' into v3-api

f9c2d85

require ocrd>=3.0.0b4

fdedae2

Merge branch 'main' into v3-api

c6e0e05

# Conflicts: # pyproject.toml # src/eynollah/cli.py

bertsky mentioned this pull request Dec 18, 2024

switch to core API 3.0 branches OCR-D/ocrd_all#454

Draft
