Enhanced page layout extraction #360

shihanmax · 2023-03-21T04:21:20Z

Added a new function, get_multi_page_layouts(), based on the existing get_page_layout().

The new function returns, for each page of a multi-page PDF file, the PDFMiner LTPage object and that page's dimensions. It does this by iterating over the pages of the PDF file and extracting the LTPage object and the page dimensions for each one.
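For readers who want to see the idea in code, here is a minimal sketch of the per-page iteration described above. It does not reproduce the PR's diff: the names `collect_layouts` and `get_multi_page_layouts_sketch`, the return shape (a list of `(layout, (width, height))` tuples), and the use of pdfminer.six's `extract_pages` are all assumptions made for illustration.

```python
from typing import Any, Iterable, List, Tuple

Dimensions = Tuple[float, float]


def collect_layouts(pages: Iterable[Any]) -> List[Tuple[Any, Dimensions]]:
    """Pair each LTPage-like object with its (width, height).

    Hypothetical helper: it only relies on each page exposing a
    ``bbox`` attribute of the form (x0, y0, x1, y1).
    """
    results = []
    for layout in pages:
        # x1 and y1 are taken as width and height, which assumes the
        # page's lower-left corner sits at the origin (0, 0).
        width, height = layout.bbox[2], layout.bbox[3]
        results.append((layout, (width, height)))
    return results


def get_multi_page_layouts_sketch(filename: str, **laparams_kwargs: Any):
    """Return one (LTPage, (width, height)) tuple per page of ``filename``.

    Assumed signature; keyword arguments are forwarded to LAParams.
    """
    # Imported here so collect_layouts stays usable without pdfminer.six.
    from pdfminer.high_level import extract_pages
    from pdfminer.layout import LAParams

    laparams = LAParams(**laparams_kwargs)
    # extract_pages yields one LTPage per page; the generator is fully
    # consumed inside collect_layouts before this function returns.
    return collect_layouts(extract_pages(filename, laparams=laparams))
```

A caller could then loop with `for layout, (width, height) in get_multi_page_layouts_sketch("some.pdf"):`, where `some.pdf` is a placeholder path.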

Add get_multi_page_layouts for layout extraction of multi-page pdfs.

MartinThoma · 2024-02-25T11:11:37Z

Hey!

As camelot is dead, we are trying to build a maintained fork at pypdf_table_extraction.

Would you like to open the PR against that fork so that we can merge your improvement?

Enhanced page layout extraction

c40a304

Add get_multi_page_layouts for layout extraction of multi-page pdfs.
