Incorrectly recognize borderlines with different weights #941
Closed
bdthanh
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 4 replies
-
Experimenting with >>> pdf.pages[0].extract_table(table_settings={"snap_tolerance": 5})
[['ID', 'Scenario\n<Description>', 'Likelihood', 'Impact\n<Impact>'],
['1', 'Description Description\n1', 'Likelihood 1', 'Impact 1'],
['2', 'Description Description\n2', 'Likelihood 2', 'Impact 2'],
['3', 'Description 3', 'Likelihood 3', 'Impact 3']] |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi, it's me again. Thanks for helping me with the previous problem. I am currently having this pdf where the table lines has different weights. If I add this code to remove invisible lines:
then the table extractor cannot extract lines with heavy weights, see following image:
Here is the pdf file that I used:
ID.pdf
If I remove that 4 lines then the result look like this:
The code I used:
Thanks for your attention!
Beta Was this translation helpful? Give feedback.
All reactions