Extract checkboxes from pdf #474
Atul997
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment 2 replies
-
Hi @Atul997, and interesting challenge. For this particular PDF, each checkbox appears to be composed of four lines. E.g., the following code: colors = [ "red", "green", "orange", "blue" ]
page = pdf.pages[3]
im = page.to_image(resolution=300)
for i, line in enumerate(page.lines):
im.draw_line(line, stroke_width=4, stroke=colors[i % 4]) ... produces this output (which I have cropped): So, it seems likely that you use the data in |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
How can I extract marked check-boxes and corresponding text from the pdf file. I tried
extract_words()
working fine. For check-boxes it returns only the ticked asX
and corresponding text. What should I do to extract only check-boxes ?Residential Land Appraisal Form.PDF
This is the sample file from which I have to extract check boxes
Beta Was this translation helpful? Give feedback.
All reactions