Fix: take only span text not whole dictionary
dmacko232 committed Oct 18, 2023
1 parent ca45d8f commit 2dc75d6
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion src/sec_certs/utils/pdf.py
@@ -175,7 +175,7 @@ def segmented_pdf_to_text(segmented_pdf: list[dict[str, Any]]) -> str:
 for line in block["lines"]:
     spans = []
     for span in line["spans"]:
-        spans.append(span.strip())
+        spans.append(span["text"].strip())
     line = " ".join(spans)
     if len(line.strip()) > 0:
         lines.append(line)
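
To illustrate why the one-line change matters, here is a minimal, self-contained sketch of the corrected loop. It is not the actual body of `segmented_pdf_to_text`: the helper name `spans_to_lines` and the sample data are hypothetical, and the input shape (blocks containing `"lines"`, each holding `"spans"` that are dictionaries with a `"text"` key) is assumed from the diff context only.

```python
from typing import Any


def spans_to_lines(blocks: list[dict[str, Any]]) -> list[str]:
    """Collect non-empty text lines from a block -> line -> span structure.

    Hypothetical helper; mirrors the loop shown in the diff above.
    """
    lines: list[str] = []
    for block in blocks:
        for line in block["lines"]:
            # Each span is a dict, so the text must be read from its "text" key.
            # Calling span.strip() directly would raise AttributeError,
            # because dict has no strip() method.
            spans = [span["text"].strip() for span in line["spans"]]
            text = " ".join(spans)
            if text.strip():
                lines.append(text)
    return lines


# Hypothetical sample input: one block with a normal line and a blank line.
sample_blocks = [
    {
        "lines": [
            {"spans": [{"text": " Hello "}, {"text": "world "}]},
            {"spans": [{"text": "   "}]},  # whitespace-only, should be skipped
        ]
    }
]

result = spans_to_lines(sample_blocks)
print(result)  # ['Hello world']
```

With the pre-fix code, the first span would already fail, since `.strip()` was called on the span dictionary itself instead of on its text.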