Skip to content

Extracting text from libreoffice PDFs #810

Discussion options

You must be logged in to vote

Just for your question not feeling so lonely, here some non-answer:
I haven't analyzed LibreOffice's PDF outputs yet, but did some from MS Word's.
As is to be expected, there was (almost) nothing special. Text reading sequence can be expected to be normal. Of course you need to take care if your doc pages contain multi-column text, but that is an issue independent from Word / LibreOffice.

Some text glyphs did not deliver the expected characters though, but that may be a Word peculiarity.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@shueffner
Comment options

Answer selected by JorjMcKie
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants