This problem is caused by a font. Text extraction can only work if a complete and correct back-translation table exists from each "glyph" (the program that draws the visible character) to its originating Unicode code point. Text set in a font without such a table cannot be mapped back reliably.
garbled_26282171_1.pdf
In the PDF above, some of the extracted text is garbled, e.g. '5\x17\x04\x07\x08;\x02?\x12\x12\x02@9A\x13\x0231\x0231\x02\x111\x02!\x12\x02+\x06\x05\x02;\x02?\x12\x12\x02@9A\x13\x02!\x15\x02\x15!\x02!1\x02\x11!\x02\n\x04\x04\x03', while other parts are normal, e.g. 'on September 16, 2015. © 2015 Am..'.
How can I get the correct content? It looks like an encoding problem, but why is some of the text right and some of it \x...?
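A likely reason for the mix, assuming the PDF uses more than one font, is that only some of its fonts lack a usable glyph-to-Unicode table, so only text set in those fonts comes out as control bytes. The sketch below does not repair anything. It only flags extracted strings that look like the garbled sample quoted above so they can be handled separately. The function names and the 0.1 threshold are my own assumptions for illustration, not part of any library.

```python
import unicodedata

# Whitespace control characters that are normal in extracted text.
ALLOWED_CONTROLS = {"\n", "\r", "\t"}


def control_char_ratio(text: str) -> float:
    """Return the fraction of characters that are unexpected control characters."""
    if not text:
        return 0.0
    bad = sum(
        1
        for ch in text
        if unicodedata.category(ch) == "Cc" and ch not in ALLOWED_CONTROLS
    )
    return bad / len(text)


def looks_garbled(text: str, threshold: float = 0.1) -> bool:
    """Heuristic: treat text as garbled if too many control characters appear.

    The threshold is an arbitrary assumption; tune it on your own documents.
    """
    return control_char_ratio(text) > threshold


if __name__ == "__main__":
    samples = [
        "5\x17\x04\x07\x08;\x02?\x12\x12",      # start of the garbled sample
        "on September 16, 2015. © 2015 Am..",  # the normal sample
    ]
    for sample in samples:
        print(repr(sample), looks_garbled(sample))
```

This only catches the control-byte pattern shown in this thread. A font with a wrong table that maps to printable characters would not be caught by it.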