Newlines #7

Open
hugows opened this issue May 24, 2021 · 2 comments

hugows commented May 24, 2021

Hi,
I'm very impressed with the quality of the OCR in this little tool, and paired with Alfred it has become super powerful. Thank you for writing this!

The issue: I noticed that every character is correctly detected in my tests, but no newline is ever added in the output.
Is this a known limitation?

This is the only thing missing here that would make it seamless for extracting tabular data from PDFs.

Tatsh commented May 24, 2021

The delimiter to join strings is currently hard-coded to be a single space.

@TheFutureIsVoluntary

How can this be changed in the code? I might see if I can figure out how to fork this JUST for this one change. Without newlines in the output, it's nowhere near as useful to me as it could be.
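
For anyone attempting that fork, the change described above amounts to making the join delimiter a parameter instead of a fixed single space. The sketch below only illustrates the idea in Python; it is not the tool's actual code, and `join_fragments`, its `delimiter` parameter, and the sample fragments are all hypothetical names and data.

```python
from typing import Iterable


def join_fragments(fragments: Iterable[str], delimiter: str = " ") -> str:
    """Join recognized text fragments with a caller-chosen delimiter.

    The default of a single space mirrors the hard-coded behavior
    described in this thread; passing "\n" keeps one fragment per line.
    """
    return delimiter.join(fragment.strip() for fragment in fragments)


# Hypothetical OCR result: one fragment per detected line of text.
fragments = ["Name Qty", "Apple 3", "Pear 5"]

# Behavior as described in the thread: everything on one line.
print(join_fragments(fragments))

# With a newline delimiter, each detected line stays on its own row.
print(join_fragments(fragments, "\n"))
```

In the real tool, the equivalent edit would presumably be at the point where the recognized strings are joined before being written out; exposing the delimiter as an option would keep the current behavior as the default.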
