You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Is your feature request related to a problem? Please describe.
I noticed that on some complex PDF, with tables, the system pdftotext produce better result than pdf-reader gem.
I started this PR on an open-source project using langchainrb: https://github.com/nosia-ai/nosia/pull/20/files
Describe the solution you'd like
Maybe another option than pdf-reader here:
Is your feature request related to a problem? Please describe.
I noticed that on some complex PDF, with tables, the system
pdftotext
produce better result thanpdf-reader
gem.I started this PR on an open-source project using langchainrb: https://github.com/nosia-ai/nosia/pull/20/files
Describe the solution you'd like
Maybe another option than
pdf-reader
here:langchainrb/lib/langchain/processors/pdf.rb
Line 17 in 2054ef0
Or an improvement in
pdf-reader
: https://github.com/yob/pdf-readerDescribe alternatives you've considered
pdftotext
: https://www.xpdfreader.com/pdftotext-man.htmlThe text was updated successfully, but these errors were encountered: