Hey, I found that the output of doc.metadata was:
{"source":"/Users/../doc.pdf","pdf_numpages":1945,"loc":{"lines":{"from":74528,"to":74551}}}
I wonder if it is possible to extract the page number directly, instead of the line range referring to a particular chunk of text.
I tried my best, but I can't understand where "loc" is being built in the code or how to modify it. If anyone has an idea on how to manage this, let me know!
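For reference, here is a rough sketch of the kind of workaround I have in mind. It is not taken from the library: it assumes I can get the text of each page separately, that the pages were joined with a single newline before splitting, and that the "from"/"to" values in "loc" are 1-based line numbers into that joined text. The helper names (`build_page_starts`, `page_for_line`, `pages_for_loc`) are made up for illustration.

```python
from bisect import bisect_right


def build_page_starts(page_texts):
    """Return the 1-based first line number of each page in the joined text.

    Assumption: pages are joined with exactly one newline between them.
    """
    starts = []
    next_line = 1
    for text in page_texts:
        starts.append(next_line)
        # A page containing k newline characters spans k + 1 lines.
        next_line += text.count("\n") + 1
    return starts


def page_for_line(starts, line):
    """Map a 1-based line number to a 1-based page number."""
    if line < 1:
        raise ValueError("line numbers are assumed to start at 1")
    # Number of pages whose first line is <= line.
    return bisect_right(starts, line)


def pages_for_loc(starts, loc):
    """Return (first_page, last_page) for a metadata 'loc' dictionary."""
    lines = loc["lines"]
    return page_for_line(starts, lines["from"]), page_for_line(starts, lines["to"])


# Hypothetical three-page document, for illustration only.
pages = ["a\nb", "c", "d\ne\nf"]
starts = build_page_starts(pages)
first_page, last_page = pages_for_loc(starts, {"lines": {"from": 2, "to": 4}})
```

With the hypothetical pages above, a chunk covering lines 2 to 4 would map to pages 1 through 3. If the loader joins pages differently, or the line numbers are 0-based, the offsets would need adjusting.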