Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

not doing well on recognizing references in footnotes #929

Open
fredzannarbor opened this issue Jun 30, 2022 · 1 comment
Open

not doing well on recognizing references in footnotes #929

fredzannarbor opened this issue Jun 30, 2022 · 1 comment
Labels
enhancement question There's no such thing as a stupid question

Comments

@fredzannarbor
Copy link

Hi,

I have a 316 page PDF document about space warfare strategy with 342 footnotes, most of which contain references. I don't know the exact number, but most of those should be references -- 200 or more. Grobid is only finding 50-60 references. I see in #839 that finding references in footnotes is a known weak spot. That was in October 2021. What's happening now, and what are some strategies I could use to improve detection?

Fred

@kermitt2
Copy link
Owner

Hi @fredzannarbor !

Nothing happened on the topic since last October. There's very few training data for references in footnotes at the moment and the normal approach would be to add training data to cover at least minimally this case.

@kermitt2 kermitt2 added enhancement question There's no such thing as a stupid question labels Jun 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement question There's no such thing as a stupid question
Projects
None yet
Development

No branches or pull requests

2 participants