Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improving facets and views with keyterms/key-concepts/key-categories #69

Open
kermitt2 opened this issue Dec 20, 2016 · 0 comments
Open

Comments

@kermitt2
Copy link
Member

As compared to individual extractions, there are currently quite a lot of spurious terms/concepts/categories in the corresponding frontend search facets and dashboard views.

The problem is that grobid-keyterm tends to extract a lot of keyterms (by default 40), but normally rank them from the most important to the less one. The facets and views simply use the occurrence of the keyterms (or key concepts or key categories) and do not consider the score associated to the keyterm to decrease its importance.

For improving these facets/views, we could either reduce the number of the extracted keyterms or to exploit the score of the keyterms when ranking the term in the facets/views (with an ElasticSearch script).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant