This repository has been archived by the owner on Oct 4, 2022. It is now read-only.
igorschoester
released this
08 Jun 06:39
·
2778 commits
to develop
since this release
1.76.0 June 8th, 2020
Enhancements
- Adds a check for the exception list of French verbs with multiple stems and stems them by returning the indicated canonical stem.
- Adds a stemmer for the Italian language.
- Adds an exception check for words ending in -is/us/os where -s should not be stemmed.
- Improves the way keyphrases containing words ending in "ent" are recognized in the text.
- Stems French words that are considered too short to be stemmed according to the stemming rules, but that should nevertheless be stemmed.