Releases will be numbered with the following semantic versioning format:
<major>.<minor>.<patch>
And constructed with the following guidelines:
- Breaking backward compatibility bumps the major (and resets the minor and patch)
- New additions without breaking backward compatibility bumps the minor (and resets the patch)
- Bug fixes and misc changes bumps the patch
BUG FIXES
NEW FEATURES
MINOR FEATURES
IMPROVEMENTS
CHANGES
CHANGES
- koRpus package now has a heavy link to language modules. koRpus.lang.en added as a dependency package per Meik Michalke's API change/workflow.
BUG FIXES
lemmatize_strings
andstem_strings
would split numbers with decimals rather than treating it as a single token. This issue has been corrected (see issue #3).
This package is collection of tools that stem and lemmatize text. Stemming is a process that removes endings such as suffixes. Lemmatization is the process of grouping inflected forms together as a single base form.