Tokenizer 1.36.0
New features
- [Python] Add argument
vocabulary
in theTokenizer
constructor to set the vocabulary with a list of tokens instead of using a file - [Python] Add function
pyonmttok.is_valid_language
to check if a language code is valid and can be passed to theTokenizer
constructor