If you'd like to add another language, read on!
Language Detector works by looking for characters that are nearly unique to a language's script. For example, if we see ß
in some text, the language is likely German because we almost never see this character in other languages.
- Create a folder with the Language's Name in title-case in ./prep/sources
- Add in at least five files of text in the new language in .txt format. For an example, see Kurmanci. Newspapers are great sources!
- Add the Language and its corresponding code to language_to_code in init.py
- Add a test for detecting the new language to ./test/test.py
- Make a Pull Request with changes to the master branch (the default)
Reach out with questions to Daniel J. Dufour at [email protected]