A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments. Looking to improve the quality of pseudoparallel data. Final project for LING28/Computational Linguistics, Dartmouth College, Winter 2022.
-
Notifications
You must be signed in to change notification settings - Fork 1
A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments between Kalaallisut and English. Currently looking to improve the quality of pseudoparallel data. Final project for LING28/Computational Linguistics, Dartmouth College, Winter 2022.
AlexJonesNLP/KALComp
About
A comparable corpus of Kalaallisut and Danish web-crawled sentences, along with some noisy aligned texts and code for MT finetuning experiments between Kalaallisut and English. Currently looking to improve the quality of pseudoparallel data. Final project for LING28/Computational Linguistics, Dartmouth College, Winter 2022.
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published