Unsupervised Morphological Segmentation of Low Resource Poly-synthetic Language
NLPRL/EP-2020-Unsupervised-Morphological-Segmentation
_________________________________
Yash Malik (18075065)
B.Tech. (Semester-4)
Computer Science and Engineering
_________________________________

--------------------------------------------------------------------------------
The project is based on the paper "Unsupervised Morphological Segmentation for
Low-Resource Polysynthetic Languages", Eskander et al. (2019), available at
'https://www.aclweb.org/anthology/W19-4222.pdf'.
All these files are also hosted at 'https://github.com/yash-malik/Exploratory-Project'
For further queries contact '[email protected]'
--------------------------------------------------------------------------------

----------
Code+Data
----------
All the data and code used in the project are provided here.
To understand how the setup works, refer to the accompanying report.
1. The code has been used and tested on the Linux distribution 'Ubuntu' (and should
   produce results similar to those shown in the report under normal circumstances).
2. All extra files (test output results and temporary files) have been deleted, so
   this is a fresh setup.
3. Further details about the data and the code are given in the README of the
   'Code+Data' folder.

-----------
evaluation
-----------
The evaluation folder contains the scripts for the metrics used to evaluate the results.
Run a Python script from the terminal for more info, e.g., 'python BPR.py'.

-------
py-cfg
-------
The py-cfg folder contains the Adaptor Grammars framework by Mark Johnson.
The copy provided here is a fresh copy from the official source, with one minor change:
'auto_ptr' has been replaced by 'shared_ptr' in utility.h, because auto_ptr is now deprecated
(for details, see 'https://stackoverflow.com/questions/3697686/why-is-auto-ptr-being-deprecated').
The makefile needs to be run to generate the relevant files and binaries.
For Ubuntu, running the 'make' command from the terminal inside the 'py-cfg' folder is enough
(for more info, refer to the README by Mark Johnson in the 'py-cfg' folder).
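
To give a rough idea of what a boundary-based metric such as the one behind 'BPR.py' measures,
here is a minimal sketch of boundary precision, recall and F1. It is NOT the code in the
evaluation folder. It assumes that BPR stands for boundary precision and recall, that
segmentations are written with '+' between morphemes, and that counts are micro-averaged over
all words. The function names 'boundaries' and 'boundary_prf' are hypothetical.

```python
def boundaries(segmentation, sep="+"):
    """Return the set of character offsets where a morpheme boundary falls."""
    positions = set()
    offset = 0
    morphs = segmentation.split(sep)
    # Every morph except the last one is followed by a boundary.
    for morph in morphs[:-1]:
        offset += len(morph)
        positions.add(offset)
    return positions


def boundary_prf(gold, predicted, sep="+"):
    """Micro-averaged boundary precision, recall and F1 (assumed averaging scheme)."""
    total_gold = total_pred = total_hit = 0
    for gold_seg, pred_seg in zip(gold, predicted):
        gold_b = boundaries(gold_seg, sep)
        pred_b = boundaries(pred_seg, sep)
        total_gold += len(gold_b)
        total_pred += len(pred_b)
        total_hit += len(gold_b & pred_b)  # boundaries placed correctly
    precision = total_hit / total_pred if total_pred else 0.0
    recall = total_hit / total_gold if total_gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    # Illustrative English words, not data from the project.
    gold = ["un+break+able", "walk+ed"]
    predicted = ["un+breakable", "wal+ked"]
    print(boundary_prf(gold, predicted))
```

In this toy example, one of the two predicted boundaries matches one of the three gold
boundaries. The scripts in the evaluation folder may tokenize, average or report
differently, so treat their output as the reference.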