Skip to content

Unsupervised Morphological Segmentation of Low Resource Poly-synthetic Language

License

Notifications You must be signed in to change notification settings

NLPRL/EP-2020-Unsupervised-Morphological-Segmentation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

_________________________________

Yash Malik
(18075065)
B.Tech. (Semester-4)
Computer Science and Engineering
_________________________________


--------------------------------------------------------------------------------------------------------------------
The project is based on the paper "Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages"
Eskander et. al (2019), available at 'https://www.aclweb.org/anthology/W19-4222.pdf'.

All these files are also hosted at 'https://github.com/yash-malik/Exploratory-Project'
For further queries contact '[email protected]'
--------------------------------------------------------------------------------------------------------------------


----------
Code+Data
----------

All the data and all the code used in the project are provided herewith.
To understand the working of the setup, refer to the report provided.

1. The code has been used and tested on Linux distribution 'Ubuntu' (and should produce results similar to
 those shown in the report under normal circumstances).
2. All the extra files (test output results and temporary files) have been deleted, so a fresh setup is being provided.
3. Further details about the data and the code are given in the README corresponding to the 'Code+Data' folder.


-----------
evaluation
-----------

The evaluation folder consists of the metrics used to evaluate the results, run the python script files for more info.
e.g., 'python BPR.py' from the terminal.


-------
py-cfg
-------

The py-cfg folder contains the Adaptor Grammars framework by Mark Johnson.
The copy provided herewith is a fresh copy from the official source.
(except a minor change, changing 'auto_ptr' to 'shared_ptr' in utility.h, auto_ptr is now deprecated.)
(for details please see 'https://stackoverflow.com/questions/3697686/why-is-auto-ptr-being-deprecated'.)

The makefile needs to be run to generate the relevant files and binaries.
 For Ubuntu, just the 'make' command from the terminal will do
 (for more info, refer to README by Mark Johnson in the 'py-cfg' folder)

About

Unsupervised Morphological Segmentation of Low Resource Poly-synthetic Language

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published