Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compatability with UD Treebanks #1

Open
meliksahturker opened this issue Feb 8, 2022 · 1 comment
Open

Compatability with UD Treebanks #1

meliksahturker opened this issue Feb 8, 2022 · 1 comment

Comments

@meliksahturker
Copy link

Lately I am developing NN based systems for Dependency Parsing and POS Tagging task for Turkish using UD Treebank (v 2.9).
I wanted to include your two data files in my training sets, however there are many differences between your labels (both POS and Dependency labels) and UD labels.
Would you consider offering a utility function to convert them to UD standards? (Although I guess some are not subset of UD and conversion may not be possible? e.g: mwe)

Nevertheless, thank you for the hand tagged gold dataset.

@TKayadelen
Copy link
Collaborator

Thank you for using our dataset. That's correct, our segmentation and the tagset is currently not UD compatible. We do have a plan to create a UD version of this dataset, but due to low bandwidth it is not clear when we'll get to it.
As you suggest, it is likely that a fully automated conversion is not going to be possible due to some subtle differences.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants