Dutch UD syntax model, 12 layers, 384 hidden units, ALBERT architecture
Dutch UD syntax model:
- Universal tags
- Lemmas
- Morphology
- Universal dependencies
Distilled from finetuned XLM-RoBERTa large, into a transformer with 12 hidden layers, 384 hidden units, and 12 attention heads. The model uses the ALBERT architecture with 6 layer groups and 128 dimensional piece embeddings.