-
Notifications
You must be signed in to change notification settings - Fork 0
/
eval.log
49 lines (49 loc) · 3.11 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
Running the following version of UD tools:
commit c1984d97df0ecdcc1b50fbeaa8c96419c6321432
Author: Dan Zeman <[email protected]>
Date: Sun Nov 10 10:33:45 2024 +0100
Evaluating the following revision of UD_Czech-PUD:
commit a4cf13e34fa213ffeca565939445ae61988d8714
Merge: bec3c28 140adf7
Author: Dan Zeman <[email protected]>
Size: counted 18610 of 18610 words (nodes).
Size: min(0, log((N/1000)**2)) = 5.84739814130832.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Did not find more than 10000 training words.
Split: Did not find at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 15 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 14313 out of 18610 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 31 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 1.
Udapi:
TOTAL 30
Udapi: found 30 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 18610 words.
Genres: found 2 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang cs --max-err=10 UD_Czech-PUD/cs_pud-ud-test.conllu
[Line 763 Sent n01015036 Node 30]: [L3 Warning leaf-det] 'det' not expected to have children (30:tolik:det --> 36:Kiera:nmod)
[Line 2133 Sent n01035004 Node 32]: [L3 Warning leaf-det] 'det' not expected to have children (32:víc:det --> 35:odpovědí:nmod)
[Line 3730 Sent n01059054 Node 11]: [L3 Warning leaf-det] 'det' not expected to have children (11:víc:det --> 15:prvků:advcl)
[Line 5350 Sent n01089007 Node 7]: [L3 Warning leaf-det] 'det' not expected to have children (7:víc:det --> 10:jídla:nmod)
[Line 6810 Sent n01112014 Node 29]: [L3 Warning leaf-det] 'det' not expected to have children (29:víc:det --> 32:ryb:nmod)
[Line 19948 Sent n04002020 Node 9]: [L3 Warning leaf-det] 'det' not expected to have children (9:méně:det --> 14:Madrid:nmod)
[Line 20527 Sent n05002004 Node 18]: [L3 Warning leaf-det] 'det' not expected to have children (18:více:det --> 17:600 000:nmod)
[Line 20535 Sent n05002004 Node 18]: [L3 Warning leaf-det] 'det' not expected to have children (18:více:det --> 25:rokem:advcl)
Warnings: 8
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.117647058823529) = 0.00904977375565611
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.423248791043589) = 0.108525331036818
(weight=0.0512820512820513) * (score{split}=0.34) = 0.0174358974358974
(weight=0.0769230769230769) * (score{tags}=0.705882352941177) = 0.0542986425339367
(weight=0.307692307692308) * (score{udapi}=0.983879634605051) = 0.302732195263093
(weight=0.0769230769230769) * (score{udeprels}=0.837837837837838) = 0.0644490644490645
(TOTAL score=0.679567827551388) * (availability=1) * (validity=1) = 0.679567827551388
STARS = 3.5
UD_Czech-PUD 0.679567827551388 3.5