Do medium and small models really have the same accuracy? #13199
For en_core_web_md and en_core_web_lg (3.7.1), the reported accuracy is identical by every metric (POS tagging, sentence segmentation, deps, ENTS_P, ENTS_F) except for one: ENTS_R. For the Korean models (3.7.0), everything is identical except for the metrics related to named entities. And many other models as of the time of writing also report identical accuracy on most metrics except for those related to named entities. This is obviously good news if it means that an application that doesn't deal with named entities can downgrade to a more efficient model and expect no measurable difference in accuracy on the tagger/parser/lemmatizer. But I would like to confirm: is there in fact no measurable difference in accuracy?
Replies: 1 comment
The small and medium models usually don't have similar accuracy, but the medium and large models are often very similar. You can see the raw scores in `nlp.meta["performance"]`, or for all published models in the `meta/` directory of https://github.com/explosion/spacy-models. (Be aware that the reported scores are for the dev set for most resources.) And since the types of errors may not be identical for two different models even if the accuracy is the same, you probably still want to run a detailed evaluation for your own data/task.
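For example, one way to check which metrics actually differ between two models is to diff their `meta["performance"]` dicts. The helper below is a hypothetical sketch (not part of spaCy's API); the metric keys shown mirror real ones like `tag_acc`, `ents_p`, `ents_r`, and `ents_f`, but the numbers are made up for illustration. With installed packages you would populate the dicts via `spacy.load(name).meta["performance"]`.

```python
def diff_performance(perf_a, perf_b, tol=1e-9):
    """Return {metric: (score_a, score_b)} for metrics whose scores differ.

    perf_a/perf_b have the same shape as nlp.meta["performance"]:
    a flat dict mapping metric names to floats.
    """
    diffs = {}
    for key in sorted(set(perf_a) | set(perf_b)):
        a, b = perf_a.get(key), perf_b.get(key)
        # Report metrics missing from one model, or differing beyond tol
        if a is None or b is None or abs(a - b) > tol:
            diffs[key] = (a, b)
    return diffs

# Illustrative numbers only (mirroring the md/lg observation above):
md_scores = {"tag_acc": 0.97, "ents_f": 0.85, "ents_r": 0.84}
lg_scores = {"tag_acc": 0.97, "ents_f": 0.85, "ents_r": 0.86}
print(diff_performance(md_scores, lg_scores))
# → {'ents_r': (0.84, 0.86)}
```

For a task-specific comparison, `spacy evaluate` against your own annotated data is the more reliable check.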