Replies: 2 comments 6 replies
-
You are checking this by hand? That could be a large mission. I must admit, I cannot tell if these 2 u exampled a same person or not, lol. A detail, by your folder name |
Beta Was this translation helpful? Give feedback.
-
hello...
As a result of the similarity evaluation of the ms1mv3 dataset,
approximately more than 1000 classes overlap.
I will have my subordinates do this, and I will send it to you when it is
completed.
I've received a lot of help from you, and I think it's worth it enough.
I hope you achieve an engine of sota performance in the tensorflow
framework.
thanks.
2022년 2월 7일 (월) 오전 10:02, leondgarse ***@***.***>님이 작성:
… Just end of our new year vacation. :)
Ya, you are right on these samples. I once have a cleaning test using
trained model, that deleting samples in a same class with a small
similarity, and merging classes with a large similarity. As the trained
model will have accuracy > 99% on training dataset, may kind of ease this
issue. What do you think of this idea?
—
Reply to this email directly, view it on GitHub
<#79 (reply in thread)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADRFGOHG6PLKYFCVHDHAXLDUZ4K3PANCNFSM5NECQPYQ>
.
Triage notifications on the go with GitHub Mobile for iOS
<https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675>
or Android
<https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub>.
You are receiving this because you authored the thread.Message ID:
***@***.***
com>
--
유빈아빠
|
Beta Was this translation helpful? Give feedback.
-
Hello
There are many problems with ms1vm3 dataset.
There are more than a thousand datasets classified as different classes even though they are the same person as below.
it takes too much time to organize these datasets.
Of course, glint360k and web600k have the same problem.
The bigger problem is that there are many cases where different characters are included in the same class.
I'm currently organizing this.
In particular, Asian data is serious.
I think these problems are one of the reasons for the poor accuracy of Asians.
[sample]
Beta Was this translation helpful? Give feedback.
All reactions