-
Notifications
You must be signed in to change notification settings - Fork 277
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add descriptive_stats
to all tasks
#1475
Comments
Hi @Samoed, thanks for all the efforts you put in! I'll run what's missing 🙂 |
@Samoed question: do you know why we don't have the number of qrels for retieval datasets? e.g.,
|
I forgot to add them😅. Can you add missing fileds? I can recompute rest of the tasks |
@imenelydiaker how is this one going? do you need any help with that? |
Launched it one hour ago, it's running I'm at MSMarcov2, the datasets are quite huge. Maybe we can split? Do you think you can do datasets starting from Neuclir*? |
So just checkout to this branch above and run the script above from |
Yes, better if you create a new branch from it so we won't have conflicts (because I'm working on it). For the PR, open it directly on the branch |
@imenelydiaker @Samoed
Currently, I have only access to D8s_v3 VM with 32gb of RAM and 8CPUs, 120gb SSD, so I'd say not too bad. So maybe some optimizations are needed, or even more beefy machine. I'll be OOK for the next 4 days, can take a look back at those if you have some pointers. |
I can try to run them on working machine |
I found a bug in the statistics calculation (I accidentally swapped docs and queries when calculating lengths) and will recalculate the incorrect stats |
I've added descriptive statistics to almost all tasks, but I need help running some of them. To do this, you can run the script from the
v2.0.0
branch.cc @imenelydiaker
The text was updated successfully, but these errors were encountered: