Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PSR dataset improvement #46

Open
fmocking opened this issue Jan 25, 2022 · 0 comments
Open

PSR dataset improvement #46

fmocking opened this issue Jan 25, 2022 · 0 comments

Comments

@fmocking
Copy link

Hello, thank you for providing such a great dataset collection to the community. I was recently working on the PSR dataset and noticed some possible improvements can be made. Mainly, about maintainability and some more information.

  1. It would be beneficial to keep the dataset up to date. For example at the time of the dataset publication, CASP 11 stage 2 was selected as a test set. I think this can be updated to a more recent version. This can require some versioning at the dataset level to make it consistent. It can be named according to the last full CASP name.
  2. It is related to the last point, currently, it is challenging to extend it and keep it consistent. Maybe some guidelines can be helpful.
  3. I couldn't find a way to tell if a sample is from stage 1 or stage 2. I'm not sure if I'm missing something but there is no information available for custom splits.

Again, thank you for sharing your great work.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant