-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Training datasets for ML/AI - publication centric #1181
Comments
JSON makes sense. How urgent is this? |
It would be good to have it in a few of weeks I think to keep the ball rolling. I'm meeting the ePMC ML person on Monday if you want to join (forwarded the invite) |
I'll start this on Monday. It might take a couple of days because the existing code needs improving first. A lot was written in a hurry for PomBase v2. Now I've had time (7 years?) to think about it, there are better ways to do things. Proposed JSON structure (work in progress): PMID:
|
From Zoom: make sure to include annotation comments in the output. |
related: #1185 we'll discuss this on the next call.... |
After the chat with ePMC a while ago, I'm wondering if it's useful to create a file like this in advance. It sounded like there are particular file formats that each group uses. So perhaps we should create files when asked? Unless it's a especially wacky format I think I could create files on request with a 24 turn-around. |
OK keep this on the back burner. |
Create a "publication centric" file containing all entities / annotations (all datatypes) for each publication.
Json?
The text was updated successfully, but these errors were encountered: