Agents duplication on ontology parsing #644
Comments
Here are two possible solutions:
I will rephrase to express only one "proposed" solution:
OK
Not really possible for now, as we don't have a configuration workflow in the UI (see ontoportal-lirmm/bioportal_web_ui#836).
In fact, I did not mean to have this in a "general" admin panel, but in an ontology-specific panel: the one we are talking about creating by splitting the "Edit submission" page into two main parts, (i) one related to metadata and (ii) one related to how AgroPortal deals with the ontology. Typically, this would go in the second part. For the moment, this plan to separate Edit submission into two parts is a UI-only contribution, which means all of these settings would still be based on properties of a submission. In other words, we only have to create a boolean property extractAgentsFromSourceFile and then use it in the processing workflow to decide whether or not to skip agent extraction.
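To illustrate the idea of gating extraction on a per-submission boolean, here is a minimal sketch in plain Ruby. Everything in it is hypothetical: the `Submission` struct, the snake_case attribute `extract_agents_from_source_file` (standing in for the proposed extractAgentsFromSourceFile property), and the helper names are assumptions for illustration, not the actual `ontologies_linked_data` model or workflow.

```ruby
# Hypothetical stand-in for a submission record; the real model lives in
# ontologies_linked_data and is not reproduced here.
Submission = Struct.new(:acronym, :extract_agents_from_source_file, keyword_init: true)

# Placeholder for the real extraction step (assumption: it yields agent strings).
def extract_agents(raw_agent_strings)
  raw_agent_strings.map(&:strip).reject(&:empty?)
end

# Skip agent extraction entirely unless the per-submission flag is set.
def agents_for(submission, raw_agent_strings)
  return [] unless submission.extract_agents_from_source_file

  extract_agents(raw_agent_strings)
end
```

With such a flag, a curated ontology could turn extraction off so that later submissions do not recreate agents that were already cleaned up.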
I don't see the need to overcomplicate things for such a small feature, but if we plan to add more options for controlling how AgroPortal handles each ontology separately, then why not (to be discussed in the next meeting). I also don't prefer the solution of extracting only from the first submission.
The feature was enabled in ontoportal-lirmm/ontologies_linked_data#154. The current code is here: https://github.com/ontoportal-lirmm/ontologies_linked_data/blob/master/lib/ontologies_linked_data/services/submission_process/operations/submission_extract_metadata.rb#L276
Discussed today: we accept the consequence that extracting an agent from ontology2 could recreate an agent that already exists for ontology1. In other words, any parsing that extracts agents needs a curation of the agents afterwards. When implementing the new ontology parsing report, we shall list the agents extracted.
This approach lets us implement a solution that is independent of the ontology and does not rely on a parameter (global or ontology-specific). Note: the behaviour proposed for the "person and organization" category is the opposite of the default behaviour, which always gives priority to what is in the file over what we have in the metadata record.
Another solution would consist of remembering that an "agent string" has already been extracted, for instance by:
This solution is not preferred, as it would require curating again things that have already been curated if the "agent string" changed in any way.
Describe the bug
Parsing a new submission of an ontology sometimes creates duplicate Agent objects with the same name.
The conditions under which this happens are not always clear, but people described in the ontology file with a string that concatenates more than just a name (e.g. "Guillaume Alviset https://orcid.org/0009-0004-4295-6593") will trigger that behavior.
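As an illustration of why a concatenated string can miss an existing agent, the sketch below splits such a string into a name and an ORCID URL before looking for a match. This is not the project's extraction code: the helper names, the hash-based agent representation, and the matching rule (ORCID first, then case-insensitive name) are all assumptions made for the example.

```ruby
# Matches an ORCID URL such as https://orcid.org/0009-0004-4295-6593.
ORCID_URL = %r{https?://orcid\.org/\d{4}-\d{4}-\d{4}-\d{3}[\dX]}

# Split a raw agent string into a name and an optional ORCID URL.
def split_agent_string(raw)
  orcid = raw[ORCID_URL]                            # nil when no ORCID is present
  name = raw.sub(ORCID_URL, '').strip.squeeze(' ')  # what remains is treated as the name
  { name: name, orcid: orcid }
end

# Hypothetical lookup: prefer an ORCID match, fall back to a
# case-insensitive name match. Agents are plain hashes here.
def find_existing(agents, parsed)
  agents.find do |agent|
    (parsed[:orcid] && agent[:orcid] == parsed[:orcid]) ||
      agent[:name].casecmp?(parsed[:name])
  end
end
```

Comparing the raw string against stored names would fail for the example above, whereas the split name "Guillaume Alviset" can match an existing record. Matching on name alone can still merge homonyms, so curation would remain necessary.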