-
Notifications
You must be signed in to change notification settings - Fork 21
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge edges from different KPs by primary_knowledge_source #2381
Comments
also see #1951 |
Based on the preliminary exploration it looks like the edges are not being merged due the the subject or object of the edges not getting the preferred curie in the edge key. |
@sundareswarpullela - if you remember from the other day when we explored this, the edge keys ARAX assigns do use the preferred curies, but include the KP name instead of the primary knowledge source (which is why they're not being merged between KPs) |
In the Example 1 acetaminophen test query: Edge 1: Edge 2: |
ah, I see, ok. yes, you're right! thanks for the examples. for some reason I thought we had determined that the final edge keys were being assigned after canonicalization, but I guess that must not be true. interesting. though it still is true that you'll also need to stop including the KP name in the edge keys (and instead include the primary KS). so both of those things will need to be addressed here. |
hey @sundareswarpullela - thanks for all the work on this! I was playing around with it on /test and had a couple questions. in comparing the same result to the CI version (without merging on primary KS), I noticed that when edges are merged it seems like only some of their and here's the edge from service provider: on /test, there is indeed only one such edge from bindingdb (yay), but the merged edge for bindingdb appears to only contain the the |
discussed in today's AHM
right now Expand does not merge edges from different KPs
but there is some duplicated information between KPs, which merging based on
primary_knowledge_source
may help eliminateit may not be perfect (e.g., the clinical trials KP may list different primary sources vs. what KG2 lists for its ingested CTKP edges), but it should at least be an improvement! and we could refine the tricky merging over time...
The text was updated successfully, but these errors were encountered: