Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate if we can collect patent citations #39

Open
india-kerle opened this issue Jan 23, 2023 · 2 comments · May be fixed by #48
Open

Investigate if we can collect patent citations #39

india-kerle opened this issue Jan 23, 2023 · 2 comments · May be fixed by #48
Assignees

Comments

@india-kerle
Copy link
Contributor

to create CD index

@india-kerle india-kerle self-assigned this Jan 23, 2023
@georgerichardson
Copy link
Contributor

image

Hopefully this makes sense. What we have collected already are the 'focal patents'. We need the IDs of the patents that our focal patents cite. We also need the IDs of the patents which cite our focal patents and their references.

To be consistent with OpenAlex:

  1. Back citations: Collect citation data for the focal patents and save as a json like
{
    focal_patent_id_0: [cited_patent_id_0, cited_patent_id_1, ...],
    focal_patent_id_0: [cited_patent_id_1, cited_patent_id_6, ...],
}
  1. Forward citations: For all of the focal patents, find the patents that cite them. For each of these patents, collect their citation data in the same format above. Save separately as forward citations. This might yield a lot of patents. For OpenAlex I had to save the outputs by year.

@india-kerle india-kerle linked a pull request Jan 26, 2023 that will close this issue
11 tasks
@india-kerle
Copy link
Contributor Author

notes from chat w/ george to threshold citations:

Threshold:

  • Types of citations (based on the data dictionary)
  • minimum # of citations per paper
  • what about just doing a couple of years? Latest date is 2017 - DON’T COLLECT 2018 - 2022
  • what about collecting for two years - 2007 AND 2017

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants