Tidying and mapping Scopus and Journal Metrics data

This repository creates user-friendly (tidy) TSVs of data from Scopus and Journal Metrics and converts the data to NLM journal IDs for PubMed integration. Data pulled from Scopus include journal subject areas and open access status. Data pulled from Journal Metrics include three measures of journal prestige (CiteScore, SJR, SNIP) and a Scopus–ISSN mapping.
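As a rough illustration of the conversion described above, the sketch below links Scopus records to NLM journal IDs through shared ISSNs. It is not taken from the repository's notebooks: the function name `map_scopus_to_nlm`, the record layout, and every identifier in the toy data are hypothetical.

```python
from collections import defaultdict


def map_scopus_to_nlm(scopus_issns, nlm_issns):
    """Return sorted (scopus_id, nlm_id) pairs for journals that share an ISSN.

    Both arguments are iterables of (journal_id, issn) tuples. This layout is
    an assumption made for illustration, not the repository's actual schema.
    """
    # Index NLM journal IDs by ISSN so each Scopus ISSN is a single lookup.
    issn_to_nlm = defaultdict(set)
    for nlm_id, issn in nlm_issns:
        issn_to_nlm[issn].add(nlm_id)

    # Collect pairs in a set so a journal matched via two ISSNs appears once.
    pairs = set()
    for scopus_id, issn in scopus_issns:
        for nlm_id in issn_to_nlm.get(issn, ()):
            pairs.add((scopus_id, nlm_id))
    return sorted(pairs)


# Hypothetical toy records, not real Scopus or NLM data.
scopus_issns = [("S1", "1111-1111"), ("S1", "2222-2222"), ("S2", "3333-3333")]
nlm_issns = [("N1", "2222-2222"), ("N2", "4444-4444")]

mapping = map_scopus_to_nlm(scopus_issns, nlm_issns)
print(mapping)
```

In this toy input only S1 and N1 share an ISSN, so they form the single mapped pair; journals without a shared ISSN are simply left unmapped.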

Execution is performed by running notebooks in the following order:

Scopus titles

The data directory contains the following tidy outputs:

Scopus mappings

The data directory contains the following tidy outputs:

Journal metrics

Source and version info

This repository is built from the following publicly available inputs in the download directory:

  • extlistJuly2021.xlsx: Scopus title list (from "Download Scopus Source List" at source)
  • CiteScore 2011-2020 new methodology - May 2021.xlsb: Journal Metrics
  • pubmed-journals.tsv: PubMed journal information (source via process-nlm-catalog.ipynb)
  • 20210912.ISSN-to-ISSN-L.txt.gz: The "ISSN-L matching table", extracted and compressed from issnltables.zip, which is available upon request from ISSN.
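For readers unfamiliar with the gzipped ISSN-L matching table listed above, here is a minimal sketch of loading such a file into a lookup from ISSN to ISSN-L using only the standard library. The two-column, tab-separated layout with a header row is an assumption about the file, the helper name `read_issn_l_table` is hypothetical, and the sample rows are made up.

```python
import csv
import gzip
import io


def read_issn_l_table(fileobj):
    """Read a tab-separated ISSN to ISSN-L table into a dict.

    Assumes the first row is a header and the first two columns are
    ISSN and ISSN-L. This layout is an assumption, not a documented fact.
    """
    reader = csv.reader(fileobj, delimiter="\t")
    next(reader, None)  # skip the assumed header row
    return {row[0]: row[1] for row in reader if len(row) >= 2}


# Build a small gzipped sample in memory (made-up ISSNs) instead of
# reading the real file.
sample = "ISSN\tISSN-L\n0000-0001\t0000-0001\n0000-0002\t0000-0001\n"
compressed = gzip.compress(sample.encode("utf-8"))

with gzip.open(io.BytesIO(compressed), mode="rt", encoding="utf-8", newline="") as handle:
    issn_to_issn_l = read_issn_l_table(handle)

print(issn_to_issn_l)
```

The real file could be opened the same way by passing its path to `gzip.open` in text mode. Collapsing ISSNs to their ISSN-L lets print and electronic ISSNs of the same journal be treated as one key.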

Environment

This repository uses conda to manage its environment as specified in environment.yml. Install the environment with:

conda env create --file=environment.yml

Then use conda activate scopus and conda deactivate to activate or deactivate the environment.

License

All original work in this repository is dedicated to the public domain under CC0 1.0 Universal. Note that this repository incorporates publicly available datasets that were not explicitly released with a public license. The authors of this repository claim no ownership of this content.