This is the first formal release of the Softcite dataset. This release includes:
- Softcite dataset corpus file: softcite_corpus-full.tei.xml
- "Softcite Dataset: A Dataset of Software Mentions in Biomedical and Economic Research Publications", our paper describing the design considerations and creation process of the dataset. This is a preprint of our forthcoming publication in the Journal of the Association for Information Science and Technology.
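
For a first look at the corpus file, the following is a minimal sketch (not part of the release) that streams through a TEI XML file and counts elements by tag name using only the Python standard library. The helper name `count_elements` is hypothetical, and the sketch assumes nothing about the corpus schema beyond its being well-formed XML; the tag names it reports depend entirely on the file.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(source):
    """Count XML elements by local tag name.

    `source` is a file path or a binary file object. The file is parsed
    incrementally, so a large corpus does not have to fit in memory.
    """
    counts = Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        # Namespaced tags look like "{namespace-uri}name"; keep only "name".
        local_name = elem.tag.rsplit("}", 1)[-1]
        counts[local_name] += 1
        # Children have already been counted by the time their parent ends,
        # so the element's content can be released.
        elem.clear()
    return counts
```

Calling `count_elements("softcite_corpus-full.tei.xml")` from the directory containing the downloaded file, then inspecting `.most_common(10)` on the result, gives a quick overview of which elements dominate the corpus before committing to a fuller TEI-aware parser.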