Medline abstracts not read in correctly #30

Open
ellereve opened this issue May 15, 2020 · 2 comments

@ellereve

When reading in a RIS txt file, MEDLINE abstracts are not always read in properly. Instead of all the information ending up in the single abstract column, a separate column is created for each abstract subsection (for example, when a journal divides its abstract into explicit sections such as Background, Objectives, Methods, Results, and Conclusions). The abstract column then contains only the first subsection (e.g., Background), and a separate column is generated for each subsequent subsection (e.g., CONC, where the non-NA column contents always start with LUSIONS followed by the conclusions text from the abstract).

@ellereve
Author

ellereve commented May 18, 2020

It seems like quite a task to fix this, since the abstract subsection headings aren't very regular. Maybe just a note about the issue in the function documentation would be helpful to others. I've worked around the chopped-up abstract problem by converting my RIS to BIB in Zotero, reading both the RIS and BIB files into R, and merging the columns I feel are correct into a single data frame (not so elegant...).
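
For anyone hitting the same symptom, here is a rough sketch of another way to patch things up after import: stitching the stray columns back into the abstract. It is written in Python purely for illustration and assumes exactly the pattern described above, where each stray column is named with the first four letters of a section heading (e.g., CONC) and its value holds the rest of the heading plus the text (LUSIONS ...). The names `rejoin_abstract`, `section_cols`, and the sample record are hypothetical and not part of any package.

```python
import math


def _is_missing(value):
    """Treat None, NaN, and empty strings as missing."""
    if value is None:
        return True
    if isinstance(value, float) and math.isnan(value):
        return True
    return str(value).strip() == ""


def rejoin_abstract(record, section_cols, abstract_key="abstract"):
    """Return a copy of `record` with split-off abstract sections merged back.

    Assumption: each stray column name is the first four letters of a
    section heading, and its value is the remainder of that heading
    followed by the section text.
    """
    merged = dict(record)  # leave the caller's record untouched
    parts = []
    if not _is_missing(merged.get(abstract_key)):
        parts.append(str(merged[abstract_key]).strip())
    for col in section_cols:
        value = merged.pop(col, None)
        if not _is_missing(value):
            # "CONC" + "LUSIONS: ..." -> "CONCLUSIONS: ..."
            parts.append(col + str(value).strip())
    merged[abstract_key] = " ".join(parts)
    return merged


# Made-up record that mimics the chopped-up import
record = {
    "title": "Example title",
    "abstract": "BACKGROUND: Some background.",
    "METH": "ODS: Some methods.",
    "CONC": "LUSIONS: Some conclusions.",
}
fixed = rejoin_abstract(record, section_cols=["METH", "CONC"])
print(fixed["abstract"])
```

The sections are joined in the order given in `section_cols`, so the caller has to list the stray columns in the order they appear in the abstract. Since the subsection headings are irregular, that list would likely need to be built per data set.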

@mtnbikerjoshua

Hi Kelly,
I'm having trouble reproducing this issue. I tried reading a MEDLINE file like this one: pubmed-21061207.txt and had no issues. Could you upload an example file?
