Medline abstracts not read in correctly #30

ellereve · 2020-05-15T10:03:22Z

When reading in a RIS txt file, abstracts for MEDLINE are not always properly read in. Instead of all information in the single abstract column, several columns are created for each abstract subsection (for example, when a journal divides its abstract into the explicit sections Background, Objectives, Methods, Results, Conclusions, etc.) The abstract column then only contains the information in the first subsection (e.g., Background) and separate columns are generated for each proceeding subsection (e.g., CONC, where the non-na column contents always start with LUSIONS followed by the conclusions text from the abstract).

ellereve · 2020-05-18T12:36:46Z

It seems like quite a task to try and fix that since the abstract subsection words aren't extremely regular. Maybe just a note in the function documentation about the issue would be helpful to others. I've worked around the chopped-up abstract problem by converting my RIS to BIB in Zotero, reading both the cis and bib into R, and merging the columns I feel are correct into a single data frame (not so elegant..)

mtnbikerjoshua · 2021-12-22T20:30:38Z

Hi Kelly,
I'm having trouble reproducing this issue. I tried reading a MEDLINE file like this one: pubmed-21061207.txt and had no issues. Could you upload an example file?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Medline abstracts not read in correctly #30

Medline abstracts not read in correctly #30

ellereve commented May 15, 2020

ellereve commented May 18, 2020 •

edited

Loading

mtnbikerjoshua commented Dec 22, 2021

Medline abstracts not read in correctly #30

Medline abstracts not read in correctly #30

Comments

ellereve commented May 15, 2020

ellereve commented May 18, 2020 • edited Loading

mtnbikerjoshua commented Dec 22, 2021

ellereve commented May 18, 2020 •

edited

Loading