You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When reading in a RIS txt file, abstracts for MEDLINE are not always properly read in. Instead of all information in the single abstract column, several columns are created for each abstract subsection (for example, when a journal divides its abstract into the explicit sections Background, Objectives, Methods, Results, Conclusions, etc.) The abstract column then only contains the information in the first subsection (e.g., Background) and separate columns are generated for each proceeding subsection (e.g., CONC, where the non-na column contents always start with LUSIONS followed by the conclusions text from the abstract).
The text was updated successfully, but these errors were encountered:
It seems like quite a task to try and fix that since the abstract subsection words aren't extremely regular. Maybe just a note in the function documentation about the issue would be helpful to others. I've worked around the chopped-up abstract problem by converting my RIS to BIB in Zotero, reading both the cis and bib into R, and merging the columns I feel are correct into a single data frame (not so elegant..)
Hi Kelly,
I'm having trouble reproducing this issue. I tried reading a MEDLINE file like this one: pubmed-21061207.txt and had no issues. Could you upload an example file?
When reading in a RIS txt file, abstracts for MEDLINE are not always properly read in. Instead of all information in the single abstract column, several columns are created for each abstract subsection (for example, when a journal divides its abstract into the explicit sections Background, Objectives, Methods, Results, Conclusions, etc.) The abstract column then only contains the information in the first subsection (e.g., Background) and separate columns are generated for each proceeding subsection (e.g., CONC, where the non-na column contents always start with LUSIONS followed by the conclusions text from the abstract).
The text was updated successfully, but these errors were encountered: