Skip to content

conorjudge/robotsearch

 
 

Repository files navigation

RobotSearch

Welcome to RobotSearch, software for filtering RCTs from a search result as described in our paper in the Journal of Research Synthesis Methods.

Marshall I, Storr AN, Kuiper J, Thomas J, Wallace BC. Machine Learning for Identifying Randomized Controlled Trials: an evaluation and practitioner's guide. Res Syn Meth. 2018. https://doi.org/10.1002/jrsm.1287

We offer RobotSearch free of charge, but we'd be most grateful if you would cite us if you use it. We're academics, and thrive on links and citations! Getting RobotSearch widely used and cited helps us obtain the funding to maintain the project and make RobotSearch better.

It also makes your methods transparent to your readers, and not least we'd love to see where RobotSearch is used! :)

The easy way

For most people, we encourage you to use RobotSearch via our website.

No need to install anything, simply upload your RIS files (Ovid or PubMed format), and instantly download the filtered version containing RCTs only. N.B. To ensure the highest accuracy, please make sure that your RIS files are exported with all the fields needed (see below for instructions for Ovid and Pubmed).

RobotSearch web screenshot

RobotSearch online

For those who are particularly technically minded, or have a pressing need to run the software on their own machines, read on...

Installation instructions

Currently this software runs from the Command Prompt (in Windows), or the Terminal (in Mac, or Linux).

  1. Before installing RobotSearch, you will need to install Python 3. We recommend that you use the MiniConda Python distribution (N.B. choose version 3.6 or higher). You can download this here.

  2. Open up the Terminal (or Command Prompt in Windows). This is also how you will interact with RobotSearch when you use it.

  3. Install RobotSearch using the following command (easiest to copy/paste): pip install -U https://github.com/ijmarshall/robotsearch/archive/master.zip

  4. RobotSearch should be automatically downloaded and installed on your machine. The software is >200MB in size, so this process may take some time depending on how fast your internet connection is.

  5. You should be ready to go!

The website

We include the code for the online version also in this repository. The easiest way to run is via Docker.

From within the code directory run:

docker build -t robotsearch

If the build is successful, you can then start the website locally by running:

./start.sh

You can then access the website on any webbrowser on your local machine at: http://localhost:5050.

To stop the websever, run:

docker stop robotsearch

How it works

RobotSearch uses machine learning as an alternative to string-based study design filters.

It works with MEDLINE searches exported from either PubMed or Ovid.

  1. You should conduct your search as normal (NB do not use any search terms or filters to restrict to RCTs at this stage!).

  2. Then export your results in RIS format (see more detailed instrutions below on how to do this in PubMed and Ovid)

  3. From the Command Prompt/Terminal, run the robotsearch command: robotsearch my_file.ris

  4. Your search results will be saved as my_file_robotreviewer_RCTs.ris

Changing settings

By default, RobotSearch runs a sensitive search (i.e. very high likelihood that all RCTs will be retrieved, at expense of sometimes including non-RCTs) - this is suitable for a systematic review.

To run a precise search (i.e. the retrieved articles have a very high likelihood of being RCTs, but at the expense of missing a tiny proportion), run with an extra -p flag, e.g.:

robotsearch my_file.ris -p

Exporting from PubMed

How to export from PubMed

  1. Select 'Send to' (located in the upper left of the search results)
  2. Under 'Choose Destination' select File
  3. Under 'Format', select 'MEDLINE' from the pulldown menu
  4. Click the 'Create File' button

Exporting from Ovid

1. Exporting from Ovid - select 'All' in the top left, then click the Export text 2. Exporting from Ovid - select 'EndNote' as the export format, then select 'Custom Fields', and click the 'Select Fields' button (Endnote format includes the 'publication type' field, the RIS format does not) 2. Exporting from Ovid - select the following 3 fields to export: ab: Abstract, pt: Publication Type, and ti: Title

  1. Click the tickbox next to 'All' above the search results on the left hand side.
  2. Click on Export
  3. In the 'Export to' pulldown menu, select 'RIS'
  4. Under 'Select Fields to Display' select 'Custom Fields', then click the 'Select Fields' button
  5. In the Select Fields box, select the following fields: ab: Abstract, pt: Publication Type, and ti: Title. You may deselect any others.
  6. Click 'Save' in the bottom left
  7. Click 'Export Citation(s)' in the bottom right
  8. The citations will be exported, but there may be a wait of a minute or two.

Testing

RobotSearch has a optional test mode, which runs through a standardised search result, and double checks that the software is returning the same results as in the validation study. NB this takes around 5--10 minutes to run, depending on the speed of your machine.

To run this, type:

robotsearch -t

Contributing/Using our machine learning in other tools

We would love software developers, and people who make biblographic databases to integrate this method into their systems. All the software is open source under the GNU GPL v3. Please contact us ([email protected]) to discuss further.

We also welcome contributions from the community; please contact us (or submit an issue via Github) if you are interested in improving the software.

Using RobotSearch as a Python module

RobotSearch may be called as a Python module, from within the root directory. See the example IPython notebook for how to do this.

Acknowledgements

This work is dependent on the work of the Cochrane Crowd, for whom we'd like to express enormous gratitude for their work, and in making their data available for training our machine learning models.

Support

This work is supported by: National Institutes of Health (NIH) under the National Library of Medicine, grant R01-LM012086-01A1, "Semi-Automating Data Extraction for Systematic Reviews", and by NIH grant 5UH2CA203711-02, "Crowdsourcing Mark-up of the Medical Literature to Support Evidence-Based Medicine and Develop Automated Annotation Capabilities", and the UK Medical Research Council (MRC), through its Skills Development Fellowship program, grant MR/N015185/1

Cite

If you use RobotSearch, we'd love it if you could cite our paper, and also let us know! (you can email [email protected])

Marshall IJ, Noel-Storr A, Kuiper J, Thomas J, Wallace BC. Machine Learning for Identifying Randomized Controlled Trials: an evaluation and practitioner's guide. Res Syn Meth. 2018. https://doi.org/10.1002/jrsm.1287

About

Machine learning RCT classifier

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • JavaScript 72.5%
  • Python 12.0%
  • Jupyter Notebook 10.3%
  • HTML 3.5%
  • Dockerfile 0.8%
  • CSS 0.8%
  • Shell 0.1%