Systematic review using a spiral approach with machine learning

Amirhossein Saeidmehr; Piers David Gareth Steel; Faramarz F Samavati

doi:10.1186/s13643-023-02421-z

Syst Rev. 2024 Jan 17;13(1):32. doi: 10.1186/s13643-023-02421-z.

Authors

Amirhossein Saeidmehr¹, Piers David Gareth Steel², Faramarz F Samavati³

Affiliations

¹ Computer Science Department, University of Calgary, 2500 University Dr., Calgary, Canada. amir.saeidmehr@cpsc.ucalgary.ca.
² Haskayne School of Business, University of Calgary, 2500 University Dr., Calgary, Canada.
³ Computer Science Department, University of Calgary, 2500 University Dr., Calgary, Canada.

Abstract

With the academic corpus growing at an accelerating rate, doubling every 9 years, machine learning is a promising way to keep systematic review manageable. Although several notable advances have already been made, machine learning is still incorporated suboptimally: it relies on a sequential, staged process designed for a purely human workflow, as exemplified by PRISMA. Here, we test a spiral (alternating or oscillating) approach, in which full-text screening is interleaved with title/abstract screening. We examine this approach by simulation on three datasets under 360 conditions comprising different algorithmic classifiers, feature extractions, prioritization rules, data types, and information provided (e.g., title/abstract, full text included). Overwhelmingly, the results favored a spiral processing approach with logistic regression, TF-IDF for vectorization, and maximum probability for prioritization. Results demonstrate up to a 90% improvement over traditional machine learning methodologies, especially for databases with fewer eligible articles. With these advancements, the screening component of most systematic reviews should remain functionally achievable for another one to two decades.
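As a rough illustration of the configuration the abstract favors (logistic regression, TF-IDF vectorization, maximum-probability prioritization, and full-text decisions fed back between title/abstract decisions), the following is a minimal sketch in standard-library Python. It is one possible reading of the spiral idea, not the authors' implementation: the function names, the toy records, the whitespace tokenizer, the hand-written gradient-descent classifier, and the `abstract_label` / `fulltext_label` callbacks that stand in for human reviewers are all assumptions made for this example.

```python
import math
from collections import Counter

def tfidf(docs):
    """Return one sparse TF-IDF dict per document (smoothed IDF, L2-normalised)."""
    tokenised = [doc.lower().split() for doc in docs]
    df = Counter(term for tokens in tokenised for term in set(tokens))
    n = len(docs)
    vectors = []
    for tokens in tokenised:
        tf = Counter(tokens)
        vec = {t: c * (math.log((1 + n) / (1 + df[t])) + 1) for t, c in tf.items()}
        norm = math.sqrt(sum(v * v for v in vec.values())) or 1.0
        vectors.append({t: v / norm for t, v in vec.items()})
    return vectors

def predict(weights, bias, vec):
    """Logistic-regression probability that a record is relevant."""
    z = bias + sum(weights.get(t, 0.0) * v for t, v in vec.items())
    z = max(-30.0, min(30.0, z))  # clamp to keep math.exp from overflowing
    return 1.0 / (1.0 + math.exp(-z))

def fit(vectors, labels, epochs=200, lr=1.0):
    """Unregularised logistic regression trained by stochastic gradient descent."""
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for vec, y in zip(vectors, labels):
            err = predict(weights, bias, vec) - y
            bias -= lr * err
            for t, v in vec.items():
                weights[t] = weights.get(t, 0.0) - lr * err * v
    return weights, bias

def spiral_screen(records, abstract_label, fulltext_label, seed_ids, budget):
    """Screen up to `budget` records (seeds included), most probable first.

    abstract_label(i) and fulltext_label(i) are hypothetical callbacks that
    return True/False for record i; they stand in for human reviewers.
    """
    def decide(i):
        # Assumed spiral step: a record that passes title/abstract screening
        # goes straight to full-text screening, and the full-text decision
        # becomes the training label instead of waiting for a later stage.
        return int(fulltext_label(i)) if abstract_label(i) else 0

    vectors = tfidf(records)
    labels = {i: decide(i) for i in seed_ids}
    order = list(seed_ids)
    while len(labels) < min(budget, len(records)):
        train = list(labels)
        w, b = fit([vectors[i] for i in train], [labels[i] for i in train])
        pool = [i for i in range(len(records)) if i not in labels]
        # Maximum-probability prioritization: screen the likeliest record next.
        nxt = max(pool, key=lambda i: predict(w, b, vectors[i]))
        labels[nxt] = decide(nxt)
        order.append(nxt)
    return order, labels

# Toy title/abstract strings, invented for illustration only.
records = [
    "machine learning screening for systematic review",
    "crop yield under drought stress",
    "active learning screening systematic review automation",
    "soil moisture and crop irrigation",
    "machine learning prioritization systematic review",
    "drought tolerance in wheat crop",
]
relevant = {0, 2, 4}
order, labels = spiral_screen(
    records,
    abstract_label=lambda i: i in relevant,
    fulltext_label=lambda i: i in relevant,
    seed_ids=[0, 1],
    budget=4,
)
print(order, labels)
```

In this toy run the two seed records (one relevant, one not) train the first model, and each later pick is the unscreened record with the highest predicted probability. A realistic simulation would use an established library classifier, real screening labels, and a stopping rule, none of which are specified in the abstract.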

Keywords: Active learning; Machine learning; Systematic review; Technology-assisted review.

MeSH terms

Computer Simulation
Machine Learning*
Systematic Reviews as Topic*