Identification of Axial Spondyloarthritis Patients in a Large Dataset: The Development and Validation of Novel Methods

J Rheumatol. 2020 Jan;47(1):42-49. doi: 10.3899/jrheum.181005. Epub 2019 Mar 15.

Abstract

Objective: Observational axial spondyloarthritis (axSpA) research in large datasets has been limited by a lack of adequate methods for identifying patients with axSpA, because there are no billing codes in the United States for most subtypes of axSpA. The objective of this study was to develop methods to accurately identify patients with axSpA in a large dataset.

Methods: The study population included 600 chart-reviewed veterans, with and without axSpA, in the Veterans Health Administration between January 1, 2005, and June 30, 2015. AxSpA identification algorithms were developed with variables anticipated by clinical experts to be predictive of an axSpA diagnosis [demographics, billing codes, healthcare use, medications, laboratory results, and natural language processing (NLP) for key SpA features]. Random Forest and 5-fold cross validation were used for algorithm development and testing in the training subset (n = 451). The algorithms were additionally tested in an independent testing subset (n = 149).

Results: Three algorithms were developed: Full algorithm, High Feasibility algorithm, and Spond NLP algorithm. In the testing subset, the areas under the curve with the receiver-operating characteristic analysis were 0.96, 0.94, and 0.86, for the Full algorithm, High Feasibility algorithm, and Spond NLP algorithm, respectively. Algorithm sensitivities ranged from 85.0% to 95.0%, specificities from 78.0% to 93.6%, and accuracies from 82.6% to 91.3%.

Conclusion: Novel axSpA identification algorithms performed well in classifying patients with axSpA. These algorithms offer a range of performance and feasibility attributes that may be appropriate for a broad array of axSpA studies. Additional research is required to validate the algorithms in other cohorts.

Keywords: ANKYLOSING SPONDYLITIS; COHORT STUDIES; DATABASES; SPONDYLOARTHROPATHY.

Publication types

  • Observational Study
  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Adult
  • Aged
  • Algorithms*
  • Anti-Citrullinated Protein Antibodies / blood
  • Antirheumatic Agents / therapeutic use
  • Area Under Curve
  • Biological Products / therapeutic use
  • Blood Sedimentation
  • C-Reactive Protein / analysis
  • Cohort Studies
  • Comorbidity
  • Datasets as Topic*
  • Female
  • HLA-B27 Antigen / blood
  • Humans
  • Male
  • Middle Aged
  • ROC Curve
  • Spondylitis, Ankylosing / blood
  • Spondylitis, Ankylosing / classification*
  • Spondylitis, Ankylosing / drug therapy

Substances

  • Anti-Citrullinated Protein Antibodies
  • Antirheumatic Agents
  • Biological Products
  • HLA-B27 Antigen
  • C-Reactive Protein