Refining adverse drug reaction signals by incorporating interaction variables identified using emergent pattern mining

Comput Biol Med. 2016 Feb 1:69:61-70. doi: 10.1016/j.compbiomed.2015.11.014. Epub 2015 Dec 4.

Abstract

Purpose: To develop a framework for identifying and incorporating candidate confounding interaction terms into a regularised cox regression analysis to refine adverse drug reaction signals obtained via longitudinal observational data.

Methods: We considered six drug families that are commonly associated with myocardial infarction in observational healthcare data, but where the causal relationship ground truth is known (adverse drug reaction or not). We applied emergent pattern mining to find itemsets of drugs and medical events that are associated with the development of myocardial infarction. These are the candidate confounding interaction terms. We then implemented a cohort study design using regularised cox regression that incorporated and accounted for the candidate confounding interaction terms.

Results: The methodology was able to account for signals generated due to confounding and a cox regression with elastic net regularisation correctly ranking the drug families known to be true adverse drug reactions above those that are not. This was not the case without the inclusion of the candidate confounding interaction terms, where confounding leads to a non-adverse drug reaction being ranked highest.

Conclusions: The methodology is efficient, can identify high-order confounding interactions and does not require expert input to specify outcome specific confounders, so it can be applied for any outcome of interest to quickly refine its signals. The proposed method shows excellent potential to overcome some forms of confounding and therefore reduce the false positive rate for signal analysis using longitudinal data.

Keywords: Confounding; Data mining; Emergent pattern mining; Medical informatics; Observational data; Signal refinement.

MeSH terms

  • Adverse Drug Reaction Reporting Systems*
  • Data Mining / methods*
  • Databases, Factual*
  • Humans
  • Myocardial Infarction / drug therapy*
  • Pattern Recognition, Automated / methods*