Automatically Assessing Quality of Online Health Articles

IEEE J Biomed Health Inform. 2021 Feb;25(2):591-601. doi: 10.1109/JBHI.2020.3032479. Epub 2021 Feb 5.

Abstract

Today Information in the world wide web is overwhelmed by unprecedented quantity of data on versatile topics with varied quality. However, the quality of information disseminated in the field of medicine has been questioned as the negative health consequences of health misinformation can be life-threatening. There is currently no generic automated tool for evaluating the quality of online health information spanned over broad range. To address this gap, in this paper, we applied data mining approach to automatically assess the quality of online health articles based on 10 quality criteria. We have prepared a labelled dataset with 53012 features and applied different feature selection methods to identify the best feature subset with which our trained classifier achieved an accuracy of [Formula: see text] varied over 10 criteria. Our semantic analysis of features shows the underpinning associations between the selected features & assessment criteria and further rationalize our assessment approach. Our findings will help in identifying high quality health articles and thus aiding users in shaping their opinion to make right choice while picking health related help from online.

MeSH terms

  • Communication*
  • Data Mining*
  • Humans
  • Internet
  • Semantics