Data Mining in HIV-AIDS Surveillance System : Application to Portuguese Data

J Med Syst. 2017 Apr;41(4):51. doi: 10.1007/s10916-017-0697-4. Epub 2017 Feb 18.

Abstract

The Human Immunodeficiency Virus (HIV) is an infectious agent that attacks the immune system cells. Without a strong immune system, the body becomes very susceptible to serious life threatening opportunistic diseases. In spite of the great progresses on medication and prevention over the last years, HIV infection continues to be a major global public health issue, having claimed more than 36 million lives over the last 35 years since the recognition of the disease. Monitoring, through registries, of HIV-AIDS cases is vital to assess general health care needs and to support long-term health-policy control planning. Surveillance systems are therefore established in almost all developed countries. Typically, this is a complex system depending on several stakeholders, such as health care providers, the general population and laboratories, which challenges an efficient and effective reporting of diagnosed cases. One issue that often arises is the administrative delay in reports of diagnosed cases. This paper aims to identify the main factors influencing reporting delays of HIV-AIDS cases within the portuguese surveillance system. The used methodologies included multilayer artificial neural networks (MLP), naive bayesian classifiers (NB), support vector machines (SVM) and the k-nearest neighbor algorithm (KNN). The highest classification accuracy, precision and recall were obtained for MLP and the results suggested homogeneous administrative and clinical practices within the reporting process. Guidelines for reductions of the delays should therefore be developed nationwise and transversally to all stakeholders.

Keywords: Data mining; HIV-AIDS; Reporting delay; Surveillance data; Surveillance system.

MeSH terms

  • Acquired Immunodeficiency Syndrome / diagnosis
  • Acquired Immunodeficiency Syndrome / epidemiology
  • Algorithms*
  • Bayes Theorem
  • Data Mining / methods*
  • HIV Infections / diagnosis*
  • HIV Infections / epidemiology*
  • Humans
  • Neural Networks, Computer
  • Portugal / epidemiology
  • Support Vector Machine