False-positive reduction in computer-aided mass detection using mammographic texture analysis and classification

Comput Methods Programs Biomed. 2018 Jul:160:75-83. doi: 10.1016/j.cmpb.2018.03.026. Epub 2018 Mar 31.

Abstract

Background and objective: The aim of computer-aided-detection (CAD) systems for mammograms is to assist radiologists by marking region of interest (ROIs) depicting abnormalities. However, the confusing appearance of some normal tissues that visually look like masses results in a large proportion of marked ROIs with normal tissues. This paper copes with this problem and proposes a framework to reduce false positive masses detected by CAD.

Methods: To avoid the error induced by the segmentation step, we proposed a segmentation-free framework with particular attention to improve feature extraction and classification steps. We investigated for the first time in mammogram analysis, Hilbert's image representation, Kolmogorov-Smirnov distance and maximum subregion descriptors. Then, a feature selection step is performed to select the most discriminative features. Moreover, we considered several classifiers such as Random Forest, Support Vector Machine and Decision Tree to distinguish between normal tissues and masses. Our experiments were carried out on a large dataset of 10168 ROIs (8254 normal tissues and 1914 masses) constructed from the Digital Database for Screening Mammography (DDSM). To simulate practical scenario, our normal regions are false positives asserted by a CAD system from healthy cases.

Results: The combination of all the descriptors yields better results than each feature set used alone, and the difference is statistically significant. Besides, the feature selection steps yields a statistically significant increase in the accuracy values for the three classifiers. Finally, the random forest achieves the highest accuracy (81.09%), outperforming the SVM classifier (80.01%)) and decision tree (79.12%), but the difference is not statistically significant.

Conclusions: The accuracy of discrimination between normal and abnormal ROIs in mammograms obtained with the proposed gray level texture features sets are encouraging and comparable to these obtained with multiresolution features. Combination of several features as well as feature selection steps improve the results. To improve false positives reduction in CAD systems for breast cancer diagnosis, these features could be combined with multiresolution features.

Keywords: Breast cancer diagnosis; False-positive reduction; Hilbert’s image representation; Mammography.

MeSH terms

  • Breast Neoplasms / diagnostic imaging*
  • Databases, Factual / statistics & numerical data
  • Decision Trees
  • False Positive Reactions
  • Female
  • Humans
  • Mammography / statistics & numerical data*
  • Radiographic Image Interpretation, Computer-Assisted / methods*
  • Software Design
  • Support Vector Machine