Breast Cancer Detection with Reduced Feature Set

Comput Math Methods Med. 2015:2015:265138. doi: 10.1155/2015/265138. Epub 2015 May 19.

Abstract

This paper explores feature reduction properties of independent component analysis (ICA) on breast cancer decision support system. Wisconsin diagnostic breast cancer (WDBC) dataset is reduced to one-dimensional feature vector computing an independent component (IC). The original data with 30 features and reduced one feature (IC) are used to evaluate diagnostic accuracy of the classifiers such as k-nearest neighbor (k-NN), artificial neural network (ANN), radial basis function neural network (RBFNN), and support vector machine (SVM). The comparison of the proposed classification using the IC with original feature set is also tested on different validation (5/10-fold cross-validations) and partitioning (20%-40%) methods. These classifiers are evaluated how to effectively categorize tumors as benign and malignant in terms of specificity, sensitivity, accuracy, F-score, Youden's index, discriminant power, and the receiver operating characteristic (ROC) curve with its criterion values including area under curve (AUC) and 95% confidential interval (CI). This represents an improvement in diagnostic decision support system, while reducing computational complexity.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Biopsy, Fine-Needle
  • Breast Neoplasms / diagnosis*
  • Breast Neoplasms / pathology
  • Computational Biology
  • Databases, Factual
  • Decision Support Systems, Clinical
  • Female
  • Humans
  • Models, Statistical
  • Neural Networks, Computer
  • Principal Component Analysis
  • ROC Curve
  • Radiographic Image Interpretation, Computer-Assisted / methods