The Rametrix PRO Toolbox v1.0 for MATLAB®

PeerJ. 2020 Jan 6:8:e8179. doi: 10.7717/peerj.8179. eCollection 2020.

Abstract

Background: Existing tools for chemometric analysis of vibrational spectroscopy data have enabled characterization of materials and biologicals by their broad molecular composition. The Rametrix LITE Toolbox v1.0 for MATLAB® is one such tool available publicly. It applies discriminant analysis of principal components (DAPC) to spectral data to classify spectra into user-defined groups. However, additional functionality is needed to better evaluate the predictive capabilities of these models when "unknown" samples are introduced. Here, the Rametrix PRO Toolbox v1.0 is introduced to provide this capability.

Methods: The Rametrix PRO Toolbox v1.0 was constructed for MATLAB® and works with the Rametrix LITE Toolbox v1.0. It performs leave-one-out analysis of chemometric DAPC models and reports predictive capabilities in terms of accuracy, sensitivity (true-positives), and specificity (true-negatives). RametrixPRO is available publicly through GitHub under license agreement at: https://github.com/SengerLab/RametrixPROToolbox. Rametrix PRO was used to validate Rametrix LITE models used to detect chronic kidney disease (CKD) in spectra of urine obtained by Raman spectroscopy. The dataset included Raman spectra of urine from 20 healthy individuals and 31 patients undergoing peritoneal dialysis treatment for CKD.

Results: The number of spectral principal components (PCs) used in building the DAPC model impacted the model accuracy, sensitivity, and specificity in leave-one-out analyses. For the dataset in this study, using 35 PCs in the DAPC model resulted in 100% accuracy, sensitivity, and specificity in classifying an unknown Raman spectrum of urine as belonging to a CKD patient or a healthy volunteer. Models built with fewer or greater number of PCs showed inferior performance, which demonstrated the value of Rametrix PRO in evaluating chemometric models constructed with Rametrix LITE.

Keywords: Discriminant analysis; MATLAB; Nephrology; Prediction; Principal component analysis; Raman spectroscopy; Spectral data analysis; Urine.

Grants and funding

Ryan Senger receives salary support from HATCH funding. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.