OtoMatch: Content-based eardrum image retrieval using deep learning

Seda Camalan; Muhammad Khalid Khan Niazi; Aaron C Moberly; Theodoros Teknos; Garth Essig; Charles Elmaraghy; Nazhat Taj-Schaal; Metin N Gurcan

doi:10.1371/journal.pone.0232776

OtoMatch: Content-based eardrum image retrieval using deep learning

PLoS One. 2020 May 15;15(5):e0232776. doi: 10.1371/journal.pone.0232776. eCollection 2020.

Authors

Seda Camalan¹, Muhammad Khalid Khan Niazi¹, Aaron C Moberly², Theodoros Teknos³, Garth Essig², Charles Elmaraghy², Nazhat Taj-Schaal⁴, Metin N Gurcan¹

Affiliations

¹ Center for Biomedical Informatics, Wake Forest School of Medicine, Winston-Salem, North Carolina, United States of America.
² Department of Otolaryngology, Ohio State University, Columbus, Ohio, United States of America.
³ UH Seidman Cancer Center, Cleveland, Ohio, United States of America.
⁴ Department of Internal Medicine, Ohio State University, Columbus, Ohio, United States of America.

Abstract

Acute infections of the middle ear are the most commonly treated childhood diseases. Because complications affect children's language learning and cognitive processes, it is essential to diagnose these diseases in a timely and accurate manner. The prevailing literature suggests that it is difficult to accurately diagnose these infections, even for experienced ear, nose, and throat (ENT) physicians. Advanced care practitioners (e.g., nurse practitioners, physician assistants) serve as first-line providers in many primary care settings and may benefit from additional guidance to appropriately determine the diagnosis and treatment of ear diseases. For this purpose, we designed a content-based image retrieval (CBIR) system (called OtoMatch) for normal, middle ear effusion, and tympanostomy tube conditions, operating on eardrum images captured with a digital otoscope. We present a method that enables the conversion of any convolutional neural network (trained for classification) into an image retrieval model. As a proof of concept, we converted a pre-trained deep learning model into an image retrieval system. We accomplished this by changing the fully connected layers into lookup tables. A database of 454 labeled eardrum images (179 normal, 179 effusion, and 96 tube cases) was used to train and test the system. On a 10-fold cross validation, the proposed method resulted in an average accuracy of 80.58% (SD 5.37%), and maximum F1 score of 0.90 while retrieving the most similar image from the database. These are promising results for the first study to demonstrate the feasibility of developing a CBIR system for eardrum images using the newly proposed methodology.

Publication types

Research Support, N.I.H., Extramural

MeSH terms

Adult
Algorithms*
Child
Databases as Topic
Deep Learning*
Humans
Image Processing, Computer-Assisted*
Information Storage and Retrieval*
Reproducibility of Results
Tympanic Membrane / diagnostic imaging*

Grants and funding

R21 DC016972/DC/NIDCD NIH HHS/United States