Injury narrative text classification using factorization model

BMC Med Inform Decis Mak. 2015;15 Suppl 1(Suppl 1):S5. doi: 10.1186/1472-6947-15-S1-S5. Epub 2015 May 20.

Abstract

Narrative text is a useful way of identifying injury circumstances from the routine emergency department data collections. Automatically classifying narratives based on machine learning techniques is a promising technique, which can consequently reduce the tedious manual classification process. Existing works focus on using Naive Bayes which does not always offer the best performance. This paper proposes the Matrix Factorization approaches along with a learning enhancement process for this task. The results are compared with the performance of various other classification approaches. The impact on the classification results from the parameters setting during the classification of a medical text dataset is discussed. With the selection of right dimension k, Non Negative Matrix Factorization-model method achieves 10 CV accuracy of 0.93.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Data Mining / methods*
  • Humans
  • Machine Learning*
  • Medical Informatics / methods*
  • Medical Records*
  • Narration*
  • Wounds and Injuries*