A Text Structuring Method for Chinese Medical Text Based on Temporal Information

Int J Environ Res Public Health. 2018 Feb 27;15(3):402. doi: 10.3390/ijerph15030402.

Abstract

Chinese electronic medical records (EMRs) contain a large amount of complex medical free text that includes a variety of information, such as temporal information, patients' symptoms, and laboratory data. However, although this unstructured text is an important knowledge base, it is hard for computers to process directly in support of further medical research. This paper proposes a novel text structuring method that extracts knowledge from EMR texts and reorganizes it in chronological order according to the temporal information in the text. Experiments that implement several entropy-based algorithms for comparison evaluate the performance of the proposed method and indicate that it can significantly reduce the complexity of EMR text. This work is significant in structuring EMR free text into temporally structured data for further medical analysis.

Keywords: Chinese; electronic medical records; information entropy; temporal information; text structuring method.
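
The abstract refers to information entropy as a measure of text complexity but does not give the formula or the specific algorithms used. As a minimal illustrative sketch only, the following computes the standard Shannon entropy of a token sequence; the function name `shannon_entropy` and the toy token lists are assumptions made for illustration and are not taken from the paper.

```python
import math
from collections import Counter


def shannon_entropy(tokens):
    """Shannon entropy (in bits) of the empirical distribution of tokens.

    Hypothetical helper for illustration; not the paper's own algorithm.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return 0.0  # an empty sequence carries no information
    # H = -sum(p * log2(p)) over the observed token frequencies
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Toy inputs (assumed): four distinct tokens vs. one token repeated four times.
uniform = shannon_entropy(["a", "b", "c", "d"])
repeated = shannon_entropy(["a", "a", "a", "a"])
```

Under this measure, a sequence whose tokens are all distinct has higher entropy than one that repeats a single token, which is the general sense in which lower entropy can indicate less complex, more regular text.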

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • China
  • Electronic Health Records*
  • Humans
  • Information Storage and Retrieval / methods*
  • Language*
  • Time Factors