Construction of the integrated multicentre discharge summary database

Stud Health Technol Inform. 2013:192:1064.

Abstract

We started a multi-year project to collect discharge summaries from multiple hospitals and create a big text database to build a common document vector space, and develop various applications such as the autoselection of the disease. As the first step, we extracted discharge summary from two hospitals. Using a text mining method, we carried out a DPC selection. There was a difference in term structure and number of terms between the discharge summaries from both hospitals. Nevertheless, the selection rate of the disease is resembled closely.

Publication types

  • Multicenter Study

MeSH terms

  • Clinical Coding / methods*
  • Data Mining / methods*
  • Databases, Factual*
  • Electronic Health Records / organization & administration*
  • Information Dissemination / methods
  • Japan
  • Medical Record Linkage / methods*
  • Natural Language Processing
  • Patient Discharge Summaries / classification*
  • Systems Integration
  • Vocabulary, Controlled*