Medical Data Mining Course Development in Postgraduate Medical Education: Web-Based Survey and Case Study

JMIR Med Educ. 2021 Oct 1;7(4):e24027. doi: 10.2196/24027.

Abstract

Background: Medical postgraduates' demand for data capabilities is growing, as biomedical research becomes more data driven, integrative, and computational. In the context of the application of big data in health and medicine, the integration of data mining skills into postgraduate medical education becomes important.

Objective: This study aimed to demonstrate the design and implementation of a medical data mining course for medical postgraduates with diverse backgrounds in a medical school.

Methods: We developed a medical data mining course called "Practical Techniques of Medical Data Mining" for postgraduate medical education and taught the course online at Peking Union Medical College (PUMC). To identify the background knowledge, programming skills, and expectations of targeted learners, we conducted a web-based questionnaire survey. After determining the instructional methods to be used in the course, three technical platforms-Rain Classroom, Tencent Meeting, and WeChat-were chosen for online teaching. A medical data mining platform called Medical Data Mining - R Programming Hub (MedHub) was developed for self-learning, which could support the development and comprehensive testing of data mining algorithms. Finally, we carried out a postcourse survey and a case study to demonstrate that our online course could accommodate a diverse group of medical students with a wide range of academic backgrounds and programming experience.

Results: In total, 200 postgraduates from 30 disciplines participated in the precourse survey. Based on the analysis of students' characteristics and expectations, we designed an optimized course structured into nine logical teaching units (one 4-hour unit per week for 9 weeks). The course covered basic knowledge of R programming, machine learning models, clinical data mining, and omics data mining, among other topics, as well as diversified health care analysis scenarios. Finally, this 9-week course was successfully implemented in an online format from May to July in the spring semester of 2020 at PUMC. A total of 6 faculty members and 317 students participated in the course. Postcourse survey data showed that our course was considered to be very practical (83/83, 100% indicated "very positive" or "positive"), and MedHub received the best feedback, both in function (80/83, 96% chose "satisfied") and teaching effect (80/83, 96% chose "satisfied"). The case study showed that our course was able to fill the gap between student expectations and learning outcomes.

Conclusions: We developed content for a data mining course, with online instructional methods to accommodate the diversified characteristics of students. Our optimized course could improve the data mining skills of medical students with a wide range of academic backgrounds and programming experience.

Keywords: course development; medical data mining; online teaching; postgraduate medical education.