Agreement on intrapartum cardiotocogram recordings between expert obstetricians

J Eval Clin Pract. 2015 Aug;21(4):694-702. doi: 10.1111/jep.12368. Epub 2015 May 26.

Abstract

Rationale, aims and objectives: To evaluate obstetricians' inter- and intra-observer agreement on intrapartum cardiotocogram (CTG) recordings and to examine obstetricians' evaluations with respect to umbilical artery pH and base deficit.

Methods: Nine experienced obstetricians annotated 634 intrapartum CTG recordings. The evaluation of each recording was divided into four steps: evaluation of two 30-minute windows in the first stage of labour, evaluation of one window in the second stage of labour and labour outcome prediction. The complete set of evaluations used for this experiment is available online. The inter- and intra-observer agreement was evaluated using proportion of agreement and kappa coefficient. Clinicians' sensitivity and specificity was computed with respect to umbilical artery pH, base deficit and to Apgar score at the fifth minute.

Results: The overall proportion of agreement between clinicians reached 48% with 95% confidence intervals (CI) (CI: 47-50). Regarding the different classes, proportion of agreement ranged from 57% (CI: 54-60) for normal to 41% (CI: 36-46) for pathological class. The sensitivity of clinicians' majority vote to objective outcome was 39% (CI: 16-63) for the umbilical artery base deficit and 27% (CI: 16-42) for pH. The specificity was 89% (CI: 86-92) for both types of objective outcome.

Conclusions: The reported inter-/intra-observer variability is large and this holds irrespective of clinicians' experience or work place. The results support the need of modernized guidelines for CTG evaluation and/or objectivization and repeatability by introduction of a computerized approach that could standardize the process of CTG evaluation within the delivery ward.

Keywords: cardiotocography; fetal heart rate; inter-observer variability; intra-observer variability.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cardiotocography / statistics & numerical data*
  • Clinical Competence*
  • Humans
  • Hydrogen-Ion Concentration
  • Observer Variation
  • Obstetrics / statistics & numerical data*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Software