Semi-supervised machine learning approaches for predicting the chronology of archaeological sites: A case study of temples from medieval Angkor, Cambodia

PLoS One. 2018 Nov 5;13(11):e0205649. doi: 10.1371/journal.pone.0205649. eCollection 2018.

Abstract

Archaeologists often need to date and group artifact types to discern typologies, chronologies, and classifications. For over a century, statisticians have been using classification and clustering techniques to infer patterns in data that can be defined by algorithms. In the case of archaeology, linear regression algorithms are often used to chronologically date features and sites, and pattern recognition is used to develop typologies and classifications. However, archaeological data is often expensive to collect, and analyses are often limited by poor sample sizes and datasets. Here we show that recent advances in computation allow archaeologists to use machine learning based on much of the same statistical theory to address more complex problems using increased computing power and larger and incomplete datasets. This paper approaches the problem of predicting the chronology of archaeological sites through a case study of medieval temples in Angkor, Cambodia. For this study, we have a large dataset of temples with known architectural elements and artifacts; however, less than ten percent of the sample of temples have known dates, and much of the attribute data is incomplete. Our results suggest that the algorithms can predict dates for temples from 821-1150 CE with a 49-66-year average absolute error. We find that this method surpasses traditional supervised and unsupervised statistical approaches for under-specified portions of the dataset and is a promising new method for anthropological inquiry.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Archaeology*
  • Architecture*
  • Calibration
  • Cambodia
  • Geography
  • Linear Models
  • Machine Learning*
  • Time Factors

Grants and funding

The survey of temple sites in Cambodia was funded by the National Science Foundation Dissertation Improvement Grant (1638137), the Rust Family Foundation, the Graduate Research Support Grant, Graduate and Professional Student Association (GPSA), the Office of Graduate Education, and the Office of the Vice President for Research and Economic Affairs. Fellowship support was provided for the authors from the National Science Foundation Graduate Research Fellowship (DGE-1122374), the Endeavour Fellowship from the Australian Government, Department of Education and Training, the ASU College of Liberal Arts and Sciences Graduate Dissertation Completion Fellowship, and the Dartmouth College General Fellowship for Graduate Research. This research has also benefited from funding from the Australian Research Council and the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 639828).