The components of paraphrase evaluations

Philip M McCarthy; Rebekah H Guess; Danielle S McNamara

doi:10.3758/BRM.41.3.682

Behav Res Methods. 2009 Aug;41(3):682-90. doi: 10.3758/BRM.41.3.682.

Authors

Philip M McCarthy¹, Rebekah H Guess, Danielle S McNamara

Affiliation

¹ FedEx Institute of Technology, University of Memphis, Memphis, Tennessee 38152, USA. pmmccrth@memphis.edu

PMID: 19587179

Abstract

Two sentences are paraphrases if their meanings are equivalent but their words and syntax are different. Paraphrasing can be used to aid comprehension, stimulate prior knowledge, and assist in writing-skills development. As such, paraphrasing is a feature of fields as diverse as discourse psychology, composition, and computer science. Although automated paraphrase assessment is both commonplace and useful, research has centered solely on artificial, edited paraphrases and has used only binary dimensions (i.e., is or is not a paraphrase). In this study, we use an extensive database (N=1,998) of natural paraphrases generated by high school students that have been assessed along 10 dimensions (e.g., semantic completeness, lexical similarity, syntactical similarity). This study investigates the components of paraphrase quality emerging from these dimensions and examines whether computational approaches can simulate those human evaluations. The results suggest that semantic and syntactic evaluations are the primary components of paraphrase quality, and that computationally light systems such as latent semantic analysis (semantics) and minimal edit distances (syntax) present promising approaches to simulating human evaluations of paraphrases.
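
To make the syntactic measure named in the abstract concrete, the following is a minimal sketch of a word-level minimal edit distance (Levenshtein distance) between a sentence and a candidate paraphrase. It is an illustration only, not the authors' implementation: the function names `edit_distance` and `normalized_word_distance`, the lowercase whitespace tokenization, and the normalization by the longer sentence length are all assumptions made for this example.

```python
def edit_distance(source, target):
    """Levenshtein distance between two sequences (here, lists of words).

    Counts the minimum number of insertions, deletions, and substitutions
    needed to turn `source` into `target`.
    """
    # previous[j] holds the distance between the first i-1 items of source
    # and the first j items of target.
    previous = list(range(len(target) + 1))
    for i, source_item in enumerate(source, start=1):
        current = [i]
        for j, target_item in enumerate(target, start=1):
            substitution_cost = 0 if source_item == target_item else 1
            current.append(min(
                previous[j] + 1,                      # deletion
                current[j - 1] + 1,                   # insertion
                previous[j - 1] + substitution_cost,  # match or substitution
            ))
        previous = current
    return previous[-1]


def normalized_word_distance(sentence_a, sentence_b):
    """Word-level edit distance scaled to the range 0.0 to 1.0.

    Assumption for this sketch: tokens are lowercase, whitespace-separated
    words, and the raw distance is divided by the longer sentence length.
    """
    words_a = sentence_a.lower().split()
    words_b = sentence_b.lower().split()
    longest = max(len(words_a), len(words_b))
    if longest == 0:
        return 0.0
    return edit_distance(words_a, words_b) / longest
```

Under these assumptions, a score near 0 indicates that the paraphrase keeps nearly the same words in nearly the same order, while a score near 1 indicates that almost every word position differs. The measure says nothing about meaning, which is why the abstract pairs it with a separate semantic measure.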

Publication types

Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Computer Simulation*
Humans
Psycholinguistics / methods*
Semantics