Intelligent bar chart plagiarism detection in documents

ScientificWorldJournal. 2014:2014:612787. doi: 10.1155/2014/612787. Epub 2014 Sep 17.

Abstract

This paper presents a novel features mining approach from documents that could not be mined via optical character recognition (OCR). By identifying the intimate relationship between the text and graphical components, the proposed technique pulls out the Start, End, and Exact values for each bar. Furthermore, the word 2-gram and Euclidean distance methods are used to accurately detect and determine plagiarism in bar charts.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Computer Graphics
  • Data Mining / methods*
  • Humans
  • Pattern Recognition, Automated / methods*
  • Plagiarism*
  • Semantics