Recognition of merged characters based on forepart prediction, necessity-sufficiency matching, and character-adaptive masking

IEEE Trans Syst Man Cybern B Cybern. 2005 Feb;35(1):2-11. doi: 10.1109/tsmcb.2004.837588.

Abstract

Merged characters are the major cause of recognition errors. We classify the merging relationship between two involved characters into three types: "linear," "nonlinear," and "overlapped." Most segmentation methods handle the first type well, however, their capabilities of handling the other two types are limited. The weakness of handling the nonlinear and overlapped types results from character segmentation by linear, usually vertical, cuts assumed in these methods. This paper proposes a novel merged character segmentation and recognition method based on forepart prediction, necessity-sufficiency matching and character-adaptive masking. This method utilizes the information obtained from the forepart of merged characters to predict candidates for the leftmost character, and then applies character-adaptive masking and character recognition to verifying the prediction. Therefore, the arbitrary-shaped cutting path will follow the right shape of the leftmost character so as to preserve the shape of the next character. This method handles the first two types well and greatly improves the segmentation accuracy of the overlapped type. The experimental results and the performance comparisons with other methods demonstrate the effectiveness of the proposed method.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Artificial Intelligence*
  • Computer Graphics
  • Documentation / methods
  • Electronic Data Processing / methods*
  • Image Enhancement / methods
  • Image Interpretation, Computer-Assisted / methods*
  • Information Storage and Retrieval / methods*
  • Numerical Analysis, Computer-Assisted
  • Pattern Recognition, Automated / methods*
  • Printing
  • Reading*
  • Reproducibility of Results
  • Sensitivity and Specificity
  • Signal Processing, Computer-Assisted