Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

Savita Ahlawat; Amit Choudhary; Anand Nayyar; Saurabh Singh; Byungun Yoon

doi:10.3390/s20123344

Improved Handwritten Digit Recognition Using Convolutional Neural Networks (CNN)

Sensors (Basel). 2020 Jun 12;20(12):3344. doi: 10.3390/s20123344.

Authors

Savita Ahlawat¹, Amit Choudhary², Anand Nayyar³, Saurabh Singh⁴, Byungun Yoon⁴

Affiliations

¹ Department of Computer Science and Engineering, Maharaja Surajmal Institute of Technology, New Delhi 110058, India.
² Department of Computer Science, Maharaja Surajmal Institute, New Delhi 110058, India.
³ Graduate School, Duy Tan University, Da Nang 550000, Vietnam.
⁴ Department of Industrial & Systems Engineering, Dongguk University, Seoul 04620, Korea.

Abstract

Traditional systems of handwriting recognition have relied on handcrafted features and a large amount of prior knowledge. Training an Optical character recognition (OCR) system based on these prerequisites is a challenging task. Research in the handwriting recognition field is focused around deep learning techniques and has achieved breakthrough performance in the last few years. Still, the rapid growth in the amount of handwritten data and the availability of massive processing power demands improvement in recognition accuracy and deserves further investigation. Convolutional neural networks (CNNs) are very effective in perceiving the structure of handwritten characters/words in ways that help in automatic extraction of distinct features and make CNN the most suitable approach for solving handwriting recognition problems. Our aim in the proposed work is to explore the various design options like number of layers, stride size, receptive field, kernel size, padding and dilution for CNN-based handwritten digit recognition. In addition, we aim to evaluate various SGD optimization algorithms in improving the performance of handwritten digit recognition. A network's recognition accuracy increases by incorporating ensemble architecture. Here, our objective is to achieve comparable accuracy by using a pure CNN architecture without ensemble architecture, as ensemble architectures introduce increased computational cost and high testing complexity. Thus, a CNN architecture is proposed in order to achieve accuracy even better than that of ensemble architectures, along with reduced operational complexity and cost. Moreover, we also present an appropriate combination of learning parameters in designing a CNN that leads us to reach a new absolute record in classifying MNIST handwritten digits. We carried out extensive experiments and achieved a recognition accuracy of 99.87% for a MNIST dataset.

Keywords: OCR; convolutional neural networks; handwritten digit recognition; pre-processing.

Grants and funding

2019R1A2C1085388/National Research Foundation of Korea