Development of Handwritten Text Recognition system for the Kazakh Language

Authors

  • A. Razaque Satbayev University, Kazakhstan
  • B. Makezhanuly Satbayev University, Kazakhstan
  • O. Alimseitov Satbayev University, Kazakhstan
  • Zh. Kalpeyeva Satbayev University, Kazakhstan
  • A. Ayapbergenova Satbayev University, Kazakhstan

DOI:

https://doi.org/10.51301/ce.2024.i4.01

Keywords:

handwritten text recognition, machine learning, kazakh language, deep learning, convolutional neural networks, recurrent neural networks, character error rate, word error rate

Abstract

The low digitalization of the Kazakh language is a problem that affects bureaucracy efficiency, the accessibility of literature, and education in the Kazakh language. This research introduces a modern approach to handwritten text recognition (HTR) for the Kaz akh language. It optimizes document flow and text mining, increases accessibility to Kazakh literature and historical resources, helps teachers in students’ essay scoring, and judges in decision -making. This solution optimizes operational processes in business, education, and government services. The state -of-the-art algorithms are integrated to achie ve improved accuracy and performance of text translation. HTR for the Kazakh language uses effective machine learning (ML) methods to create an HTR system specifically tuned for the Kazakh script. The se leverage features of Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), image augmentation, transfer learning, and classic ML methods. HTR is implemented using Python programming language, O penCV, PyTorch, and Scikit- learn libraries. The system was trained on a large dataset of Kazakh handwritten text with different topics.

Downloads

Published

2024-12-31

How to Cite

Razaque, A. ., Makezhanuly, B. ., Alimseitov, O. ., Kalpeyeva, Z. ., & Ayapbergenova, A. . (2024). Development of Handwritten Text Recognition system for the Kazakh Language. Computing &Amp; Engineering, 2(4), 1–7. https://doi.org/10.51301/ce.2024.i4.01

Issue

Section

Digital technologies and software solutions