Development of Handwritten Text Recognition system for the Kazakh Language
DOI:
https://doi.org/10.51301/ce.2024.i4.01Keywords:
handwritten text recognition, machine learning, kazakh language, deep learning, convolutional neural networks, recurrent neural networks, character error rate, word error rateAbstract
The low digitalization of the Kazakh language is a problem that affects bureaucracy efficiency, the accessibility of literature, and education in the Kazakh language. This research introduces a modern approach to handwritten text recognition (HTR) for the Kaz akh language. It optimizes document flow and text mining, increases accessibility to Kazakh literature and historical resources, helps teachers in students’ essay scoring, and judges in decision -making. This solution optimizes operational processes in business, education, and government services. The state -of-the-art algorithms are integrated to achie ve improved accuracy and performance of text translation. HTR for the Kazakh language uses effective machine learning (ML) methods to create an HTR system specifically tuned for the Kazakh script. The se leverage features of Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), image augmentation, transfer learning, and classic ML methods. HTR is implemented using Python programming language, O penCV, PyTorch, and Scikit- learn libraries. The system was trained on a large dataset of Kazakh handwritten text with different topics.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Computing & Engineering

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
<div class="pkpfooter-son">
<a rel="license" href="http://creativecommons.org/licenses/by-nc/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc/4.0/80x15.png"></a><br>This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-nc/4.0/">Creative Commons Attribution-NonCommercial 4.0 International License</a>.
</div>