Academic Journal

Enhancing handwritten text recognition accuracy with gated mechanisms

Bibliographic Details
Title: Enhancing handwritten text recognition accuracy with gated mechanisms
Authors: Ravikumar Chinthaginjala, C. Dhanamjayulu, Tai-hoon Kim, Suhaib Ahmed, Si-Yeong Kim, A. S. Kumar, Visalakshi Annepu, Shafiq Ahmad
Source: Scientific Reports, Vol 14, Iss 1, Pp 1-16 (2024)
Publisher Information: Nature Portfolio, 2024.
Publication Year: 2024
Collection: LCC:Medicine
LCC:Science
Subject Terms: Convolutional recurrent neural networks, Handwritten transcript recognition, Natural language processing, Gated convolutional neural networks, Deep learning, Medicine, Science
Description: Abstract: Handwritten Text Recognition (HTR) is a challenging task because handwritten text has complex structure and varies widely between writers. In recent years, gated mechanisms such as Long Short-Term Memory (LSTM) networks have brought significant advances to HTR systems. This paper presents an overview of HTR using a gated mechanism and highlights its novelty and advantages. The gated mechanism enables the model to capture long-term dependencies, retain relevant context, handle variable-length sequences, mitigate error propagation, and adapt to contextual variations. The pipeline involves preprocessing the handwritten text images, extracting features, modeling the sequential dependencies with the gated mechanism, and decoding the output into readable text. Training uses annotated datasets and optimization techniques to minimize transcription discrepancies. HTR using a gated mechanism has found applications in digitizing historical documents, automatic form processing, and real-time transcription. The results show improved accuracy and robustness compared to traditional HTR approaches, opening up new possibilities for recognizing and transcribing handwritten text in various domains. The proposed approach outperforms the most recent iteration of the HTR system when evaluated on five handwritten datasets (Washington, Saint Gall, RIMES, Bentham, and IAM). Low-cost computing devices such as smartphones and robots can benefit from this research.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2045-2322
Relation: https://doaj.org/toc/2045-2322
DOI: 10.1038/s41598-024-67738-8
Access URL: https://doaj.org/article/ebefe57222754dec9fac9f021aeb33da
Accession Number: edsdoj.befe57222754dec9fac9f021aeb33da
Database: Directory of Open Access Journals
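Illustrative note: the abstract describes a pipeline in which a gated mechanism models sequential dependencies and the output is then decoded into text. The sketch below is only a toy illustration of those two ideas, not the authors' model. The function names `gated_unit` and `greedy_collapse` are hypothetical, the sigmoid gate is a generic element-wise gate, and the CTC-style "collapse repeats, drop blanks" decoding is an assumption, since the record does not say which decoder the paper uses.

```python
import math


def sigmoid(x):
    """Logistic function, mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_unit(features, gate_logits):
    """Scale each feature by sigmoid(gate logit).

    A gate near 1 passes the feature through; a gate near 0 suppresses it.
    (Hypothetical helper; a generic gate, not the paper's architecture.)
    """
    return [f * sigmoid(g) for f, g in zip(features, gate_logits)]


def greedy_collapse(frame_labels, blank="-"):
    """Merge consecutive repeated labels, then drop blank symbols.

    This is the usual best-path post-processing for CTC-style outputs,
    assumed here purely for illustration.
    """
    out = []
    prev = None
    for label in frame_labels:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return "".join(out)


# Gate logits of 0, +10, -10 give gates of 0.5, about 1, and about 0.
gated = gated_unit([2.0, -1.0, 0.5], [0.0, 10.0, -10.0])

# Per-frame predictions with repeats and blanks collapse to one word.
text = greedy_collapse(list("hh-e-ll-llo-"))
```

In this toy, the blank between the two runs of `l` is what preserves the double letter after repeats are merged.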