Original Article

JJCIT. 2025; 11(4): 418-431


An Enhanced Word-Level Arabic OCR Based on Dual Encoder Transformer Architecture

Khulood Gaashan, Maram Bani Younes.



Abstract

Arabic script is among the most complex scripts to recognize automatically. Its characters take different
shapes, and its diacritical marks are difficult to distinguish from the dots that belong to the characters
themselves. These distinctive features make optical character recognition (OCR) more difficult and lead to
low recognition accuracy. Several studies in the literature have aimed to introduce high-accuracy Arabic
OCR. However, improving word-level reading accuracy remains an open issue that depends on the dataset used
and the recognition model developed. Moreover, diacritics have received limited attention and have not been
sufficiently addressed: experimental tests of prior models on words with diacritics have shown accuracy
that does not exceed 60%. Consequently, this work introduces a new, accurate deep-learning model for Arabic
OCR that considers words with and without diacritical marks. It utilizes a dual encoder transformer
(DTrOCR), a deep-learning architecture that has demonstrated superior performance in identification and
classification tasks. The proposed DTrOCR uses multiple batch sizes. It has been trained on a
comprehensive, generated Arabic word-based dataset named MFSRHRD and tested on unseen datasets. For Arabic
words without diacritics, recognition accuracy reaches 98.5%; for words with diacritics, it reaches 89.9%.
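
The abstract reports word-level accuracy separately for words with and without diacritics but does not define the metric. As an illustration only, the sketch below assumes accuracy means exact word match and that "without diacritics" means the Arabic combining marks U+064B to U+0652 (plus the superscript alef, U+0670) are removed before comparison. The names `strip_diacritics` and `word_accuracy` are hypothetical and are not taken from the paper.

```python
import unicodedata

# Assumption: "diacritics" = Arabic tanwin, harakat, shadda and sukun
# (U+064B..U+0652) plus the superscript alef (U+0670).
ARABIC_DIACRITICS = {chr(code) for code in range(0x064B, 0x0653)} | {"\u0670"}


def strip_diacritics(word: str) -> str:
    """Return the word with Arabic diacritical marks removed."""
    return "".join(ch for ch in word if ch not in ARABIC_DIACRITICS)


def word_accuracy(predictions, references, ignore_diacritics=False):
    """Fraction of predicted words that exactly match their reference word.

    Both words are NFC-normalized first so that the same marks typed in a
    different order still compare equal.
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0

    def prepare(word: str) -> str:
        word = unicodedata.normalize("NFC", word)
        return strip_diacritics(word) if ignore_diacritics else word

    correct = sum(prepare(p) == prepare(r) for p, r in zip(predictions, references))
    return correct / len(references)
```

Calling `word_accuracy(preds, refs)` scores the diacritized output strictly, while `word_accuracy(preds, refs, ignore_diacritics=True)` scores only the base letters. The gap between the two numbers indicates how many errors come from the diacritics alone.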

Keywords: Arabic OCR; Multi-Batch Size; Transformer; Dual Encoder Transformer; Decoder; Feature Extraction; Self-Attention Mechanism.








The articles in Bibliomed are open access articles licensed under Creative Commons Attribution 4.0 International License (CC BY), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.