ADVERTISEMENT

Home|Journals|Articles by Year|Audio Abstracts
 

Original Article

JJCIT. 2025; 11(4): 432-447


LIGHT-WEIGHT, SEMI CONTEXT-FREE, RULE-BASED ARABIC TEXT CLASSIFIER FOR POS TAGGING

Bilal Ibrahim Alqudah, Mohanad Alhasanat, Abdullah Alhasanat, Hatem Alqudah.



Abstract
Download PDF Post

In this paper, we address the problem of Arabic text part-of-speech tagging (POS) and the morphological classification for Arabic text. Our focus is the Classical Arabic (CA) language and Modern Arabic (MSA), where the text is vocalized and has diacritics in most of its letters. Our proposed method of classification is lexicon-free, tokenization-agnostic, stemming processes, or artificial intelligence techniques. The goal is to lower the needed resources to classify the Arabic text. It is built upon the fact that each verb in the Arabic language follows a rule (وزن) that can be used to identify a word. The process is determined by a finite state machine translated to regular expressions. Each verb tense is presented in a set of regular expressions (RE). The order in which a set of regular expressions is processed is significant to the result accuracy. Whenever a match is found, the word is marked so no further matches occur. The provided method is lightweight and provides a best-effort classifier where the closest match is assigned as a tag.

Key words: Part of Speech Tagging, Arabic Rule Based Classifier, Natural Languages, Context Free.







Bibliomed Article Statistics

75
37
62
49
21
14
14
R
E
A
D
S

63

44

66

56

44

17

1
D
O
W
N
L
O
A
D
S
12010203040506
20252026

Full-text options


Share this Article


Online Article Submission
• ejmanager.com




ejPort - eJManager.com
Author Tools
About BiblioMed
License Information
Terms & Conditions
Privacy Policy
Contact Us

The articles in Bibliomed are open access articles licensed under Creative Commons Attribution 4.0 International License (CC BY), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.