Extracting numerical data from unstructured Arabic texts (ENAT)

Unstructured data becomes challenges because in recent years have observed the ability to gather a massive amount of data from annotated documents. This paper interested with Arabic unstructured text analysis. Manipulating unstructured text and converting it into a form understandable by computer is...

Full description

Saved in:
Bibliographic Details
Main Authors: K. AL-Mashhadany, Abeer (Author), N. Hamood, Dalal (Author), Sadiq Al-Obaidi, Ahmed T. (Author), K. Al-Mashhsdany, Waleed (Author)
Format: EJournal Article
Published: Institute of Advanced Engineering and Science, 2021-03-01.
Subjects:
Online Access:Get fulltext
Tags: Add Tag
No Tags, Be the first to tag this record!
LEADER 02639 am a22003253u 4500
001 ijeecs23031_14750
042 |a dc 
100 1 0 |a K. AL-Mashhadany, Abeer  |e author 
100 1 0 |e contributor 
700 1 0 |a N. Hamood, Dalal  |e author 
700 1 0 |a Sadiq Al-Obaidi, Ahmed T.  |e author 
700 1 0 |a K. Al-Mashhsdany, Waleed  |e author 
245 0 0 |a Extracting numerical data from unstructured Arabic texts (ENAT) 
260 |b Institute of Advanced Engineering and Science,   |c 2021-03-01. 
500 |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/23031 
520 |a Unstructured data becomes challenges because in recent years have observed the ability to gather a massive amount of data from annotated documents. This paper interested with Arabic unstructured text analysis. Manipulating unstructured text and converting it into a form understandable by computer is a high-level aim. An important step to achieve this aim is to understand numerical phrases. This paper aims to extract numerical data from Arabic unstructured text in general. This work attempts to recognize numerical characters phrases, analyze them and then convert them into integer values. The inference engine is based on the Arabic linguistic and morphological rules. The applied method encompasses rules of numerical nouns with Arabic morphological rules, in order to achieve high accurate extraction method. Arithmetic operations are applied to convert the numerical phrase into integer value. The proper operation is determined depending on linguistic and morphological rules. It will be shown that applying Arabic linguistic rules together with arithmetic operations succeeded in extracting numerical data from Arabic unstructured text with high accuracy reaches to 100%. 
540 |a Copyright (c) 2021 Institute of Advanced Engineering and Science 
540 |a http://creativecommons.org/licenses/by-nc/4.0 
546 |a eng 
690 |a Computer Science; Artificial Intelligence; Natural Language Processing; Text Mining 
690 |a Arabic linguistic rules; Numerical dictionary; Related words; Text data mining; Unstructured data 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 21, No 3: March 2021; 1759-1770 
786 0 |n 2502-4760 
786 0 |n 2502-4752 
786 0 |n 10.11591/ijeecs.v21.i3 
787 0 |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/23031/14750 
856 4 1 |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/23031/14750  |z Get fulltext