DSpace

King Saud University Repository >
King Saud University >
COLLEGES >
Science Colleges >
College of Computer and Information Sciences >
College of Computer and Information Sciences >

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/15340

Title: Naive Bayes Classifier based Arabic document categorization
Authors: Noaman, H.M.
Elmougy, S.
Ghoneim, A
Hamza, T.
Keywords: Naïve Bayes Classifier, document categorization, machine learning, natural language processing for Arabic language
Issue Date: 2010
Publisher: IEEE Explorer - Proceeding of the the 7th International Conference on Informatics and Systems (INFOS)
Abstract: Text Categorization aims to assign an electronic document to one or more categories based on its contents. Due to the rapid growth of the number of online Arabic documents, the information libraries and Arabic document corpus, automatic Arabic document classification becomes an important task. This paper suggests the use of rooting algorithm with Naive Bayes Classifier to the problem of document categorization of Arabic language and reports the algorithm performance in terms of error rate, accuracy, and micro-average recall measures. Our experimental study shows that using rooting algorithm with Naive Bayes (NB) Classifier gives ~62.23% average accuracy and decreases the dimensionality of the training documents.
URI: http://hdl.handle.net/123456789/15340
Appears in Collections:College of Computer and Information Sciences

Files in This Item:

File Description SizeFormat
Dr. Samir Elmougy-4-conf.docx15.17 kBMicrosoft Word XMLView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

DSpace Software Copyright © 2002-2007 MIT and Hewlett-Packard - Feedback