Performance of NB and SVM Classifiers in Arabic Text Data

Abstract:

Text classification is a supervised learning technique that uses labelled training data to derive a classification system (classifier) and then automatically classifies unlabelled text data using the derived classifier. This paper investigates Naïve Bayesian method (NB) and Support Vector Machine (SVM) on different Arabic data sets. The bases of our comparison are the most popular text evaluation measures. The Experimental results against different Arabic text categorization data sets reveal that NB algorithm outperforms the SVM with regards to F1 measure.