Some Considerations about Text Classification Systems using Weka

Abstract:

We analyze the SMO, Naïve Bayes and J48 performance on a text training dataset. We have done the same analysis having pairs of synonyms, when taking into account the assumption that the synonyms selection must be done according to the meaning of the information in the document.