Improved Word Representation for Arabic Sentiment Analyzing

Abstract:

 In this article, we investigate a variety of documents to vector representation algorithms that may be used for text categorization and sentiment analysis. The document to vector representation technique known as doc2vec was the very first contender for such a job that we examined. This is followed by the TF and TFIDF as basic approaches, and eventually the word2vec technique. Numerous techniques, combinations of these vectors, and other approaches to machine learning have been evaluated and assessed.  Even when compared to deep pre-trained models such as the Bert model and others, the accuracy rates increase by least 2% to 3% in average when the results of the created scheme in this study are compared to the stateof-the-art results on various Arabic sentiment analysis benchmarks.