IMPLEMENTASI NAÏVE BAYES CLASSIFIER PADA TWITTER DALAM ANALISIS SENTIMEN TERHADAP VARIAN BARU CORONA VIRUS OMICRON
No Thumbnail Available
Date
2023-06
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Elfitra
Abstract
Omicron is a new variant of the Corona virus that was first reported to the World Health
Organization (WHO) from South Africa on November 24, 2021. The amount of information
related to the Omicron virus that is spread in the form of text on the internet and on social
media sites like Twitter is of interest to researchers. This study aims to determine public
sentiment from Twitter users about the omicron virus, then determine sentiment classification
using the Nave Bayes Classifier and calculate the level of accuracy of the method in sentiment
analysis. The data used is 1350 data points using two data testing scenarios, namely the first
scenario, Split Validation, using a data ratio of 70%:30%, 80%:20%, and 90%:10%. The
second scenario uses cross-validation, divided into 10 tests. Sentiment classifiers are divided
into three classes: neutral, fearful, and fearless. The classification process is carried out by
means of data pre-processing and weighting using the TF-IDF technique. The final results
of the classification process get the highest accuracy values of 85%, 81% precision, and 86%
recall in the split validation test with a 90:10 data division. Testing data with cross-validation
resulted in the highest accuracy of 93,1% accuracy, 36,6% precision, and 98% recall, with
a fold value of 1.
Description
Keywords
Sentiment Analysis, Cross Validation, Fold, Split Validation, Naïve Bayes Classifier, Omicron, TF-IDF, Data Testing, Twitter, Virus
Citation
Perpustakaan