75022

Автор(ы): 

Автор(ов): 

4

Параметры публикации

Тип публикации: 

Статья в журнале/сборнике

Название: 

Exploratory Data Analysis and Natural Language Processing Model for Analysis and Identification of the Dynamics of COVID-19 Vaccine Opinions on Small Datasets

ISBN/ISSN: 

1078-6236

DOI: 

https://doi.org/10.25728/assa.2023.23.3.1381

Наименование источника: 

  • Advances in Systems Science and Applications

Обозначение и номер тома: 

Vol 23 No 3

Город: 

  • Slippery Rock, USA

Издательство: 

  • The International Institute for General Systems Studies (IIGSS)

Год издания: 

2023

Страницы: 

108-126
Аннотация
In this study, the successful implementation of an active learning algorithm on small-scale datasets is demonstrated. The study also examines the dynamics of public opinions on COVID-19 vaccinations using VK (social network) commentaries related to the COVID- 19 vaccine and masks for opinion evaluation. The proposed methodology includes several stages such as natural language processing, classification with active learning, exploratory data analysis, and opinion dynamics. Natural language processing is used for text preprocessing, tokenization, and feature extraction. A machine learning model with active learning is employed to identify opinions as positive, negative, or neutral/unknown. The model includes classical machine learning, machine learning and deep learning models. The results show that the highest classification accuracy is 69.1% and 73.1% without and with the active learning algorithm, respectively. The experimental results suggest that classifiers using active learning perform better than simple natural language processing classifiers on small-scale datasets.

Библиографическая ссылка: 

Мельничук В.С., Губанов Д.А., Сыч В.В., Чхартишвили А.Г. Exploratory Data Analysis and Natural Language Processing Model for Analysis and Identification of the Dynamics of COVID-19 Vaccine Opinions on Small Datasets // Advances in Systems Science and Applications. 2023. Vol 23 No 3. С. 108-126.