Advanced Information Retrieval System: Theoretical and Experimental Perspective

A Framework for Sentiment Mining in YouTube Comments Using Information Retrieval Methods

Author(s): Urmila Pilania*, Manoj Kumar* and Sanjay Singh *

Pp: 50-60 (11)

DOI: 10.2174/9798898813666126010007

* (Excluding Mailing and Handling)

Abstract

With the continuous growth of user-generated data on social media platforms, its analysis requires a deep understanding of sentiment trends. Data from social sites is collected using web scraping techniques and then filtered for consistency through pre-processing methods. In this study, multiple techniques are applied to analyze YouTube comments. The BERT transformer model is used to classify comments into positive, negative, and neutral categories, enabling sentiment analysis and providing insights into user opinions and trends. Additionally, Latent Dirichlet Allocation (LDA) is employed for thematic analysis to identify key discussion topics within the comments. The performance of the proposed approach is evaluated using the F1-score metric. For future improvements, deep learning techniques could be explored to enhance the accuracy of sentiment analysis.


Keywords: BERT method, LDA method, Sentiment analysis, Thematic analysis, Web scraping.