Research Article

Comparison of Machine Learning for Sentiment Analysis in Movie Reviews

Author(s): Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb

Article Information

Article Info: Journal of FST, ISSN: 2959-4812, Volume - 03, Issue - 01, July 2025, Article #10

Publish Date: July 1, 2025

Author(s): Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb

Keywords: Sentiment Analysis, Text Classification, Natural Language Processing

User Activity: Views: 224, Downloads: 286

Abstract

This study presents a comparative analysis of machine learning and deep learning algorithms for sentiment classification in movie reviews. Three benchmark datasets—IMDb (50K and 20K reviews) and Rotten Tomatoes—were used to evaluate six classifiers: Na¨ıve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and BERT. Preprocessing included tokenization, stop-word removal, stemming, and feature extraction using Count Vectorizer and TF-IDF. Evaluation metrics such as accuracy, precision, recall, sensitivity, specificity, and F1-score were used to assess model performance. Logistic Regression achieved 88% accuracy on the IMDb dataset, while Random Forest exhibited the highest specificity. BERT outperformed traditional models in both accuracy and F1-score across all datasets, particularly in handling informal and context-heavy language. The results highlight the impact of dataset characteristics on classification performance and provide insights for deploying sentiment analysis in real-world applications like recommendation systems and audience profiling.

Citation Information

Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb. (July 1, 2025). Comparison of Machine Learning for Sentiment Analysis in Movie Reviews. Journal of FST, Volume 03, Issue 01, 147-170.

DOI: https://doi.org/10.64494/JFST/v3i1/jm/2025/07/147-170