Research Article
Comparison of Machine Learning for Sentiment Analysis in Movie Reviews
Author(s): Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb
Article Information
Article Info: Journal of FST, ISSN: 2959-4812, Volume - 03, Issue - 01, July 2025, Article #10
Publish Date: July 1, 2025
Author(s): Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb
Keywords: Sentiment Analysis, Text Classification, Natural Language Processing
User Activity: Views: 224, Downloads: 286
Abstract
This study presents a comparative analysis of machine learning and deep learning algorithms for sentiment classification in movie reviews. Three benchmark datasets—IMDb (50K and 20K reviews) and Rotten Tomatoes—were used to evaluate six classifiers: Na¨ıve Bayes, Logistic Regression, Support Vector Machine, Decision Tree, Random Forest, and BERT. Preprocessing included tokenization, stop-word removal, stemming, and feature extraction using Count Vectorizer and TF-IDF. Evaluation metrics such as accuracy, precision, recall, sensitivity, specificity, and F1-score were used to assess model performance. Logistic Regression achieved 88% accuracy on the IMDb dataset, while Random Forest exhibited the highest specificity. BERT outperformed traditional models in both accuracy and F1-score across all datasets, particularly in handling informal and context-heavy language. The results highlight the impact of dataset characteristics on classification performance and provide insights for deploying sentiment analysis in real-world applications like recommendation systems and audience profiling.
Citation Information
Jannatul Ferdous Mirza, Muhammad Mahbub Sarwar Shafi, Saeed Hossain Moheb. (July 1, 2025). Comparison of Machine Learning for Sentiment Analysis in Movie Reviews. Journal of FST, Volume 03, Issue 01, 147-170.
