Dive into the world of movie reviews and analyze sentiments using Python! π₯πΏ
This repository contains a Python-based Sentiment Analysis tool which classifies IMDb movie reviews into positive (π) and negative (π) sentiments. Whether you're a movie buff, a data enthusiast, or someone who just loves playing with texts, this tool is for you!
.
βββ IMDB
β βββ Train.csv
β βββ Valid.csv
β βββ Test.csv
βββ text_analysis.py
- Make sure you have Python3 installed:
python3 --version
- Clone this repository:
git clone [(https://github.com/reecebaileyy/Sentiment-Analysis-for-Text.git)]
cd Sentiment-Analysis-for-Text
- Install required libraries:
pip install pandas numpy matplotlib nltk textblob scikit-learn wordcloud
python3 text_analysis.py
- Displayed filenames in the IMDB folder.
- Pie chart π₯§ showing label distribution.
- Word clouds βοΈ for positive and negative reviews.
- Performance metrics π of the model on validation and test datasets.
- Integrate more advanced NLP models for better accuracy.
- Add a GUI interface.
- Allow custom reviews for prediction.
Feel free to fork this project, raise issues, and submit Pull Requests.
π Acknowledgements IMDb for the dataset. Textblob and NLTK for processing tools.