Sentiment Analysis Prediction
ADS508 Spring 2024 Team 5:
• Conor Fitzpatrick
• Ravita Kartawinata
• Halee Staggs
Company Name: PoliticPulse
Company Industry: Political Opinion Research/ Political Consulting
Company Size: 10
The company is focused on utilizing public discourse on social media and comments on news articles, specifically Twitter (now called X), and the New York Times, to understand public sentiment towards major presidential candidates in swing states.
By leveraging analytics and machine learning techniques, we aim to provide valuable insights to political organizations and campaigns, offering guidance in navigating the ever-changing landscape of public opinion.
Data are stored on public AWS S3 Bucket (s3://ads508team5/). There are 4 files in this bucket:
- Twitter: s3://ads508team5/tweeter/
- NYT comment: s3://ads508team5/nyt/nyt-comments-2020.csv
- US cities: s3://ads508team5/cities/uscities.csv
- Clone : https://github.com/HNStaggs/ADS508_GroupProject.git
- Run all the files in Setup folder
- Open EDA.ipnyb - Run All with ml.m5.large instance
- Open Partition_Transform.ipnyb - Run All with ml.m5.large instance
- Open Modeling.ipnyb - Run All with ml.m5.large instance
- Python (Jupyterlab notebook)
- AWS (Sagemaker, Athena, S3, DataWrangler, AutoPilot)
- Google Doc
- Powerpoint