- Create a new Conda environment from the env_dia.yml file included in the repo:
  - env_dia.yml lists all of the dependencies required to fully reproduce the environment.
  - Creating the Conda environment from env_dia.yml avoids having to rerun the code repeatedly while installing missing packages one by one.
  - For further information on creating a Conda environment from a .yml file, see: https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html#creating-an-environment-from-an-environment-yml-file
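  - With Conda installed, the environment can typically be created with `conda env create -f env_dia.yml` and then activated with `conda activate <name>`, where `<name>` is whatever environment name env_dia.yml specifies.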
- Once the repository is cloned, add the private key to the working directory:
  - The private .pem file must be added to the working directory manually.
  - It connects the local machine to the remote Ubuntu server for SFTP put and get requests.
  - The file is provided in the zip folder from the Moodle submission (tb_ubuntu_mint.pem).
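The repo's own scripts handle these transfers; as a rough illustration only, an SFTP upload authenticated with the .pem key might look like the following minimal paramiko sketch (the hostname, username, and file paths are placeholder assumptions, not values from the repo):

```python
import paramiko

# Placeholder values: substitute the real EC2 hostname, user, and paths.
HOST = "ec2-xx-xx-xx-xx.compute.amazonaws.com"
USER = "ubuntu"
KEY_FILE = "tb_ubuntu_mint.pem"

# Open an SSH connection authenticated with the private key file.
client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
client.connect(HOST, username=USER, key_filename=KEY_FILE)

# Use SFTP over the same connection; put() uploads, get() downloads.
sftp = client.open_sftp()
sftp.put("preprocessed_tweets.csv", "/home/ubuntu/preprocessed_tweets.csv")

sftp.close()
client.close()
```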
- From the terminal, run main.py:
  - main.py orchestrates all of the pipeline scripts:
    - Gathering tweets from tweepy (a rough sketch follows this list).
      - The full dataset takes about 3 hours to collect (150k+ tweets).
      - For testing, this step only collects 1 page per account and ignores the result.
      - The raw dataset of 150k+ tweets is provided in the repo and is picked up by the rest of the code.
    - Preprocessing tweets.
    - Sending the data to the AWS Ubuntu machine.
    - Collecting HDFS results.
    - Visualising results.
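The exact collection logic lives in the repo's scripts; the snippet below is only a hedged sketch of per-account timeline paging with tweepy, assuming the v1.1 user_timeline endpoint (the credentials, account list, and page limit shown are illustrative placeholders, not values from the repo):

```python
import tweepy

# Placeholder credentials: real API keys are required to run this.
auth = tweepy.OAuthHandler("CONSUMER_KEY", "CONSUMER_SECRET")
auth.set_access_token("ACCESS_TOKEN", "ACCESS_SECRET")
api = tweepy.API(auth, wait_on_rate_limit=True)

ACCOUNTS = ["example_account"]  # illustrative; the repo defines its own account list
PAGES = 1                       # 1 page per account in test mode; more pages for the full run

tweets = []
for account in ACCOUNTS:
    # Page through the account's timeline, up to 200 tweets per page.
    cursor = tweepy.Cursor(api.user_timeline, screen_name=account,
                           count=200, tweet_mode="extended")
    for page in cursor.pages(PAGES):
        tweets.extend(status.full_text for status in page)

print(f"Collected {len(tweets)} tweets")
```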
The execution of Hadoop Streaming jobs is not automated within the main.py file. The results were last collected on April 12th, and are used for the final report and for reproducing this analysis. To rerun the Hadoop jobs:
- Log in to the Ubuntu remote server using the .pem file.
- Switch to the hduser (Hadoop) account.
- cd /usr/local/hadoop/share/hadoop/tools/lib/
- Update the dates on the output files in the commands from hadoop_streaming_jobs.txt.
- Copy the results back to Ubuntu from HDFS and update the get_data_from_cloud.py script to point at the new locations.
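As a rough illustration of the last step only, a script along the lines of get_data_from_cloud.py could copy a job's output out of HDFS on the remote machine and then pull it down over SFTP, as in the minimal sketch below (the hostname and all paths are placeholder assumptions; the repo's script is the authoritative version):

```python
import paramiko

# Placeholder connection details and paths: replace with the real values.
HOST = "ec2-xx-xx-xx-xx.compute.amazonaws.com"
KEY_FILE = "tb_ubuntu_mint.pem"
HDFS_OUTPUT = "/user/hduser/job_output/part-00000"   # placeholder HDFS path
REMOTE_COPY = "/home/ubuntu/part-00000"              # placeholder path on the server
LOCAL_COPY = "part-00000"                            # placeholder local path

client = paramiko.SSHClient()
client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
client.connect(HOST, username="ubuntu", key_filename=KEY_FILE)

# Copy the job output out of HDFS onto the server's local filesystem
# (assumes the hdfs client is on this user's PATH with read access to the output).
_, stdout, _ = client.exec_command(f"hdfs dfs -get -f {HDFS_OUTPUT} {REMOTE_COPY}")
stdout.channel.recv_exit_status()  # block until the HDFS copy finishes

# Pull the copied file down to the local machine over SFTP.
sftp = client.open_sftp()
sftp.get(REMOTE_COPY, LOCAL_COPY)
sftp.close()
client.close()
```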
The Hadoop Streaming commands themselves are provided in the repo in a txt file (hadoop_streaming_jobs.txt).