Map/Reduce with Hadoop

This practice is about programming map/reduce functions in some datasets to be processed at a Hadoop cluster using Yarn.

Requirements:

. The setup for this tasks using docker can be found in this link. . Complete the tasks below developing the provided scripts

Download file salarios.csv (incoming) avaliable at this link.
Modify the files mapper.py and reducer.py to create mapping and reducing functions to output a list with the 10 highiest incomes, returning the name and income.

Download the weblog file (weblog_entries.txt)
Make the necessary changes to mapper.py and reducer.py to generate a list with the name of the files .html having the number of characters lesser or equal than 5

Download the e-mail weblog file (dovecot.log)
Output: A list with all the user names and how many times they tried to access the mail service, but only the ones with more than 100 entries

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.env		.env
task_1		task_1
task_2		task_2
task_3		task_3
.gitignore		.gitignore
README.md		README.md