Parallel MapReduce problem

This program implements a local, parallel MapReduce job that counts the total number of occurrences of each word across a set of files.

The program uses Python's "multiprocessing" library to create a pool of worker processes that run in parallel.

The "map_function" reads the contents of a file, splits it into words, and returns a list of tuples containing each word and the number of occurrences of that word.
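The map step described above could look roughly like the following. This is a minimal sketch, not the repository's actual code: the lower-casing, the whitespace-based splitting, the UTF-8 encoding, and the use of `collections.Counter` are all assumptions made for illustration.

```python
from collections import Counter


def map_function(file_name):
    """Read one file and return a list of (word, count) tuples for it."""
    with open(file_name, encoding="utf-8") as handle:
        # Assumption: words are whitespace-separated and compared
        # case-insensitively; punctuation is left attached to words.
        words = handle.read().lower().split()
    # Counter tallies the occurrences; items() yields (word, count) pairs.
    return list(Counter(words).items())
```

Counting inside the map step keeps the data sent back from each worker small, since only one tuple per distinct word is returned rather than one per occurrence.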

The "reduce_function" takes a list of these tuples and returns a single tuple containing each word and the total number of occurrences across all files.
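As a rough illustration of the reduce step, the sketch below merges per-file counts into overall totals. It rests on two assumptions that the description above does not confirm: the input is a list holding one list of (word, count) tuples per file, and the totals are returned as a dictionary for convenience.

```python
from collections import Counter
from itertools import chain


def reduce_function(mapped_results):
    """Merge per-file (word, count) lists into one word -> total mapping."""
    totals = Counter()
    # Assumption: mapped_results holds one list of tuples per input file,
    # so the lists are flattened before the counts are summed.
    for word, count in chain.from_iterable(mapped_results):
        totals[word] += count
    return dict(totals)
```

For example, `reduce_function([[("a", 1), ("b", 2)], [("a", 3)]])` gives `{"a": 4, "b": 2}`.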

The "main" function applies the "map_function" to each file in a list of file names using the worker pool, then applies the "reduce_function" to the resulting list.
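Putting the pieces together, the whole flow might be sketched as below. The function bodies, the dictionary return type, and the command-line handling are assumptions made for illustration; only the names "map_function", "reduce_function", "main" and the use of a "multiprocessing" worker pool come from the description above.

```python
import sys
from collections import Counter
from multiprocessing import Pool


def map_function(file_name):
    """Count the words in one file; returns a list of (word, count) tuples."""
    with open(file_name, encoding="utf-8") as handle:
        # Assumption: lower-cased, whitespace-separated words.
        return list(Counter(handle.read().lower().split()).items())


def reduce_function(mapped_results):
    """Sum the per-file counts into one word -> total dictionary."""
    totals = Counter()
    for file_counts in mapped_results:
        for word, count in file_counts:
            totals[word] += count
    return dict(totals)


def main(file_names):
    """Map over the files in parallel, then reduce in the parent process."""
    if not file_names:
        return {}
    # Pool() starts one worker per available CPU by default;
    # map() returns the results in the same order as file_names.
    with Pool() as pool:
        mapped_results = pool.map(map_function, file_names)
    return reduce_function(mapped_results)


if __name__ == "__main__":
    # Hypothetical usage: python mapreduce.py a.txt b.txt
    for word, total in sorted(main(sys.argv[1:]).items()):
        print(word, total)
```

The `if __name__ == "__main__":` guard matters here: on platforms where worker processes re-import the main module, unguarded top-level code would run again in every worker.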

meftehs/MapReduce
