Utilize Python to create a key-value program similar to MapReduce on Hadoop. For this assignment, do the following:
Explain how this program could operate in a distributed environment where the dictionary is spread across a cluster of nodes. What would be the limitations of this architecture? Is there a threshold at which the amount of data justifies the overhead of operating on a cluster?
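As a starting point, here is a minimal single-process sketch of one way such a program might look, using word counting as the example job. It is an illustration under assumptions, not a reference solution: the function names (`map_phase`, `partition`, `shuffle`, `reduce_phase`, `word_count`) and the choice of three nodes are hypothetical, and each "node" is simulated by an ordinary in-memory dictionary rather than a real machine.

```python
import zlib
from collections import defaultdict


def map_phase(document):
    """Emit a (word, 1) pair for every word in one input document."""
    for word in document.lower().split():
        yield word, 1


def partition(key, num_nodes):
    """Choose the node that owns a key.

    zlib.crc32 is used instead of the built-in hash() because string
    hashes are randomized per Python process, so separate workers
    would otherwise disagree about which node owns a key.
    """
    return zlib.crc32(key.encode("utf-8")) % num_nodes


def shuffle(pairs, num_nodes):
    """Group values by key inside the dictionary of the owning node."""
    nodes = [defaultdict(list) for _ in range(num_nodes)]
    for key, value in pairs:
        nodes[partition(key, num_nodes)][key].append(value)
    return nodes


def reduce_phase(node):
    """Sum the values collected for each key held by one node."""
    return {key: sum(values) for key, values in node.items()}


def word_count(documents, num_nodes=3):
    """Run map, shuffle, and reduce, returning merged and per-node results."""
    pairs = (pair for doc in documents for pair in map_phase(doc))
    nodes = shuffle(pairs, num_nodes)
    per_node = [reduce_phase(node) for node in nodes]
    merged = {}
    for result in per_node:
        merged.update(result)  # keys never overlap between nodes
    return merged, per_node


docs = ["the quick brown fox", "the lazy dog", "the fox"]
totals, per_node = word_count(docs)
print(totals["the"])  # 3
print(totals["fox"])  # 2
```

In this sketch the shuffle step is only a loop over local dictionaries. On a real cluster the same step would send every pair over the network to the node that owns its key, and that data movement, together with job scheduling and uneven key distribution, is the kind of overhead the questions above ask about.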