Distributed Algorithms, Map-Reduce Paradigm, Scalable ML using Spark MLlib on Standalone, AWS EMR Cluster with Docker & Nvidia RAPIDS. — Since the early 2000s, the amount of data collected has increased enormously due to the advent of internet giants such as Google, Netflix, Youtube, Amazon, Facebook, etc. Near to 2010, another “data wave” had come about when mobile phones became hugely popular. In 2020s, we anticipate another exponential rise in…