###1. PySpark Algorithms by Mahmoud Parsian
###2. Source code @github.com -- PySpark Algorithms by Mahmoud Parsian
###3. Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeffrey D. Ullman
###5. Designing Good MapReduce Algorithms by Jeffrey D. Ullman
###6. Bigtable: A Distributed Storage System for Structured Data
###7. Relational Algebra and MapReduce
###9. MapReduce and Relational Algebra