Description
PySpark Recipes: A Problem-Solution Approach with PySpark2, by Mishra (Apress)
Chapter 1: The Era of Big Data, Hadoop, and Other Big Data Processing Frameworks
Chapter 2: Installation
Chapter 3: Introduction to Python and NumPy
Chapter 4: Spark Architecture and Resilient Distributed Dataset
Chapter 5: The Power of Pairs: Paired RDD
Chapter 6: IO in PySpark
Chapter 7: Optimizing PySpark and PySpark Streaming
Chapter 8: PySparkSQL
Chapter 9: PySpark MLlib and Linear Regression