Antolínez García: Hands-on Guide to Apache Spark 3
1st edition, 2023
ISBN: 978-1-4842-9380-5
Publisher: Apress
Format: PDF
Copy protection: 1 - PDF Watermark
Build Scalable Computing Engines for Batch and Stream Data Processing
E-book, English, 403 pages
Series: Professional and Applied Computing
Upon completing this book, you will have the knowledge and skills to seamlessly implement large-scale batch and streaming workloads to analyze real-time data streams with Apache Spark.
What You Will Learn
- Master the concepts of Spark clusters and batch data processing
- Understand data ingestion, transformation, and data storage
- Gain insight into essential stream processing concepts and different streaming architectures
- Implement streaming jobs and applications with Spark Streaming
Who This Book Is For
Data engineers, data analysts, machine learning engineers, Python and R programmers
Target audience
Professional/practitioner
Further information and material
- Part 1: Apache Spark Batch Data Processing
- Chapter 1: Introduction to Apache Spark for Large-Scale Data Analytics
- Chapter 2: Getting Started with Apache Spark
- Chapter 3: Spark Low Level API
- Chapter 4: Spark High-Level APIs
- Chapter 5: Spark Dataset API and Adaptive Query Execution
- Chapter 6: Introduction to Apache Spark Streaming
- Chapter 7: Spark Structured Streaming
- Chapter 8: Streaming Sources and Sinks
- Chapter 9: Event Time Window Operations and Watermarking
- Chapter 10: Future Directions for Spark Streaming
- Bibliography




