Practical Apache Spark: Using the Scala API
Authors: Subhashini Chellappan
ISBN-10: 1484236513
ISBN-13: 9781484236512
Edition: 1st ed.
Publication Date: 2018-12-13
Print Length: 296 pages
Book Description
Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLlib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka, with examples. You’ll follow a learn-to-do-by-yourself approach: learn the concepts, practice the code snippets in Scala, and complete the assignments given to get overall exposure.
On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.
What You Will Learn
Discover the functional programming features of Scala
Understand the complete architecture of Spark and its components
Integrate Apache Spark with Hive and Kafka
Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
Work with different machine learning concepts and libraries using Spark’s MLlib packages
Cover
1. Scala: Functional Programming Aspects
2. Single and Multinode Cluster Setup
3. Introduction to Apache Spark and Spark Core
4. Spark SQL, DataFrames, and Datasets
5. Introduction to Spark Streaming
6. Spark Structured Streaming
7. Spark Streaming with Kafka
8. Spark Machine Learning Library
9. Working with SparkR
10. Spark Real-Time Use Case