Practical Apache Spark: Using the Scala API
Author: Subhashini Chellappan , Dharanitharan Ganesan
Publisher finelybook 出版社: Apress
Edition 版本: First Edition
Publication Date 出版日期: 2018-12-13
Language 语言: English
Print Length 页数: 296 pages
ISBN-10: 1484236513
ISBN-13: 9781484236512
Book Description
Work with Apache Spark using Scala to deploy and set up single-node, multi-node, and high-availability clusters. This book discusses various components of Spark such as Spark Core, DataFrames, Datasets and SQL, Spark Streaming, Spark MLib, and R on Spark with the help of practical code snippets for each topic. Practical Apache Spark also covers the integration of Apache Spark with Kafka with examples. You’ll follow a learn-to-do-by-yourself approach to learning – learn the concepts, practice the code snippets in Scala, and complete the assignments given to get an overall exposure.
On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.
What You Will Learn
- Discover the functional programming features of Scala
- Understand the completearchitecture of Spark and its components
- Integrate Apache Spark with Hive and Kafka
- Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
- Work with different machine learning concepts and libraries using Spark’s MLlib packages
Who This Book Is For
Developers and professionals who deal with batch and stream data processing.
From the Back Cover
On completion, you’ll have knowledge of the functional programming aspects of Scala, and hands-on expertise in various Spark components. You’ll also become familiar with machine learning algorithms with real-time usage.
You will:
- Discover the functional programming features of Scala
- Understand the complete architecture of Spark and its components
- Integrate Apache Spark with Hive and Kafka
- Use Spark SQL, DataFrames, and Datasets to process data using traditional SQL queries
- Work with different machine learning concepts and libraries using Spark’s MLlib packages
About the Author
Subhashini Chellappan is a technology enthusiast with expertise in the big data and cloud space. She has rich experience in both academia and the software industry. Her areas of interest and expertise are centered on business intelligence, big data analytics and cloud computing.
Dharanitharan Ganesan is a senior analyst with five years of experience in IT. He has a high level of exposure and experience in big data – Apache Hadoop, Apache Spark and various Hadoop ecosystem components. He has a proven track record of improving efficiency and productivity through the automation of various routine and administrative functions in business intelligence and big data technologies. His areas of interest and expertise are centered on machine learning algorithms, statistical modelling and predictive analysis.
下载地址
相关推荐
Kotlin from Scratch: A Project-Based Introduction for the Intrepid Programmer
Arduino Programming using Simulink
Technology-Driven Supply Chain Management in Industrial 4.0
Mathematics in Architecture, Art, Nature, and Beyond
Power Devices and Internet of Things for Intelligent System Design
Blockchain-Enabled Internet of Things Applications in Healthcare: Current Practices and Future Directions
评论 抢沙发
觉得文章有用就打赏一下
您的打赏,我们将继续给力更多优质内容
支付宝扫一扫

微信扫一扫
