Apache Kafka – Building Data Streaming Applications
by: Manish Kumar – Chanchal Singh
ISBN-10: 1787283984
ISBN-13: 9781787283985
Publication Date 出版日期: 2017-10-05
Print Length 页数: 275
Book Description
By finelybook
Apache Kafka is a popular distributed streaming platform that acts as a messaging queue or an enterprise messaging system. It lets you publish and subscribe to a stream of records,and process them in a fault-tolerant way as they occur.
This book is a comprehensive guide to designing and architecting enterprise-grade streaming applications using Apache Kafka and other big data tools. It includes best practices for building such applications,and tackles some common challenges such as how to use Kafka efficiently and handle high data volumes with ease. This book first takes you through understanding the type messaging system and then provides a thorough introduction to Apache Kafka and its internal details. The second part of the book takes you through designing streaming application using various frameworks and tools such as Apache Spark,Apache Storm,and more. Once you grasp the basics,we will take you through more advanced concepts in Apache Kafka such as capacity planning and security.
By the end of this book,you will have all the information you need to be comfortable with using Apache Kafka,and to design efficient streaming data applications with it.
What You Will Learn
Learn the basics of Apache Kafka from scratch
Use the basic building blocks of a streaming application
Design effective streaming applications with Kafka using Spark,Storm &,and Heron
Understand the importance of a low -latency ,high- throughput,and fault-tolerant messaging system
Make effective capacity planning while deploying your Kafka Application
Understand and implement the best security practices
Authors
Manish Kumar
Manish Kumar is a Technical Architect at DataMetica Solution Pvt. Ltd.. He has approximately 11 years,experience in data management,working as a Data Architect and Product Architect. He has extensive experience in building effective ETL pipelines,implementing security over Hadoop,and providing the best possible solutions to Data Science problems. Before joining the world of big data,he worked as an Tech Lead for Sears Holding,India. He is a regular speaker on big data concepts such as Hadoop and Hadoop Security in various events. Manish has a Bachelor’s degree in Information Technology.
Chanchal Singh
Chanchal Singh is a Software Engineer at DataMetica Solution Pvt. Ltd.. He has over three years’ experience in product development and architect design,working as a Product Developer,Data Engineer,and Team Lead. He has a lot of experience with different technologies such as Hadoop,Spark,Storm,Kafka,Hive,Pig,Flume,Java,Spring,and many more. He believes in sharing knowledge and motivating others for innovation. He is the co-organizer of the Big Data Meetup – Pune Chapter.
He has been recognized for putting innovative ideas into organizations. He has a Bachelor’s degree in Information Technology from the University of Mumbai and a Master’s degree in Computer Application from Amity University. He was also part of the Entrepreneur Cell in IIT Mumbai.
Contents
Chapter 1: Introduction to Messaging Systems
Chapter 2: Introducing Kafka the Distributed Messaging Platform
Chapter 3: Deep Dive into Kafka Producers
Chapter 4: Deep Dive into Kafka Consumers
Chapter 5: Building Spark Streaming Applications with Kafka
Chapter 6: Building Storm Applications with Kafka
Chapter 7: Using Kafka with Confluent Platform
Chapter 8: Building ETL Pipelines Using Kafka
Chapter 9: Building Streaming Applications Using Kafka Streams
Chapter 10: Kafka Cluster Deployment
Chapter 11: Using Kafka in Big Data Applications
Chapter 12: Securing Kafka
Chapter 13: Streaming Application Design Considerations