Data Pipelines Pocket Reference: Moving and Processing Data for Analytics
Author: James Densmore
Print Length 页数: 200 pages
Edition 版本: 1
Language 语言: English
Publisher finelybook 出版社: O’Reilly Media
Released: 2021-06-15
ISBN-10: 1492087831
ISBN-13: 9781492087830
Book Description
Data pipelines are the foundation for success in data analytics and machine learning. Moving data from many diverse sources and processing it to provide context is the difference between having data and actually gaining value from it. This pocket reference defines data pipelines and explains how they work in today’s modern data stack.
You’ll learn common considerations and key decision points when implementing pipelines,such as data pipeline design patterns,data ingestion implementation,data transformation,the orchestration of pipelines,and build versus buy decision making. This book addresses the most common decisions made Author: data professionals and discusses foundational concepts that apply to open source frameworks,commercial products,and homegrown solutions.
You’ll learn:
What a data pipeline is and how it works
How data is moved and processed on modern data infrastructure,including cloud platforms
Common tools and products used Author: data engineers to build pipelines
How pipelines support machine learning and analytics needs
Considerations for pipeline maintenance,testing,and alerting