Data Pipelines with Apache Airflow
Author: Bas P. Harenslak (Author), Julian Rutger de Ruiter (Author)
Publisher finelybook 出版社: Manning
Edition 版本: N/A
Publication Date 出版日期: 2021-04-27
Language 语言: English
Print Length 页数: 480 pages
ISBN-10: 1617296902
ISBN-13: 9781617296901
Book Description
From the Back Cover
Pipelines can be challenging to manage, especially when your data has to flow through a collection of application components, servers, and cloud services. Airflow lets you schedule, restart, and backfill pipelines, and its easy-to-use UI and workflows with Python scripting has users praising its incredible flexibility. Data Pipelines with Apache Airflow takes you through best practices for creating pipelines for multiple tasks, including data lakes, cloud deployments, and data science.
Data Pipelines with Apache Airflow teaches you the ins-and-outs of the Directed Acyclic Graphs (DAGs) that power Airflow, and how to write your own DAGs to meet the needs of your projects. With complete coverage of both foundational and lesser-known features, when you’re done you’ll be set to start using Airflow for seamless data pipeline development and management.
Key Features
Framework foundation and best practices
Airflow’s execution and dependency system
Testing Airflow DAGs
Running Airflow in production
For data-savvy developers, DevOps and data engineers, and system
administrators with intermediate Python skills.
About the technology
Data pipelines are used to extract, transform and load data to and from multiple sources, routing it wherever it’s needed — whether that’s visualisation tools, business intelligence dashboards, or machine learning models. Airflow streamlines the whole process, giving you one tool for programmatically developing and monitoring batch data pipelines, and integrating all the pieces you use in your data stack.
Bas Harenslak and Julian de Ruiter are data engineers with extensive experience using Airflow to develop pipelines for major companies including Heineken, Unilever, and Booking.com. Bas is a committer, and both Bas and Julian are active contributors to Apache Airflow.
About the Author
相关文件下载地址
相关推荐
- Crafting Secure Software: An engineering leader’s guide to security by design
- Game Development Patterns with Godot 4: Create resilient game systems using industry-proven solutions in Godot
- Mastering Windows 365: Deploy and Manage Cloud PCs and Windows 365 Link devices, Copilot with Intune, and Intune Suite 2nd edition
- ROS 2 from Scratch: Get started with ROS 2 and create robotics applications with Python and C++
- Scalable Application Development with NestJS: Leverage REST, GraphQL, microservices, testing, and deployment for seamless growth
- Embracing DevOps Release Management: Strategies and tools to accelerate continuous delivery and ensure quality software deployment