Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale

Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale
by 作者: Gaurav Ashok Thalpati (Author)
Publisher Finelybook 出版社: O’Reilly Media
Edition 版本: 1st
Publication Date 出版日期: 2024-08-27
Language 语言: English
Pages 页数: 283 pages
ISBN-10 书号: 1098153014
ISBN-13 书号: 9781098153014


Book Description

This concise yet comprehensive guide explains how to adopt a data lakehouse architecture to implement modern data platforms. It reviews the design considerations, challenges, and best practices for implementing a lakehouse and provides key insights into the ways that using a lakehouse can impact your data platform, from managing structured and unstructured data and supporting BI and AI/ML use cases to enabling more rigorous data governance and security measures.

Practical Lakehouse Architecture shows you how to:

  • Understand key lakehouse concepts and features like transaction support, time travel, and schema evolution
  • Understand the differences between traditional and lakehouse data architectures
  • Differentiate between various file formats and table formats
  • Design lakehouse architecture layers for storage, compute, metadata management, and data consumption
  • Implement data governance and data security within the platform
  • Evaluate technologies and decide on the best technology stack to implement the lakehouse for your use case
  • Make critical design decisions and address practical challenges to build a future-ready data platform
  • Start your lakehouse implementation journey and migrate data from existing systems to the lakehouse


About the Author

Gaurav Thalpati is an independent consultant with over two decades of experience building data and analytics platforms. He has worked on various data projects and played different roles, including ETL/BI developer, data engineer, data analyst, and data architect. Based in Pune, India, Gaurav is passionate about sharing his knowledge with other data practitioners and guiding them in designing and implementing scalable and cost-effective data platforms.

Amazon page

相关文件下载地址

Formats: PDF(conv), EPUB | 15 MB

打赏
未经允许不得转载:finelybook » Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale

评论 抢沙发

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫