Practical Lakehouse Architecture: Designing and Implementing Modern Data Platforms at Scale
Author: Gaurav Ashok Thalpati
Publisher: O’Reilly Media
Edition: 1st
Publication Date: 2024-08-27
Language: English
Print Length: 283 pages
ISBN-10: 1098153014
ISBN-13: 9781098153014
Book Description
This concise yet comprehensive guide explains how to adopt a data lakehouse architecture to implement modern data platforms. It reviews the design considerations, challenges, and best practices for implementing a lakehouse and provides key insights into the ways that using a lakehouse can impact your data platform, from managing structured and unstructured data and supporting BI and AI/ML use cases to enabling more rigorous data governance and security measures.
Practical Lakehouse Architecture shows you how to:
- Understand key lakehouse concepts and features like transaction support, time travel, and schema evolution
- Understand the differences between traditional and lakehouse data architectures
- Differentiate between various file formats and table formats
- Design lakehouse architecture layers for storage, compute, metadata management, and data consumption
- Implement data governance and data security within the platform
- Evaluate technologies and decide on the best technology stack to implement the lakehouse for your use case
- Make critical design decisions and address practical challenges to build a future-ready data platform
- Start your lakehouse implementation journey and migrate data from existing systems to the lakehouse