
Apache Polaris: The Definitive Guide: Enriching Apache Iceberg Data Lakehouses with an Open Source Catalog
Author(s): Alex Merced (Author), Andrew Madson (Author), Tomer Shiran (Author)
- Publisher: O'Reilly Media
- Publication Date: October 21, 2025
- Edition: 1st
- Language: English
- Print length: 258 pages
- ASIN: B0FBRJ7J1Y
- ISBN-13: 9798341608146
Book Description
Revolutionize your understanding of modern data management with Apache Polaris (incubating), the open source catalog designed for Apache Iceberg, the data lakehouse industry standard. This comprehensive guide takes you on a journey through the intricacies of Apache Iceberg data lakehouses, highlighting the pivotal role of Iceberg catalogs.
Authors Alex Merced, Andrew Madson, and Tomer Shiran explore Apache Polaris's architecture and features in detail, equipping you with the knowledge needed to leverage its full potential. Data engineers, data architects, data scientists, and data analysts will learn how to seamlessly integrate Apache Polaris with popular data tools like Apache Spark, Snowflake, and Dremio to enhance data management capabilities, optimize workflows, and secure datasets.
- Get a comprehensive introduction to Iceberg data lakehouses
- Understand how catalogs facilitate efficient data management and querying in Iceberg
- Explore Apache Polaris's unique architecture and its powerful features
- Deploy Apache Polaris locally, and deploy managed Apache Polaris from Snowflake and Dremio
- Perform basic table operations on Apache Spark, Snowflake, and Dremio
About the Author
Andrew Madson is a data leader with 17 years of experience leading technical teams. Currently the Head of Evangelism and Education at Tobiko (the creators of SQLMesh and SQLGlot), Andrew has held senior leadership positions at institutions such as JP Morgan, LPL Financial, MassMutual, and Arizona State University. In addition to leading data teams, Andrew is a professor of data science and analytics at several universities, where he teaches graduate courses in machine learning, statistics, SQL, R, Python, Tableau, and Power BI.
Tomer Shiran is the Founder and Chief Product Officer of Dremio, an open data lakehouse platform that enables companies to run analytics in the cloud without the cost, complexity and lock-in of data warehouses. As the company's founding CEO, Tomer built a world-class organization that has raised over $400M and now serves hundreds of the world's largest enterprises, including 3 of the Fortune 5. Prior to Dremio, Tomer was the 4th employee and VP Product of MapR, a Big Data analytics pioneer. He also held numerous product management and engineering roles at Microsoft and IBM Research, founded several websites that have served millions of users and hundreds of thousands of paying customers, and is a successful author and presenter on a wide range of industry topics. He holds an MS in Computer Engineering from Carnegie Mellon University and a BS in Computer Science from Technion - Israel Institute of Technology.
