Python Polars: The Definitive Guide: Transforming, Analyzing, and Visualizing Data with a Fast and Expressive DataFrame API
Author: Jeroen Janssens (Author), Thijs Nieuwdorp (Author)
Publisher finelybook 出版社: O’Reilly Media
Publication Date 出版日期: 2025-04-01
Edition 版本: 1st
Language 语言: English
Print Length 页数: 501 pages
ISBN-10: 1098156080
ISBN-13: 9781098156084
Book Description
Unlock the power of Polars, a Python package for transforming, analyzing, and visualizing data. In this hands-on guide, Jeroen Janssens and Thijs Nieuwdorp walk you through every feature of Polars, showing you how to use it for real-world tasks like data wrangling, exploratory data analysis, building pipelines, and more.
Whether you’re a seasoned data professional or new to data science, you’ll quickly master Polars’ expressive API and its underlying concepts. You don’t need to have experience with pandas, but if you do, this book will help you make a seamless transition. The many practical examples and real-world datasets are available on GitHub, so you can easily follow along.
- Process data from CSV, Parquet, spreadsheets, databases, and the cloud
- Get a solid understanding of Expressions, the building blocks of every query
- Handle complex data types, including text, time, and nested structures
- Use both eager and lazy APIs, and know when to use each
- Visualize your data with Altair, hvPlot, plotnine, and Great Tables
- Extend Polars with your own Python functions and Rust plugins
- Leverage GPU acceleration to boost performance even further
Review
— Wes McKinney, Creator of pandas, Principal Architect, Posit PBC
Jeroen and Thijs have done an excellent job-not only teaching you the ins and outs of Polars but also helping you unlearn habits from other tools like pandas. They really bring out the power of expressions, which are key to using Polars effectively, guiding you toward a more declarative, functional approach to data processing. As you work through this book, I’m sure you’ll gain a deep understanding of Polars and discover fresh ways to approach data processing.
Polars has brought a ton of much-needed innovation to the data frame world with its much more streamlined API and efficient implementation. As a result, the capabilities of data analysis in Python are pushed to new heights. We also greatly enjoy Ritchie and team as a part of the Amsterdam data ecosystem.
I greatly respect Jeroen’s commitment to teaching data science in an accessible way, whether it be on the command line or elsewhere. His and Thijs’ book is a testament to this commitment and I recommend it to the data science community.