DuckDB in Action

DuckDB in Action
by 作者: Mark Needham (Author), Michael Hunger (Author), Michael Simons (Author)
Publisher Finelybook 出版社: Manning
Publication Date 出版日期: 2024-08-27
Language 语言: English
Pages 页数: 312 pages
ISBN-10 书号: 1633437256
ISBN-13 书号: 9781633437258


Book Description
Dive into DuckDB and start processing gigabytes of data with ease—all with no data warehouse.

DuckDB is a cutting-edge SQL database that makes it incredibly easy to analyze big data sets right from your laptop. In DuckDB in Action you’ll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save you hundreds on your cloud bill. From data ingestion to advanced data pipelines, you’ll learn everything you need to get the most out of DuckDB—all through hands-on examples.

Open up DuckDB in Action and learn how to:

• Read and process data from CSV, JSON and Parquet sources both locally and remote
• Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
• Use DuckDB from Python, both with SQL and its “Relational”-API, interacting with databases but also data frames
• Prepare, ingest and query large datasets
• Build cloud data pipelines
• Extend DuckDB with custom functionality

Pragmatic and comprehensive, DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. You won’t need to read through pages of documentation—you’ll learn as you work. Get to grips with DuckDB’s unique SQL dialect, learning to seamlessly load, prepare, and analyze data using SQL queries. Extend DuckDB with both Python and built-in tools such as MotherDuck, and gain practical insights into building robust and automated data pipelines.

Purchase of the print book includes a free eBook in PDF and ePub formats from Manning Publications.

About the technology

DuckDB makes data analytics fast and fun! You don’t need to set up a Spark or run a cloud data warehouse just to process a few hundred gigabytes of data. DuckDB is easily embeddable in any data analytics application, runs on a laptop, and processes data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres.

About the book

DuckDB in Action guides you example-by-example from setup, through your first SQL query, to advanced topics like building data pipelines and embedding DuckDB as a local data store for a Streamlit web app. You’ll explore DuckDB’s handy SQL extensions, get to grips with aggregation, analysis, and data without persistence, and use Python to customize DuckDB. A hands-on project accompanies each new topic, so you can see DuckDB in action.

What’s inside

• Prepare, ingest and query large datasets
• Build cloud data pipelines
• Extend DuckDB with custom functionality
• Fast-paced SQL recap: From simple queries to advanced analytics

About the reader

For data pros comfortable with Python and CLI tools.

About the author

Mark Needham is a blogger and video creator at @?LearnDataWithMark. Michael Hunger leads product innovation for the Neo4j graph database. Michael Simons is a Java Champion, author, and Engineer at Neo4j.


About the Author

Mark Needham is a blogger, and video creator at @‌LearnDataWithMark, where his series on DuckDB offers viewers hands-on insights into practical database applications.

Michael Hunger works on the open source Neo4j graph database filling many roles, where leads the product innovation and developer product strategy.

Michael Simons is a Java Champion, author, and Staff Software Engineer at Neo4j and has been working professionally as a developer for more than 20 years.

Amazon page

相关文件下载地址

Formats: PDF, EPUB | 23 MB

下载地址 Download解决验证以访问链接!
打赏
未经允许不得转载:finelybook » DuckDB in Action

评论 抢沙发

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫