
Test-Driven Data Analysis
(Chapman & Hall/CRC Data Science Series)
Author: Nicholas J. Radcliffe
- Publisher: Chapman and Hall/CRC
- Publication Date: May 19, 2026
- Edition: 1st
- Language: English
- Print length: 444 pages
- ISBN-10: 1032897155
- ISBN-13: 9781032897158
Book Description
Test-driven data analysis applies ideas from test-driven software development to data-intensive work, including data science, data analysis, and data engineering. It is a methodology for improving the quality of data and of analytical pipelines and processes. It can be thought of as data analysis as if the answers actually matter.
Test-driven data analysis can be thought of as a sibling to reproducible research, with similar concerns but greater emphasis on automated testing and less need for a human to reproduce results. Extensive checklists are provided that can be used to improve quality before, during, and after analysis.
Key Features:
• Prevents costly errors in analytical processes before they reach production through automated data validation and reference testing of data pipelines.
• Provides actionable checklists for issues beyond the reach of automated testing.
• Equips readers with open-source Python tools and language-agnostic command-line interfaces.
• Addresses testing challenges for modern LLM-based systems including chat-bots and coding assistants.
• Instills in analysts an inner voice that is always asking: “How is this misleading data misleading me?”
Editorial Reviews
About the Author
