Data Science: The Hard Parts: Techniques for Excelling at Data Science
Author: Daniel Vaughan (Author)
Publisher finelybook 出版社: O’Reilly Media
Edition 版本: 1st
Publication Date 出版日期: 2023-12-05
Language 语言: English
Print Length 页数: 254 pages
ISBN-10: 1098146476
ISBN-13: 9781098146474
Book Description
This practical guide provides a collection of techniques and best practices that are generally overlooked in most data engineering and data science pedagogy. A common misconception is that great data scientists are experts in the “big themes” of the discipline—machine learning and programming. But most of the time, these tools can only take us so far. In practice, the smaller tools and skills really separate a great data scientist from a not-so-great one.
Taken as a whole, the lessons in this book make the difference between an average data scientist candidate and a qualified data scientist working in the field. Author Daniel Vaughan has collected, extended, and used these skills to create value and train data scientists from different companies and industries.
With this book, you will:
- Understand how data science creates value
- Deliver compelling narratives to sell your data science project
- Build a business case using unit economics principles
- Create new features for a ML model using storytelling
- Learn how to decompose KPIs
- Perform growth decompositions to find root causes for changes in a metric
Daniel Vaughan is head of data at Clip, the leading paytech company in Mexico. He’s the author of Analytical Skills for AI and Data Science (O’Reilly).
About the Author
相关推荐
C++ Software Design: Design Principles and Patterns for High-Quality Software
Think Stats: Exploratory Data Analysis, 3rd Edition
Cloud Application Architecture Patterns: Designing, Building, and Modernizing for the Cloud
Building Generative AI Services with FastAPI: A Practical Approach to Developing Context-Rich Generative AI Applications
Hands-On AWS CDK: Building Cloud Native Applications with Infrastructure as Code
Data Engineering Design Patterns: Recipes for Solving the Most Common Data Engineering Problems