Databricks ML in Action: Learn how Databricks supports the entire ML lifecycle end to end from data ingestion to the model deployment

Databricks ML in Action: Learn how Databricks supports the entire ML lifecycle end to end from data ingestion to the model deployment
by 作者: Stephanie Rivera (Author), Anastasia Prokaieva (Author), Amanda Baker (Author)
Publisher Finelybook 出版社: Packt Publishing
Publication Date 出版日期: 2024-05-17
Language 语言: English
Pages 页数: 280 pages
ISBN-10 书号: 1800564899
ISBN-13 书号: 9781800564893


Book Description

Get to grips with autogenerating code, deploying ML algorithms, and leveraging various ML lifecycle features on the Databricks Platform, guided by best practices and reusable code for you to try, alter, and build on


Key Features

  • Build machine learning solutions faster than peers only using documentation
  • Enhance or refine your expertise with tribal knowledge and concise explanations
  • Follow along with code projects provided in GitHub to accelerate your projects
  • Purchase of the print or Kindle book includes a free PDF eBook


Book Description

Discover what makes the Databricks Data Intelligence Platform the go-to choice for top-tier machine learning solutions. Databricks ML in Action presents cloud-agnostic, end-to-end examples with hands-on illustrations of executing data science, machine learning, and generative AI projects on the Databricks Platform.

You’ll develop expertise in Databricks’ managed MLflow, Vector Search, AutoML, Unity Catalog, and Model Serving as you learn to apply them practically in everyday workflows. This Databricks book not only offers detailed code explanations but also facilitates seamless code importation for practical use. You’ll discover how to leverage the open-source Databricks platform to enhance learning, boost skills, and elevate productivity with supplemental resources.

By the end of this book, you’ll have mastered the use of Databricks for data science, machine learning, and generative AI, enabling you to deliver outstanding data products.


What you will learn

  • Set up a workspace for a data team planning to perform data science
  • Monitor data quality and detect drift
  • Use autogenerated code for ML modeling and data exploration
  • Operationalize ML with feature engineering client, AutoML, VectorSearch, Delta Live Tables, AutoLoader, and Workflows
  • Integrate open-source and third-party applications, such as OpenAI’s ChatGPT, into your AI projects
  • Communicate insights through Databricks SQL dashboards and Delta Sharing
  • Explore data and models through the Databricks marketplace


Who this book is for

This book is for machine learning engineers, data scientists, and technical managers seeking hands-on expertise in implementing and leveraging the Databricks Data Intelligence Platform and its Lakehouse architecture to create data products.

Table of


Contents

  1. Getting Started with This Book and Lakehouse Concepts
  2. Designing Databricks: Day One
  3. Building Out Our Bronze Layer
  4. Getting to Know Your Data
  5. Feature Engineering on Databricks
  6. Searching for a Signal
  7. Productionizing ML on Databricks
  8. Monitoring, Evaluating, and More


About the Author

Stephanie Rivera has worked in big data and machine learning for 12 years. She collaborates with teams and companies as they design their Lakehouse as a Sr. Solutions Architect for Databricks. Previously Stephanie was the VP, Data Intelligence for a global company, taking in 20+ terabytes of data daily. She led the data science, data engineering, and business intelligence teams.

Anastasia Prokaieva began her career 9 years ago as a research scientist at CEA (France), focusing on large data analysis and satellite data assimilation, treating terabytes of data. She has been working within the big data analysis and machine learning domain since then. In 2021, she joined Databricks and became the regional AI subject matter expert. On a daily basis, Anastasia consults Databricks users on best practices for implementing AI projects end-to-end. She also delivers training and workshops to democratize AI. Anastasia holds two MSc degrees in theoretical physics and energy science.

Mandy Baker began her career in data 8 years ago. She loves leveraging her skills as a data scientist to orchestrate transformative journeys for companies across diverse industries as a Solutions Architect for Databricks. Her experiences have brought her from large corporations to small startups and everything in between. Mandy is a graduate of Carnegie Mellon University and the University of Washington.

Hayley Horn started her data career 15 years ago as a data quality consultant on enterprise data integration projects. As a data scientist, she specialized in customer insights and strategy, and presented at Data Science and AI conferences in the US and Europe. She is currently a Sr. Solutions Architect for Databricks, with expertise in data science and technology modernization. A graduate of the MS Data Science program at Southern Methodist University in Dallas, Texas, USA, she is now a capstone advisor to students in their final semesters of the program.

Amazon page

相关文件下载地址

Formats: PDF, EPUB | 49 MB

打赏
未经允许不得转载:finelybook » Databricks ML in Action: Learn how Databricks supports the entire ML lifecycle end to end from data ingestion to the model deployment

评论 抢沙发

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫