DevOps for Data Science (Chapman & Hall/CRC Data Science Series)
Author: Alex Gold (Author)
Publisher finelybook 出版社: Chapman and Hall/CRC
Edition 版次: 1st
Publication Date 出版日期: 2024-06-19
Language 语言: English
Print Length 页数: 256 pages
ISBN-10: 1032100346
ISBN-13: 9781032100340
Book Description
Data Scientists are experts at analyzing, modelling and visualizing data but, at one point or another, have all encountered difficulties in collaborating with or delivering their work to the people and systems that matter. Born out of the agile software movement, DevOps is a set of practices, principles and tools that help software engineers reliably deploy work to production. This book takes the lessons of DevOps and aplies them to creating and delivering production-grade data science projects in Python and R.
This book’s first section explores how to build data science projects that deploy to production with no frills or fuss. Its second section covers the rudiments of administering a server, including Linux, application, and network administration before concluding with a demystification of the concerns of enterprise IT/Administration in its final section, making it possible for data scientists to communicate and collaborate with their organization’s security, networking, and administration teams.
Key Features:
• Start-to-finish labs take readers through creating projects that meet DevOps best practices and creating a server-based environment to work on and deploy them.
• Provides an appendix of cheatsheets so that readers will never be without the reference they need to remember a Git, Docker, or Command Line command.
• Distills what a data scientist needs to know about Docker, APIs, CI/CD, Linux, DNS, SSL, HTTP, Auth, and more.
• Written specifically to address the concern of a data scientist who wants to take their Python or R work to production.
There are countless books on creating data science work that is correct. This book, on the otherhand, aims to go beyond this, targeted at data scientists who want their work to be than merely accurate and deliver work that matters.
About the Author
Alex leads the Solutions Engineering team at Posit (formerly RStudio). In that role, he has advised hundreds of organizations of all sizes and levels of sophistication to create production-grade open-source data science environments. Before coming to Posit, he was a data scientist and data science team lead and worked on politics, consulting, and healthcare.