Deep Learning at Scale: At the Intersection of Hardware, Software, and Data
Author: Suneeta Mall (Author)
Publisher finelybook 出版社: O’Reilly Media
Edition 版本: 1st
Publication Date 出版日期: 2024-07-30
Language 语言: English
Print Length 页数: 400 pages
ISBN-10: 1098145283
ISBN-13: 9781098145286
Book Description
Bringing a deep-learning project into production at scale is quite challenging. To successfully scale your project, a foundational understanding of full stack deep learning, including the knowledge that lies at the intersection of hardware, software, data, and algorithms, is required.
This book illustrates complex concepts of full stack deep learning and reinforces them through hands-on exercises to arm you with tools and techniques to scale your project. A scaling effort is only beneficial when it’s effective and efficient. To that end, this guide explains the intricate concepts and techniques that will help you scale effectively and efficiently.
You’ll gain a thorough understanding of:
- How data flows through the deep-learning network and the role the computation graphs play in building your model
- How accelerated computing speeds up your training and how best you can utilize the resources at your disposal
- How to train your model using distributed training paradigms, i.e., data, model, and pipeline parallelism
- How to leverage PyTorch ecosystems in conjunction with NVIDIA libraries and Triton to scale your model training
- Debugging, monitoring, and investigating the undesirable bottlenecks that slow down your model training
- How to expedite the training lifecycle and streamline your feedback loop to iterate model development
- A set of data tricks and techniques and how to apply them to scale your training model
- How to select the right tools and techniques for your deep-learning project
- Options for managing the compute infrastructure when running at scale
About the Author
下载地址
相关推荐
Hands-on Splunk on AWS: Complete guide to deploying and administering Splunk for data analysis
LLMs in Production: From language models to successful products
Energy Optimization and Security in Federated Learning for IoT Environments
Security Automation with Python: Practical Python solutions for automating and scaling security operations
Generative AI for Financial Services: Challenges, anti-patterns, and best practices
Enterprise Fortress: The Ultimate Handbook for Enterprise Security Architecture