Accelerating Deep Neural Networks

Accelerating Deep Neural Networks book cover

Accelerating Deep Neural Networks

Author(s): Ryoma Sato (Author)

  • Publisher Finelybook 出版社: Cambridge University Press
  • Publication Date 出版日期: June 4, 2026
  • Language 语言: English
  • Print length 页数: 311 pages
  • ISBN-10: 1009687085
  • ISBN-13: 9781009687089

Book Description

Deep learning models are powerful, but are often large, slow, and expensive to run. This book is a practical guide to accelerating and compressing neural networks using proven techniques such as quantization, pruning, distillation, and fast architectures. It explains how and why these methods work, fostering a comprehensive understanding. Written for engineers, researchers, and advanced students, the book combines clear theoretical insights with hands-on PyTorch implementations and numerical results. Readers will learn how to reduce inference time and memory usage, lower deployment costs, and select the right acceleration strategy for their task. Whether you’re working with large language models, vision systems, or edge devices, this book gives you the tools and intuition needed to build faster, leaner AI systems, without sacrificing performance. It is perfect for anyone who wants to go beyond intuition and take a principled approach to optimizing AI systems

Editorial Reviews

About the Author

Ryoma Sato is Assistant Professor at the National Institute of Informatics, Japan, specializing in graph neural networks, optimal transport, and efficient deep learning. He is the author of ‘Theory and Algorithms of Optimal Transport’ (2023) and ‘Graph Neural Networks’ (2024). He is a former IOI Japan representative and ACM-ICPC World Finalist, as well as lead developer of Readable, an AI-powered PDF translation service.

View on Amazon

下载地址

PDF | 19 MB | 2026-05-11
下载地址 Download请完成验证以访问链接!
打赏
未经允许不得转载:finelybook » Accelerating Deep Neural Networks

评论 抢沙发

觉得文章有用就打赏一下文章作者

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫