Scaling Machine Learning with Spark: Distributed ML with MLlib, TensorFlow, and PyTorch


by Adi Polak (Author)
Publisher: O'Reilly Media; 1st edition (April 11, 2023)
Language: English
Pages: 291
ISBN-10: 1098106822
ISBN-13: 9781098106829


Book Description
Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals, allowing data and ML practitioners to collaborate and understand each other better.
Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.
You will:
  • Explore machine learning, including distributed computing concepts and terminology
  • Manage the ML lifecycle with MLflow
  • Ingest data and perform basic preprocessing with Spark
  • Explore feature engineering, and use Spark to extract features
  • Train a model with MLlib and build a pipeline to reproduce it
  • Build a data system to combine the power of Spark with deep learning
  • Get a step-by-step example of working with distributed TensorFlow
  • Use PyTorch to scale machine learning, and explore its internal architecture
