Vectorization: A Practical Guide to Efficient Implementations of Machine Learning Algorithms
Author: Edward DongBo Cui (Author)
Publisher finelybook 出版社: Wiley-IEEE Press
Edition 版本: 1st edition
Publication Date 出版日期: 2024-12-24
Language 语言: English
Print Length 页数: 448 pages
ISBN-10: 1394272944
ISBN-13: 9781394272945
Book Description
Book Description
From the Back Cover
Enables readers to develop foundational and advanced vectorization skills for scalable data science and machine learning and address real-world problems
Offering insights across various domains such as computer vision and natural language processing, Vectorization covers the fundamental topics of vectorization including array and tensor operations, data wrangling, and batch processing. This book illustrates how the principles discussed lead to successful outcomes in machine learning projects, serving as concrete examples for the theories explained, with each chapter including practical case studies and code implementations using NumPy, TensorFlow, and PyTorch.
Each chapter has one or two types of contents: either an introduction/comparison of the specific operations in the numerical libraries (illustrated as tables) and/or case study examples that apply the concepts introduced to solve a practical problem (as code blocks and figures). Readers can approach the knowledge presented by reading the text description, running the code blocks, or examining the figures.
Written by the developer of the first recommendation system on the Peacock streaming platform, Vectorization explores sample topics including:
- Basic tensor operations and the art of tensor indexing, elucidating how to access individual or subsets of tensor elements
- Vectorization in tensor multiplications and common linear algebraic routines, which form the backbone of many machine learning algorithms
- Masking and padding, concepts which come into play when handling data of non-uniform sizes, and string processing techniques for natural language processing (NLP)
- Sparse matrices and their data structures and integral operations, and ragged or jagged tensors and the nuances of processing them
From the essentials of vectorization to the subtleties of advanced data structures, Vectorization is an ideal one-stop resource for both beginners and experienced practitioners, including researchers, data scientists, statisticians, and other professionals in industry, who seek academic success and career advancement.
About the Author
Edward DongBo Cui is a Data Science and Machine Learning Engineering Leader who holds a PhD in Neuroscience from Case Western Reserve University, USA. Edward served as Director of Data Science at NBC Universal, building the first recommendation system on the new Peacock streaming platform. Previously, he was Lead Data Scientist at Nielsen Global Media. He is an expert in ML engineering, research, and MLOps to drive data-centric decision-making and enhance product innovation.
相关文件下载地址
相关推荐
- Building AI Intensive Python Applications: Create intelligent apps with LLMs and vector databases
- Cloud Solution Architect’s Career Master Plan: Proven techniques and effective tips to help you become a successful solution architect
- Designing Distributed Systems: Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes, 2nd Edition
- DuckDB: Up and Running: Fast Data Analytics and Reporting
- Essential Guide to LLMOps: Implementing effective LLMOps strategies and tools from data to deployment
- Lectures on Advanced Topics in Categorical Data Analysis