Software Engineering for Data Scientists
Author: Andrew Treadway (Author)
Publisher finelybook 出版社: Manning
Publication Date 出版日期: 2023-2024 (estimated)
Language 语言: English
Print Length 页数: 325 pages (estimated)
ISBN-10: 1633438848
ISBN-13: 9781633438842
Book Description
These easy to learn and apply software engineering techniques will radically improve collaboration, scaling, and deployment in your data science projects.
In Software Engineering for Data Scientists you’ll learn to improve performance and efficiency by:
Using source control
Handling exceptions and errors in your code
Improving the design of your tools and applications
Scaling code to handle large data efficiently
Testing model and data processing code before deployment
Scheduling a model to run automatically
Packaging Python code into reusable libraries
Generating automated reports for monitoring a model in production
Software Engineering for Data Scientists presents important software engineering principles that will radically improve the performance and efficiency of data science projects. Author and Meta data scientist Andrew Treadway has spent over a decade guiding models and pipelines to production. This practical handbook is full of his sage advice that will change the way you structure your code, monitor model performance, and work effectively with the software engineering teams.
about the technology
Many basic software engineering skills apply directly to data science! As a data scientist, learning the right software engineering techniques can save you a world of time and frustration. Source control simplifies sharing, tracking, and backing up code. Testing helps reduce future errors in your models or pipelines. Exception handling automatically responds to unexpected events as they crop up. Using established engineering conventions makes it easy to collaborate with software developers. This book teaches you to handle these situations and more in your data science projects.
about the book
In Software Engineering for Data Scientists you’ll find tested software engineering techniques that will make your daily life easier as a data scientist. You’ll quickly get up to speed with how software engineering can solve common problems, then dive straight into source control, object-oriented programming, code testing, and packaging. Hands-on examples make it easy to see how new principles can be put into practice in a data science context.
Improve code structuring and reusability in a customer churn prediction model
Learn to scale data processing code by experimenting with Spotify data
Build a lightweight web app to monitor a machine learning model
Master the software design conventions that make your code easy to share and modify
…and much more!
Software Engineering for Data Scientists (MEAP V04)
1. 1_Software_engineering_principles
2. 2_Source_control_for_data_scientists
3. 3_Writing_robust_code_and_error_handling
4. 4_Object-oriented_programming_for_data_scientists
5. 5_Creating_progress_bars_and_time-outs_in_Python
6. 6_Making_your_code_faster_and_more_efficient
7. 7_Memory_management_with_Python
8. 8_Alternatives_to_Pandas
9. 9_Putting_your_code_into_production
10. 10_Testing_in_Python
11. 11_Scheduling_and_packaging_your_code
12. Appendix_A._Setting_up_Dask_on_a_Remote_Cluster
13. Welcome
14. index
Software Engineering for Data Scientists (MEAP V04) 9781633438842.rar