Mastering Spark with R:The Complete Guide to Large-Scale Analysis and Modeling


Mastering Spark with R:The Complete Guide to Large-Scale Analysis and Modeling
Authors:Javier Luraschi – Kevin Kuo – Edgar Ruiz
ISBN-10 书号:149204637X
ISBN-13 书号:9781492046370
Edition 版次:1
Release Finelybook 出版日期:2019-10-29
pages 页数:296 pages


Book Description
If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to collect huge amounts of data, adding tools such as Apache Spark makes a lot of sense. With this practical book, data scientists and professionals working with large-scale data applications will learn how to use Spark from R to tackle big data and big compute problems.
Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to use R with Spark to solve different data analysis problems. This book covers relevant data science topics, cluster computing, and issues that should interest even the most advanced users.

Analyze, explore, transform, and visualize data in Apache Spark with R
Create statistical models to extract information and predict outcomes; automate the process in production-ready workflows
Perform analysis and modeling across many machines using distributed computing techniques
Use large-scale data from multiple sources and different formats with ease from within Spark
Learn about alternative modeling frameworks for graph processing, geospatial analysis, and genomics at scale
Dive into advanced topics including custom transformations, real-time data processing, and creating custom Spark extensions
Preface
1.Introduction
2.Getting Started
3.Analysis
4.Modeling
5.Pipelines
6.Clusters
7.Connections
8.Data
9.Tuning
10.Extensions
11.Distributed R
12.Streaming
13.Contributing
A.Supplemental Code References
Index

王者特权隐藏内容需2积分,请先!没有帐号? 注 册 一个!
赞(0) 觉得文章有用就打赏一下
未经允许不得转载:finelybook » Mastering Spark with R:The Complete Guide to Large-Scale Analysis and Modeling

觉得文章有用就打赏一下

支付宝扫一扫打赏

微信扫一扫打赏