PySpark SQL Recipes: With HiveQL,Dataframe and Graphframes


PySpark SQL Recipes: With HiveQL,Dataframe and Graphframes
Authors: Raju Kumar Mishra
ISBN-10: 148424334X
ISBN-13: 9781484243343
Edition 版次: 1st ed.
Publication Date 出版日期: 2019-03-19
Print Length 页数: 348 pages


Book Description
By finelybook

Carry out data analysis with PySpark SQL,graphframes,and graph data processing using a problem-solution approach. This book provides solutions to problems related to dataframes,data manipulation summarization,and exploratory analysis. You will improve your skills in graph data analysis using graphframes and see how to optimize your PySpark SQL code.
PySpark SQL Recipes starts with recipes on creating dataframes from different types of data source,data aggregation and summarization,and exploratory data analysis using PySpark SQL. You’ll also discover how to solve problems in graph analysis using graphframes.
On completing this book,you’ll have ready-made code for all your PySpark SQL tasks,including creating dataframes using data from different file formats as well as from SQL or NoSQL databases.
What You Will Learn
Understand PySpark SQL and its advanced features
Use SQL and HiveQL with PySpark SQL
Work with structured streaming
Optimize PySpark SQL
Master graphframes and graph processing
Cover
1. Introduction to PySpark SQL
2. Installation
3. IO in PySpark SQL
4. Operations on PySpark SQL DataFrames
5. Data Merging and Data Aggregation Using PySparkSQL
6. SQL,NoSQL,and PySparksQL
7. Optimizing PySpark SQL
8. Structured Streaming
9. GraphFrames

相关文件下载地址

下载地址 Download解决验证以访问链接!
打赏
未经允许不得转载:finelybook » PySpark SQL Recipes: With HiveQL,Dataframe and Graphframes

评论 抢沙发

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫