Apache Hive Essentials: Essential techniques to help you process, and get unique insights from, big data, 2nd Edition 版本

By 作者: Dayong Du

ISBN-10 书号: 1788995090

ISBN-13 书号:: 9781788995092

Edition 版本: 2nd Revised edition

Release 出版日期: 2018-06-30

pages 页数: (210 )


$29.99


Book Description

In this book, we prepare you for your journey into big data by frstly introducing you to backgrounds in the big data domain, alongwith the process of setting up and getting familiar with your Hive working environment.

Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skills in using the Hive language in an effcient manner. Toward the end, the book focuses on advanced topics, such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.

By the end of the book, you will be familiar with Hive and able to work effeciently to find solutions to big data problems

Contents
1: OVERVIEW OF BIG DATA AND HIVE
2: SETTING UP THE HIVE ENVIRONMENT
3: DATA DEFINITION AND DESCRIPTION
4: DATA CORRELATION AND SCOPE
5: DATA MANIPULATION
6: DATA AGGREGATION AND SAMPLING
7: PERFORMANCE CONSIDERATIONS
8: EXTENSIBILITY CONSIDERATIONS
9: SECURITY CONSIDERATIONS
10: WORKING WITH OTHER TOOLS
What You Will Learn
Create and set up the Hive environment
Discover how to use Hive’s definition language to describe data
Discover interesting data by joining and filtering datasets in Hive
Transform data by using Hive sorting, ordering, and functions
Aggregate and sample data in different ways
Boost Hive query performance and enhance data security in Hive
Customize Hive to your needs by using user-defined functions and integrate it
with other tools
Authors
Dayong Du
Dayong Du is a big data practitioner, author, and coach with over 10 years’ experience in technology consulting, designing, and implementing enterprise big data architecture and analytics in various industries, including finance, media, travel, and telecoms. He has a master’s degree in computer science from Dalhousie University and is a Cloudera certified Hadoop developer. He is a cofounder of Toronto Big Data Professional Association and the founder of DataFiber website.

由于版权问题,我们将只保留该文章的介绍,不再提供版权文件的下载,对您造成的不便敬请谅解。
您可以登陆 获取帮助.

Apache Hive Essentials, 2nd Edition
Tagged on:

发表评论

电子邮件地址不会被公开。 必填项已用*标注