Data Exploration and Preparation with BigQuery: A practical guide to cleaning, transforming, and analyzing data for business insights


Author: Mike Kahn
Publisher: Packt Publishing
Publication Date: 2023-11-20
Language: English
Print Length: 264 pages
ISBN-10: 1805125265
ISBN-13: 9781805125266

Learn to understand and prepare data using BigQuery to make your data accurate, reliable, and ready for analysis and modeling
Key Features

  • Learn how to explore and prepare data with the BigQuery web UI (Cloud Console) and the bq command-line interface using real data sources
  • Learn best practices for optimizing processing, storage, and query performance in BigQuery
  • Work through six hands-on exercises, including loading and transforming data, analyzing Google Trends in notebooks, creating visualizations in Looker Studio, and full solutions for analyzing advertising, transportation, and customer support data

Book Description


Data professionals encounter a multitude of challenges, such as handling large volumes of data, dealing with data silos, and lacking appropriate tools. Datasets often arrive in different conditions and formats, demanding considerable time from analysts, engineers, and scientists to process and uncover insights. The complexity of the data life cycle often hinders teams and organizations from extracting the desired value from their data assets.
Data Exploration and Preparation with BigQuery offers a holistic solution to these challenges.
The book begins with the basics of BigQuery while covering the fundamentals of data exploration and preparation. It then progresses to demonstrate how to use BigQuery for these tasks and explores the array of big data tools at your disposal within the Google Cloud ecosystem.
The book doesn’t merely offer theoretical insights; it’s a hands-on companion that walks you through properly structuring your tables for efficiency and ensures adherence to data preparation best practices. You’ll also learn when to use Dataflow, BigQuery, and Dataprep for ETL and ELT workflows. The book guides you through various case studies, demonstrating how BigQuery can be used to solve real-world data problems.
By the end of this book, you’ll have mastered the use of SQL to explore and prepare datasets in BigQuery, unlocking deeper insights from data.
What you will learn

  • Assess the quality of a dataset and learn best practices for data cleansing
  • Prepare data for analysis, visualization, and machine learning
  • Explore approaches to visualize data in BigQuery
  • Apply acquired knowledge to real-life scenarios and design patterns
  • Set up and organize BigQuery resources
  • Use SQL and other tools to navigate datasets
  • Implement best practices for querying BigQuery datasets
  • Gain proficiency in using data preparation tools, techniques, and strategies

Who This Book Is For
This book is for data analysts who want to learn how to explore and prepare data using BigQuery. If you are a data analyst who is experienced in SQL, reporting, data modeling, and transformation, and looking to begin using BigQuery, this book is for you.
Business users who want to understand how to use BigQuery to make better data-driven decisions will also benefit from this book, as will program and project managers and other data professionals, thanks to its easy-to-follow approach. This book is well suited to anyone planning to use BigQuery as a data warehouse to provide insights to their business from large datasets.
Table of Contents

  1. Introducing BigQuery and Its Components
  2. BigQuery Organization and Design
  3. Exploring Data in BigQuery
  4. Loading and Transforming
  5. Querying BigQuery Data
  6. Exploring Data with Notebooks
  7. Further Exploring and Visualizing Data
  8. Data Preparation Tools
  9. Cleansing and Transforming Data
  10. Best Practices for Data Preparation, Optimization and Cost Control
  11. Hands-On Exercise – Analyzing Advertising Data
  12. Hands-On Exercise – Analyzing Transportation Data
  13. Hands-On Exercise – Analyzing Customer Support Data
  14. Summary of Key Points, Future Directions

What This Book Covers
Chapter 1, Introducing BigQuery and Its Components, Learn how BigQuery operates so that you can use it more effectively. Take an ‘under the hood’ look at the technologies that deliver BigQuery. Understand data exploration and preparation goals.
Chapter 2, BigQuery Organization and Design, Understand how to build a secure and collaborative BigQuery environment. Gain a strong understanding of all the services that deliver BigQuery beyond the SQL query. Understand design patterns for deploying BigQuery resources.
Chapter 3, Exploring Data in BigQuery, Review various ways to explore data in BigQuery along with the process and steps of data exploration. Learn about the different methods to access data in BigQuery and best practices to get started.
Chapter 4, Loading and Transforming, Explore the techniques and best practices for loading data into BigQuery. Review the tools and methodologies for transforming and processing data with BigQuery. This chapter includes a hands-on exercise: data loading and transformation in BigQuery.
Chapter 5, Querying BigQuery Data, This chapter will familiarize you with the structure of a query and give you a strong foundation in crafting queries. More complex querying practices are reviewed as well. This chapter will give you the skills to begin writing queries.
Chapter 6, Exploring Data with Notebooks, Understand the value of using notebooks for data exploration. Better understand the notebook options in Google Cloud. This chapter includes a hands-on exercise: analyzing Google Trends data in Workbench.
Chapter 7, Further Exploring and Visualizing Data, Better understand data attributes, discover patterns, and communicate findings effectively. Learn common practices for exploring data and review techniques and tools to analyze and visualize your data. This chapter includes a hands-on exercise: creating visualizations with Looker Studio.
Chapter 8, Data Preparation Tools, Explore approaches and tools that can be used with BigQuery for data preparation tasks to improve data quality.
Chapter 9, Cleansing and Transforming Data, Review cleaning and transforming data in greater detail to optimize table data after loading and initial exploration. Learn the skills to handle situations that you will encounter as you refine query results and reporting accuracy.
Chapter 10, Best Practices for Data Preparation, Optimization and Cost Control, Get introduced to the cost control and optimization features of BigQuery. Learn how to use BigQuery in a cost-effective way.
Chapter 11, Hands-On Exercise – Analyzing Advertising Data, This hands-on exercise presents a use case that includes sales, marketing, and advertising data. Follow along with the exercise to learn how to analyze and prepare advertising data, and use the steps as a repeatable process with your real data.
Chapter 12, Hands-On Exercise – Analyzing Transportation Data, This hands-on exercise presents a use case with vehicle data. Follow along with the exercise to learn how to analyze and prepare transportation data; the steps presented can be replicated with real data.
Chapter 13, Hands-On Exercise – Analyzing Customer Support Data, This hands-on exercise presents a use case with customer support data. Two different customer support data sources are used, along with BigQuery ML sentiment analysis, to better understand customer service data.
Chapter 14, Summary of Key Points, Future Directions, Recap the key points discussed throughout the book. Look into the future and learn about the emerging trends and transformative directions that will shape the landscape of data exploration, preparation, and analytics with BigQuery.
