Mastering Voice Interfaces: Creating Great Voice Apps for Real Users

Mastering Voice Interfaces: Creating Great Voice Apps for Real Users

Mastering Voice Interfaces: Creating Great Voice Apps for Real Users

Author:Ann Thymé-Gobbel (Author), Ann Thymé-Gobbel, Charles Jankowski (Contributor), Charles Jankowski

Publisher finelybook 出版社:‏ Apress

Publication Date 出版日期: 2021-05-30

Edition 版本:‏ 1st ed.

Language 语言: English

Print Length 页数: 720 pages

ISBN-10: 1484270045

ISBN-13: 9781484270042

Book Description

Build great voice apps of any complexity for any domain by learning both the how’s and why’s of voice development. In this book you’ll see how we live in a golden age of voice technology and how advances in automatic speech recognition (ASR), natural language processing (NLP), and related technologies allow people to talk to machines and get reasonable responses. Today, anyone with computer access can build a working voice app. That democratization of the technology is great. But, while it’s fairly easy to build a voice app that runs, it’s still remarkably difficult to build a great one, one that users trust, that understands their natural ways of speaking and fulfills their needs, and that makes them want to return for more.

We start with an overview of how humans and machines produce and process conversational speech, explaining how they differ from each other and from other modalities. This is the background you need to understand the consequences of each design and implementation choice as we dive into the core principles of voice interface design. We walk you through many design and development techniques, including ones that some view as advanced, but that you can implement today. We use the Google development platform and Python, but our goal is to explain the reasons behind each technique such that you can take what you learn and implement it on any platform.

Readers of Mastering Voice Interfaces will come away with a solid understanding of what makes voice interfaces special, learn the core voice design principles for building great voice apps, and how to actually implement those principles to create robust apps. We’ve learned during many years in the voice industry that the most successful solutions are created by those who understand both the human and the technology sides of speech, and that both sides affect design and development. Because we focus on developing task-oriented voice apps for real usersin the real world, you’ll learn how to take your voice apps from idea through scoping, design, development, rollout, and post-deployment performance improvements, all illustrated with examples from our own voice industry experiences.

What You Will Learn

  • Create truly great voice apps that users will love and trust
  • See how voice differs from other input and output modalities, and why that matters
  • Discover best practices for designing conversational voice-first applications, and the consequences of design and implementation choices
  • Implement advanced voice designs, with real-world examples you can use immediately.
  • Verify that your app is performing well, and what to change if it doesn’t

Who This Book Is For

Anyone curious about the real how’s and why’s of voice interface design and development. In particular, it’s aimed at teams of developers, designers, and product owners who need a shared understanding of how to create successful voice interfaces using today’s technology. We expect readers to have had some exposure to voice apps, at least as users.

Editorial Reviews

From the Back Cover

We live in a golden age of voice technology. Advances in automatic speech recognition (ASR), natural language processing (NLP) and other technologies have made it extremely viable for people to be talking to machines and getting reasonable answers. Platforms like Amazon Alexa and Google Home, and the associated tools, have made it so anyone can build a voice app, and this is excellent. What we have seen though is that it’s fairly easy to build a voice app, but still remarkably difficult to build a great app, one that gets the user what they need, and hopefully the user comes back for more.
In Mastering Voice Interfaces we want to show you how to build great voice apps. We start with the basics of voice interfaces, and how they are different from others, then dive into basic design principles that we’ve learned in many years building these apps in the industry. As we cover a design principle, we’ll also demonstrate how to implement it with one of the established voice platforms (Google Home), and show how, though the tools are great, you don’t have to go too far to have to do some custom work to get what you really want. We’ll walk through many design and development techniques that some would view as advanced, but that can make a huge difference in the quality of the app.
Readers of Mastering Voice Interfaces will come away with a very good understanding of what makes voice interfaces so special, learn the basic design principles are for building great voice apps, and how to actually implement those principles and create working apps.
What you will learn:

  • What makes voice special and different from other input and output modalities, and why that matters.
  • What the best practices for the various components of the voice-first creation process
  • What are the consequences of design and implementation choices
  • How to create truly great voice apps that users will love


Who this book is for We expect readers to have had some exposure to voice apps, at least as users. The book is written for anyone who wants a deeper understanding of the how’s and why’s of voice interface design and development. For team of developers, designers, product owners who need a shared understanding of voice interfaces.

About the Author

Ann Thymé-Gobbel’s career has focused on how people use speech and natural language to communicate with each other and with technology. After completing her PhD in cognitive science and linguistics from UC San Diego, she’s held a broad set of voice-related UI/UX design roles in both large corporations and small start-ups, working with diverse teams in product development, client project engagements, and R&D. Her past work includes design, data analysis and establishing best practices at Nuance, voice design for mobile and in-home devices at Amazon Lab 126, and creating natural language conversations for multimodal healthcare apps at 22otters. Her research has covered automatic language detection, error correction, and discourse structure. She is currently Director of UI/UX Design at Loose Cannon Systems, the team bringing to market Milo, a handsfree wearable communicator. Ann never stops doing research: she collects and analyzes data at every opportunity and enjoys sharing herfindings with others, having presented and taught at conferences internationally.

Charles Jankowski has over 30 years’ experience in industry and academia developing applications and algorithms for real-world users incorporating advanced speech recognition, speaker verification, and natural language technologies. He has used state-of-the-art machine learning processes and techniques for data analysis, performance optimization, and algorithm development. Charles has highly in-depth technical experience with state-of-the-art technologies, effective management of cross-functional teams for all facets of application deployment, and outstanding relationships with clients. Currently, he is Director of NLP at Brain Technologies, creating the Natural iOS application with which you can “Say it and Get it.” Previously he was Director of NLP and Robotics at CloudMinds, Director of Speech and Natural Language at 22otters, Senior Speech Scientist at Performance Technology Partners, and Director of Professional Services at Nuance. He has also been an independent consultant. Charles holds S.B., S.M., and Ph.D. degrees from MIT, all in electrical engineering.


Amazon Page

下载地址

PDF, EPUB | 39 MB | 2021-06-28

请登录以查看全部内容 登录

此内容查看价格为12积分(VIP免费),请先
打赏
未经允许不得转载:finelybook » Mastering Voice Interfaces: Creating Great Voice Apps for Real Users

评论 2

  1. #1

    上链接

    348028622周前 (07-22)回复
    • 已更新

      admin2周前 (07-23)回复

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫