Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS
Authors: Matthew Windham
ISBN-10: 1629598429
ISBN-13: 9781629598420
Released: 2018-09-06
Print Length 页数: 166 pages
Book Description
Unlock the power of regular expressions and entity resolution to transform your analytics projects
Unstructured data is the most voluminous form of data in the world,and analysts rarely receive it in perfect condition for processing. In other words,textual data needs to be cleaned,transformed,and enhanced before value can be derived from it. Unstructured Data Analysis: Entity Resolution and Regular Expressions in SAS® shows SAS programmers of virtually all skill levels how to harness the robust power of regular expressions and entity resolution within the SAS programming language for a wide array of everyday applications of unstructured data analyses.
This book uses a practical,examples-based approach to present techniques for unstructured data processing and provides the foundational information needed to perform advanced applications. Beginning with regular expressions in SAS,readers will progress to learning the building blocks of Entity Resolution Analytics including entity extraction,ETL,entity resolution,network mapping and analysis,and management concepts. Filled with motivational examples and helpful guidelines,this book is a critical reference for every analytics professional who works with unstructured data.