Text Analysis in Python for Social Scientists (Elements in Quantitative and Computational Methods for the Social Sciences) New Edition
by Dirk Hovy (Author)
Publisher: Cambridge University Press; New edition (March 17, 2022)
Language: English
Pages: 102
ISBN-10: 1108958508
ISBN-13: 9781108958509
Text contains a wealth of information about a wide variety of sociocultural constructs. Automated prediction methods can infer these quantities (sentiment analysis is probably the best-known application). However, there is virtually no limit to the kinds of things we can predict from text: power, trust, and misogyny are all signaled in language. These algorithms easily scale to corpus sizes infeasible for manual analysis. Prediction algorithms have become steadily more powerful, especially with the advent of neural network methods. However, applying these techniques usually requires profound programming knowledge and machine learning expertise. As a result, many social scientists do not apply them. This Element provides the working social scientist with an overview of the most common methods for text classification, an intuition of their applicability, and Python code to execute them. It covers both the ethical foundations of such work and the emerging potential of neural network methods.