Programming for Corpus Linguistics with Python and Dataframes


Programming for Corpus Linguistics with Python and Dataframes (Elements in Corpus Linguistics)
Author: Daniel Keller (Author)
Publisher finelybook 出版社:‏ Cambridge University Press
Publication Date 出版日期:‏ 2024-07-31
Language 语言: English
Print Length 页数: 75 pages
ISBN-10: 1009486780
ISBN-13: 9781009486781

Book Description

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

Book Description

This Element offers an introduction to programming for Corpus Linguists in the Python language using dataframes.

Amazon page

相关文件下载地址

PDF | 1.6 MB
下载地址 Download解决验证以访问链接!
打赏
未经允许不得转载:finelybook » Programming for Corpus Linguistics with Python and Dataframes

评论 抢沙发

觉得文章有用就打赏一下

您的打赏,我们将继续给力更多优质内容

支付宝扫一扫

微信扫一扫