Slavic computational and corpus linguistics

Dagmar Divjak, Tomaz Erjavec, Serge Sharoff

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)
153 Downloads (Pure)

Abstract

In this paper, we focus on corpus-linguistic studies that address theoretical questions and on computational linguistic work on corpus annotation, that makes corpora useful for linguistic work. First, we discuss why the corpus linguistic approach was discredited by generative linguists in the second half of the 20th century, how it made a comeback through advances in computing and was adopted by usage-based linguistics at the beginning of the 21st century. Then, we move on to an overview of necessary and common annotation layers and the issues that are encountered when performing automatic annotation, with special emphasis on Slavic languages. Finally, we survey the types of research requiring corpora that Slavic linguists are involved in world-wide, and the resources they have at their disposal.
Original languageEnglish
Pages (from-to)171-199
JournalJournal of Slavic Linguistics (JSL)
Volume25
Issue number2
DOIs
Publication statusPublished - 31 Oct 2017

Fingerprint

Dive into the research topics of 'Slavic computational and corpus linguistics'. Together they form a unique fingerprint.

Cite this