Current Linguistic Corpora in the Western Balkans: The History, the Current State, and the Future

Authors

  • Nikola Dobrić

Keywords:

južnoslovanski jeziki, zahodni Balkan, korpusna lingvistika, korpusi, jezikovni korpusi, slovenščina, hrvarščina, srbščina, zgodovinski pregledi, South Slavic languages, West Balkan, corpus linguistic, corpora, linguistic corpora, language resources, history, natural language processing, Slovenian languge, Croatian language, Serbian language

Abstract

The West Balkans have had a rich history in developing language corpora. The first electronic corpus in the region was created only a few years after thevery first one in the world, while the idea of developing electronic language resources dates even further back. This early development of natural language processing was somewhat hampered by the unfortunate events of the 1990s, but in the last two decades there has been some substantial improvement in the development of the West Balkan language corpora. The paper presents a historical overview of the language corpora development in the region in the period from 1950 to 1990 as well as its current state and future prospects.

Downloads

Published

2012-04-15

How to Cite

Dobrić, N. (2012) “Current Linguistic Corpora in the Western Balkans: The History, the Current State, and the Future”, Slavistična revija, 60(4), pp. 677–692. Available at: https://srl.si/ojs/srl/article/view/COBISS_ID-51417186 (Accessed: 19 May 2024).

Issue

Section

ARTICLES