Latviešu-lietuviešu-latviešu paralēlo tekstu korpusa izveide

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Straipsnis / Article
Language:
Latvių kalba / Latvian
Title:
Latviešu-lietuviešu-latviešu paralēlo tekstu korpusa izveide
Alternative Title:
Building Latvian-Lithuanian-Latvian parallel corpus
In the Journal:
Vārds un tā pētišanas aspekti. 2012, 16 (2), p. 180-188
Summary / Abstract:

LTReikšminiai žodžiai: Baltų kalbos; Latvių kalba; Lygiagretusis tekstynas; Paralelinis tekstynas; Tekstų atranka; Baltic languages; Latvian language; Latvian, selection of texts; Lithuanian; Parallel corpus.

ENThe Institute of Mathematics and Computer Science in collaboration with Vytautas Magnus University is building the Latvian-Lithuanian-Latvian Parallel Corpus (January 2010). The Latvian-Lithuanian-Latvian Parallel Corpus will contain ~8 million running words; it will be bidirectional, aligned on the sentence level. The parallel corpus will be freely available on the web. The building of a parallel corpus is not an easy task, especially for small languages like Latvian and Lithuanian. There are several steps in the development of a parallel corpus - set up the text selection criteria, recognition and selection of texts and preparation (digitalization and alignment) of texts. There are some text selection criteria set up, for instance, time, media, originals, etc. Based on these criteria, the necessary texts were started to acquire from different. [From the publication]

ISSN:
1407-4737
Related Publications:
Permalink:
https://www.lituanistika.lt/content/64951
Updated:
2019-12-14 14:51:22
Metrics:
Views: 39
Export: