LTReikšminiai žodžiai: Klasteris; Anotavimas; Kompiuterinės lingvistikos centras; Skaitmeniniai lietuvių kalbos ištekliai; Tekstynas; Centre of Computational Linguistics; Annotation; Cluster; Corpus; Lithuanian language recourses.
ENResearch and activities related to Lithuanian language resources started at Vytautas Magnus University in 1994 when the Centre of Computational Linguistics was founded in the Faculty of Humanities (http://donelaitis.vdu.lt). It took care of the compilation of the big „Corpus of the Contemporary of the Lithuanian Language“ (100 million running words). Recently the big corpus was supplemented with its spoken component. Lithuanian language resources had to be processed with language specific tools. Quite a few of them were created by researchers of the CCL: a corpus query system and collocation extraction tool, a lemmatiser, a multifunctional tool called „Morfolema“. The most recent tools comprise a system of morphological annotation and disambiguation, a tool for automatic textual function detection, the internet program of automatic accentuation a music (mono singer) transcription „Solo Explorer“ are available on the CCL website. The greatest achievement is MT system from English into the Lithuanian language meant for translation of internet texts (http://vertimas.vdu.lt). The CCL hosts researchers that deal with both fundamental and applied research that is necessary for the computerization of the Lithuanian language. The most prominent trend is corpusbased and corpus-driven analysis of Lithuanian words and collocations, automated analysis of the Lithuanian grammar, computer-aided text analysis and translation, speech annotation and recognition, analysis of the phonetic and phonotactic characteristics of the Lithuanian speech. [From the publication]