Corpora for bilingual terminology extraction in cybersecurity domain

Collection:
Scientific publications
Document Type:
Part of the book
Language:
English
Summary / Abstract:

Keywords: Terminology; Bilingual term extraction; Cybersecurity; Linguistic resources; Parallel resources; CLARIN.

The paper presents English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain, compiled within the framework of the DVITAS project. It argues that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it makes it possible to exploit the strengths of comparable and parallel resources efficiently while avoiding their weaknesses. Special attention is given to the open nature of the data, which is achieved by publishing the data in the CLARIN-LT repository. [From the publication]

Permalink:
https://www.lituanistika.lt/content/96610
Updated:
2023-08-03 23:18:02