Building of parallel and comparable cybersecurity corpora for bilingual terminology extraction

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygos dalis / Part of the book
Language:
Anglų kalba / English
Title:
Building of parallel and comparable cybersecurity corpora for bilingual terminology extraction
In the Book:
Selected papers from the CLARIN Annual Conference 2021, Virtual Event, 2021, 27-29 September / edited by Monica Monachini and Maria Eskevich. Linköping : Linköping University Electronic Press, 2022. P. 126-138. (Linköping electronic conference proceedings ; 189)
Summary / Abstract:

ENThe paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the project DVITAS. It is argued that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less-resourced languages, as it allows efficiently to combine strengths and avoid weaknesses of comparable and parallel resources. A special focus is given to the availability of sources in the cybersecurity domain and issues related to copyright-protected publications, as well as the data curation performed for building the corpora and depositing them to CLARIN-LT repository. Keywords: Bilingual Terminology Extraction, Parallel Corpus, Comparable Corpus. [From the publication]

DOI:
10.3384/ecp18912
ISBN:
9789179294441
Related Publications:
Permalink:
https://www.lituanistika.lt/content/103991
Updated:
2024-03-18 20:06:28
Metrics:
Views: 13
Export: