Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling

Armaselu, Florentina; McGillivray, Barbara; Liebeskind, Chaya; Valūnaitė-Oleškevičienė, Giedrė; Utka, Andrius; Gifu, Daniela; Khan, Anas Fahad; Apostol, Elena-Simona; Truică, Ciprian-Octavian

Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling

Link to:

VDU talpykla: straipsnio tekstas

Collection:

Mokslo publikacijos / Scientific publications

Document Type:

Knygos dalis / Part of the book

Language:

Anglų kalba / English

Title:

Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling

Authors:

In the Book:

Language, data and knowledge 2023 (LDK 2023): proceedings of the 4th conference on language, data and knowledge, 12–15 September 2023. Vienna, Austria. Vienna, 2023. P. 410-416

Subject Category:

Leksikografija / Lexicography; Daugiakalbystė / Multilingualism.

Summary / Abstract:

ENThe article deals with data wrangling in a multilingual collection intended for diachronic analysis and linguistic linked open data modelling for tracing concept change over time. Two types of static word embeddings are used: word2vec (French and Hebrew data sets), and fastText (Latin and Lithuanian data sets). We model examples from these embeddings via the OntoLex-FrAC formalism. To address the challenge of heterogeneity, we use a minimalist workflow design allowing for both convergence and flexibility in attaining the project goals. [From the publication]

Subject:

Kalbotyra / Linguistics

Related Publications:

"Senosios lietuvių kalbos tekstynas" (SLIEKKAS) - nauja diachroninio tekstyno samprata / Jolanta Gelumbeckaitė, Mindaugas Šinkūnas, Vytautas Zinkevičius. Darbai ir dienos. 2012, t. 58, p. 257-278.

Permalink:

https://www.lituanistika.lt/content/111779

Updated:

2024-11-18 15:42:58

Metrics:

Export:

Choose type:

Download

User ID:
User Password: