Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygos dalis / Part of the book
Language:
Anglų kalba / English
Title:
Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling
Summary / Abstract:

ENThe article deals with data wrangling in a multilingual collection intended for diachronic analysis and linguistic linked open data modelling for tracing concept change over time. Two types of static word embeddings are used: word2vec (French and Hebrew data sets), and fastText (Latin and Lithuanian data sets). We model examples from these embeddings via the OntoLex-FrAC formalism. To address the challenge of heterogeneity, we use a minimalist workflow design allowing for both convergence and flexibility in attaining the project goals. [From the publication]

Related Publications:
"Senosios lietuvių kalbos tekstynas" (SLIEKKAS) - nauja diachroninio tekstyno samprata / Jolanta Gelumbeckaitė, Mindaugas Šinkūnas, Vytautas Zinkevičius. Darbai ir dienos. 2012, t. 58, p. 257-278.
Permalink:
https://www.lituanistika.lt/content/111779
Updated:
2024-11-18 15:42:58
Metrics:
Views: 2
Export: