Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling

Collection:
Scientific publications
Document Type:
Part of a book
Language:
English
Title:
Workflow reversal and data wrangling in multilingual diachronic analysis and linguistic linked open data modelling
Summary / Abstract:

The article deals with data wrangling in a multilingual collection intended for diachronic analysis and linguistic linked open data modelling for tracing concept change over time. Two types of static word embeddings are used: word2vec (French and Hebrew data sets), and fastText (Latin and Lithuanian data sets). We model examples from these embeddings via the OntoLex-FrAC formalism. To address the challenge of heterogeneity, we use a minimalist workflow design allowing for both convergence and flexibility in attaining the project goals.

Permalink:
https://www.lituanistika.lt/content/111779
Updated:
2024-11-18 15:42:58