Corpora for bilingual terminology extraction in cybersecurity domain

Direct Link:
Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygų dalys / Parts of the books
Language:
Anglų kalba / English
Title:
Corpora for bilingual terminology extraction in cybersecurity domain
In the Book:
CLARIN 2021: proceedings of annual conference, 27-29 September 2021. P. 11-15.. Utrecht : Utrecht University, 2021
Summary / Abstract:

ENThe paper aims at presenting English-Lithuanian corpora for bilingual term extraction (BiTE) in the cybersecurity domain within the framework of the project DVITAS. It is argued that a system of parallel, comparable, and training corpora for BiTE is particularly useful for less resourced languages, as it allows to efficiently use strengths and avoid weaknesses of comparable and parallel resources. A special focus is given to the open nature of the data, which is achieved by publishing the data in CLARIN-LT repository.

Permalink:
https://www.lituanistika.lt/content/96610
Updated:
2026-02-25 13:52:02
Metrics:
Views: 22    Downloads: 2
Export: