Języki słowiańskie i litewski w korpusach równoległych Clarin-PL

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Straipsnis / Article
Language:
Lenkų kalba / Polish
Title:
Języki słowiańskie i litewski w korpusach równoległych Clarin-PL
Alternative Title:
Slavic languages and the Lithuanian language in the Clarin-PL parallel corpora
In the Journal:
Keywords:
LT
Semantika / Semantics.
Summary / Abstract:

LTReikšminiai žodžiai: Daugiakalbis gretinamasis tekstynas; Daugiakalbis lygiagretusis tekstynas; Kiekybinis apimties įvertinimas; Kiekybinis taikymo srities įvertinimas; Semantinė anotacija; Multilingual parallel corpora; Scope quantification; Semantic annotation.

ENThe Clarin Eric and ClarinPL strategic scientific purpose is to support humanistic research in a multicultural and multilingual Europe. Polish researchers put the emphasis on building a bridge between the Polish language and Polish linguistic technologies and other European languages and their linguistic technologies. So far, the Polish scientific community has mainly focused on PolishEnglish connections. ClarinPL has been developing the first and only multilingual corpora of the Polish language in conjunction with other Slavic languages and the Lithuanian language: the Polish-Bulgarian-Russian Parallel Corpus and the Polish-Lithuanian Parallel Corpus. The parallel corpora created by the ISS PAS Corpus Linguistics and Semantics Team break through the existing "canons" and allow scientists access to interlinked multilingual language resources - in the first phase limited to the languages of the three Slavic groups and the Lithuanian language. In the article, the authors present very detailed information on their original system of the semantic annotation of scope quantification in multilingual parallel corpora, hitherto unused in the subject literature. Due to the system's originality, the semantic annotation is carried out manually. Identification of particular values of scope quantification in a sentence and the hereby presented attempts of its recording are supported by longterm research conducted by an international team of linguists and computer scientists / mathematicians developing the issue of quantification of names, time and aspect in natural languages. [From the publication]

DOI:
10.11649/sfps.2016.011
ISSN:
0081-7090; 2392-2435
Related Publications:
Permalink:
https://www.lituanistika.lt/content/84924
Updated:
2020-12-17 20:21:36
Metrics:
Views: 14
Export: