Bilingual parallel corpora for English, Czech and Lithuanian

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygos dalis / Part of the book
Language:
Anglų kalba / English
Title:
Bilingual parallel corpora for English, Czech and Lithuanian
Summary / Abstract:

LTReikšminiai žodžiai: Derinimo įrankis; Lygiagreti korpuso užklausa; Lygiavimo įrankis; Paralelieji tekstynai; Paralelinis korpusas; Paraleliojo tekstyno užklausa; Alignment tool; Parallel corpora; Parallel corpus query.

ENThe paper presents the online parallel corpus that was compiled at the Centre of Computational Linguistics of Vytautas Magnus University. The corpus consists of 4 parts: Czech-Lithuanian, Lithuanian-Czech, English-Lithuanian, and Lithuanian-English. The paper gives an introduction to the most important steps in the process of compiling the bilingual corpora. The aligning tool Aligner 2.0.6.7 is presented in the paper. It has been observed that initial paragraph alignment prior to the sentence alignment makes the alignment process easier and more precise. We also introduce on-line search possibilities of this corpus. The article exemplifies the usage of the bilingual corpus and contrasts that to the usage of traditional bilingual dictionaries. [From the publication]

ISBN:
9789955704539
Related Publications:
Permalink:
https://www.lituanistika.lt/content/71699
Updated:
2019-12-15 12:21:03
Metrics:
Views: 16
Export: