Identification of multiword expressions for Latvian and Lithuanian: hybrid approach

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygų dalys / Parts of the books
Language:
Anglų kalba / English
Title:
Identification of multiword expressions for Latvian and Lithuanian: hybrid approach
In the Book:
EACL 2017: 13th workshop on multiword expressions, April 4, 2017 Valencia, Spain: proceedings of the workshop. P. 97-101.. Stroudsburg : Association for Computational Linguistics, 2017
Summary / Abstract:

ENWe discuss an experiment on automatic identification of bi-gram multiword expressions in parallel Latvian and Lithuanian corpora. Raw corpora, lexical association measures (LAMs) and supervised machine learning (ML) are used due to deficit and quality of lexical resources (e.g., POS-tagger, parser) and tools. While combining LAMs with ML is rather effective for other languages, it has shown some nice results for Lithuanian and Latvian as well. Combining LAMs with ML we have achieved 92,4% precision and 52,2% recall for Latvian and 95,1% precision and 77,8% recall for Lithuanian.

Permalink:
https://www.lituanistika.lt/content/77651
Updated:
2026-02-25 13:38:09
Metrics:
Views: 28
Export: