Lithuanian dependency parsing with rich morphological features

Collection:
Mokslo publikacijos / Scientific publications
Document Type:
Knygos dalis / Part of the book
Language:
Anglų kalba / English
Title:
Lithuanian dependency parsing with rich morphological features
In the Book:
SPMRL 2013 : proceedings of the fourth workshop on statistical parsing of morphologically rich languages. Stroudsburg: Association for computational linguistics, 2013. P. 12-21
Keywords:
LT
Gramatika / Grammar; Morfologija / Morphology; Žodžio kaityba / Inflection.
Summary / Abstract:

LTReikšminiai žodžiai: Anotavimo schema; Gramatinis nagrinėjimas; Gramatinės kategorijos; Išvystytą linksnių sistemą turinti kalba; Morfologinė analizė; Statistika; Annotation scheme; Grammatical categories; Morphological analysis; Parsing; Richly inflected language; Statistics.

ENWe present the first statistical dependency parsing results for Lithuanian, a morphologically rich language in the Baltic branch of the Indo-European family. Using a greedy transition-based parser, we obtain a labeled attachment score of 74.7 with gold morphology and 68.1 with predicted morphology (77.8 and 72.8 unlabeled). We investigate the usefulness of different features and find that rich morphological features improve parsing accuracy significantly, by 7.5 percentage points with gold features and 5.6 points with predicted features. As expected, CASE is the single most important morphological feature, but virtually all available features bring some improvement, especially under the gold condition. [From the publication]

ISBN:
9781937284978
Related Publications:
Permalink:
https://www.lituanistika.lt/content/85417
Updated:
2021-02-02 19:05:05
Metrics:
Views: 14    Downloads: 1
Export: