Kocijan, Kristina and Janjić, Marijana and Librenjak, Sara. (2017). Recognizing Diminutive and Augmentative Croatian Nouns. In: Automatic Processing of Natural-Language Electronic Texts with NooJ. Communications in Computer and Information Science, 667 . Springer International Publishing, pp. 26-36. ISBN 978-3-319-55001-5
|
PDF
(English) - Accepted Version
Download (1MB) | Preview |
Abstract
In this paper, the authors present NooJ morphological grammars for recogniz-ing Croatian diminutive and augmentative nouns for those common nouns that already exist in the Croatian NooJ dictionary. The purpose of this project is twofold. The first one is to recognize both diminutive and augmentative forms of each noun existing in our dictionary (over 20 000 common nouns) if such a form occurs in a text. The second purpose is to determine types of texts in which these words appear the most (or if they even appear) which is the reason why we divided our corpus in two thematic categories (children literature, nov-els). The results of our algorithm are high on both types of text [overall P=0.82; R=0.80; f-measure=0.81]. Although NooJ dictionary allows direct entrance of such derivations as an attribute-value description of a main noun, we have opted for the second option, i.e. writing a morphological grammar that will recognize the needed form. In this way, we are saving the space and time needed to add all the existing forms to the noun’s dictionary.
Item Type: | Book Section |
---|---|
Uncontrolled Keywords: | augmentatives; diminutives; Croatian; morphology; nouns; NooJ |
Subjects: | Information sciences Linguistics |
Departments: | Department of Information Science |
Date Deposited: | 28 Mar 2017 10:30 |
Last Modified: | 01 Mar 2018 00:15 |
URI: | http://darhiv.ffzg.unizg.hr/id/eprint/8499 |
Actions (login required)
View Item |