Akademska digitalna zbirka SLovenije - logo
Slovenian Academy of Sciences and Arts, Lj. (SAZU)
  • GDEX for Slovene [Elektronski vir]
    Kosem, Iztok ; Husak, Milos ; McCarthy, Diana
    Good Dictionary Examples or GDEX is a tool in the Sketch Engine designed to help lexicographers with identifying dictionary examples by ranking sentences according to how likely they are to be good ... candidates. The ranking is done automatically using various syntactic and lexical features. So far, only GDEX for English has been available. This paper presents the design and evaluation of Slovene GDEX, which was used for finding good examples for the new lexical database of Slovene, one of the activities in the Communication in Slovene project. Several different GDEX configurations were designed, evaluated and compared. The evaluation involved examining sentences of lemmas belonging to different word classes. Good sentences were logged for subsequent analysis with external data-mining software, WEKA. The observed behaviour was then usedto adjust the parameters of the GDEX classifiers. We believe that the procedure of identifying features of good examples and their values, describedin this paper, can be used for the development of GDEX for any language.
    Type of material - conference contribution ; adult, serious
    Publish date - 2011
    Language - english
    COBISS.SI-ID - 33344045