ALL libraries (COBIB.SI union bibliographic/catalogue database)
  • Discovering automated lexicography [Electronic resource] : the case of Slovene lexical database
    Gantar, Polona ; Kosem, Iztok ; Krek, Simon, 1967-
    In this paper, we describe the compilation of the Slovene Lexical Database, with the main focus on developing the methodology to improve the tools used for lexicographic analysis and to introduce ... automatic data extraction in the lexicographic process. The semi-automated approach, which was devised in the last stages of database compilation, involved extracting corpus data, i.e. grammatical relations, collocations, examples, and grammatical labels, and conducting lexicographic analysis in the dictionary-writing system rather than in the corpus tool. An evaluation that compared the manual approach with the semi-automated approach showed that the semi-automated approach is much quicker and presents the lexicographers with almost all the information they identified as relevant during the manual analysis, as well as additional potentially relevant information for the dictionary entry. The final section of the paper proposes a few avenues for improvement of the semi-automated approach, including the implementation of crowdsourcing and additional post-processing of automatically extracted data.
    Material type - e-article
    Year - 2016
    Language - English
    COBISS.SI-ID - 60424034