Faculty of Arts, Lj. (FFLJ)
  • Identifying false friends between closely related languages [Electronic resource]
    Ljubešić, Nikola, 1979- ; Fišer, Darja, 1978-
    In this paper we present a corpus-based approach to automatic identification of false friends for Slovene and Croatian, a pair of closely related languages. By taking advantage of the lexical overlap ... between the two languages, we focus on measuring the difference in meaning between identically spelled words by using frequency and distributional information. We analyze the impact of corpora of different origin and size together with different association and similarity measures and compare them to a simple frequency-based baseline. With the best performing setting we obtain very good average precision of 0.973 and 0.883 on different gold standards. The presented approach works on non-parallel datasets, is knowledge-lean and language-independent, which makes it attractive for natural language processing tasks that often lack the lexical resources and cannot afford to build them by hand.
    Type of material - conference contribution
    Publish date - 2013
    Language - English
    COBISS.SI-ID - 52673634