UP - logo
FF, Osrednja humanistična knjižnica, Ljubljana (FFLJ)
  • Addressing polysemy in bilingual lexicon extraction from comparable corpora [Elektronski vir]
    Fišer, Darja, 1978- ; Ljubešić, Nikola, 1979- ; Kubelka, Ozren
    This paper presents an approach to extract translation equivalents from comparable corpora for polysemous nouns. As opposed to the standard approachesthat build a single context vector for all ... occurrences of a given headword, we first disambiguate the headword with third-party sense taggers and then build a separate context vector for each sense of the headword. Sincestate-of-the-art word sense disambiguation tools are still far from perfect, we also tried to improve the results by combining the sense assignments provided by two different sense taggers. Evaluation of the resultsshows that we outperform the baseline (0.473) in all the settings we experimented with, even when using only one sense tagger, and that the best-performing results are indeed obtained by taking into account the intersection of both sense taggers (0.720).
    Vrsta gradiva - prispevek na konferenci
    Leto - 2012
    Jezik - angleški
    COBISS.SI-ID - 50058338