ALL libraries (COBIB.SI union bibliographic/catalogue database)
  • SUMAT [Electronic resource] : data collection and parallel corpus compilation for machine translation of subtitles
    Petukhova, Volha ...
    This paper describes the data collection and parallel corpus compilation activities carried out in the FP7 EU-funded SUMAT project. This project aims to develop an online subtitle translation service ... for nine European languages combined into 14 different language pairs. The collected data provide bilingual and monolingual training material for statistical machine translation engines, which will semi-automate the subtitle translation processes of subtitling companies on a large scale.
    Type of material - conference contribution
    Publish date - 2012
    Language - English
    COBISS.SI-ID - 16027926