UP - logo
E-viri
Recenzirano Odprti dostop
  • Extracting Parallel Sentenc...
    Shi, Xiayang; Liu, Xinyi; Yu, Zhenqiang; Cheng, Pei; Xu, Chun

    Journal of physics. Conference series, 01/2022, Letnik: 2171, Številka: 1
    Journal Article

    Abstract At present, machine translation in the market depends on parallel sentence corpus, and the number of parallel sentences will affect the performance of machine translation, especially in low resource corpus. In recent years, the use of non parallel corpora to learn cross language word representation as low resources and less supervision to obtain bilingual sentence pairs provides a new idea. In this paper, we propose a new method. First, we create cross domain mappings in a small number of single languages. Then a classifier is constructed to extract bilingual parallel sentence pairs. Finally, we prove the effectiveness of our method in Uygur Chinese low resource language by using machine translation, and achieve good results.