Volume 9 Number 12 (Dec. 2014)
Home > Archive > 2014 > Volume 9 Number 12 (Dec. 2014) >
JCP 2014 Vol.9(12): 2780-2786 ISSN: 1796-203X
doi: 10.4304/jcp.9.12.2780-2786

A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation

Chenggang Mi1, 2, Yating Yang1, Xi Zhou1, Lei Wang1, Xiao Li1 and Eziz Tursun1, 2
1Xinjiang Technical Institute of Physics and Chemistry of Chinese Academy of Sciences,Urumqi 830011, China
2University of Chinese Academy of Sciences, Beijing 100049, China


Abstract—In statistical machine translation, large amount of unreasonable phrase pairs in a phrase table can affect the decoding efficiency and the overall translation performance, especially in Uyghur-Chinese machine translation. In this paper, we present a novel phrase table filtering model based on binary classification, which consider differences between Uyghur and Chinese, and draw lessons from binary classification in machine learning. In our model, four features are considered: 1) Difference in length between source and target phrase; 2) Proportion of translated words in phrase pairs; 3) Proportion of symbol words; 4) Average number of co-occurrence words in training corpus. We use this model to generate a filtered phrase table. Experimental results show that this new filtering model can improve the performance and efficiency of our current Uygur-Chinese machine translation system.

Index Terms—Uyghur-Chinese machine translation, Phrase table filtering, Binary classification.

[PDF]

Cite: Chenggang Mi, Yating Yang, Xi Zhou, Lei Wang, Xiao Li and Eziz Tursun, "A Phrase Table Filtering Model Based on Binary Classification for Uyghur-Chinese Machine Translation," Journal of Computers vol. 9, no. 12, pp. 2780-2786, 2014.

General Information

ISSN: 1796-203X
Abbreviated Title: J.Comput.
Frequency: Bimonthly
Editor-in-Chief: Prof. Liansheng Tan
Executive Editor: Ms. Nina Lee
Abstracting/ Indexing: DBLP, EBSCO,  ProQuest, INSPEC, ULRICH's Periodicals Directory, WorldCat,etc
E-mail: jcp@iap.org
  • Nov 14, 2019 News!

    Vol 14, No 11 has been published with online version   [Click]

  • Mar 20, 2020 News!

    Vol 15, No 2 has been published with online version   [Click]

  • Dec 16, 2019 News!

    Vol 14, No 12 has been published with online version   [Click]

  • Sep 16, 2019 News!

    Vol 14, No 9 has been published with online version   [Click]

  • Aug 16, 2019 News!

    Vol 14, No 8 has been published with online version   [Click]

  • Read more>>