Encouraging Consistent Translation Choices

Ferhan Ture1,  Douglas W. Oard2,  Philip Resnik3
1Department of Computer Science, University of Maryland, 2College of Information Studies and UMIACS, University of Maryland, 3Department of Linguistics and UMIACS, University of Maryland


Abstract

It has long been observed that monolingual text exhibits a tendency toward "one sense per discourse," and it has been argued that a related "one translation per discourse" constraint is operative in bilingual contexts as well. In this paper, we introduce a novel method using forced decoding to confirm the validity of this constraint, and we demonstrate that it can be exploited in order to improve machine translation quality. Three ways of incorporating such a preference into a hierarchical phrase-based MT model are proposed, and the approach where all three are combined yields the greatest improvements for both Arabic-English and Chinese-English translation experiments.