A Translation Confidence Index for English-German MT

The quality of Machine Translation output can vary a great deal from sentence to sentence, even for the same MT system. A facility for automatically rating the confidence in the MT system’s translation of each segment would vastly improve the usability of the MT system. Obviously, such a confidence index must depend in part on the differences between the source and target languages. In this paper we outline the nature of a translation confidence index (the TCI), which is tunable to different language-pair MT systems, and we then focus on the interesting contrastive linguistic issues in tuning the TCI to the case of English-German.

By: Arendse Bernth, Claudia Gdaniec

Published in: RC22403 in 2000

LIMITED DISTRIBUTION NOTICE:

This Research Report is available. This report has been submitted for publication outside of IBM and will probably be copyrighted if accepted for publication. It has been issued as a Research Report for early dissemination of its contents. In view of the transfer of copyright to the outside publisher, its distribution outside of IBM prior to publication should be limited to peer communications and specific requests. After outside publication, requests should be filled only by reprints or legally obtained copies of the article (e.g., payment of royalties).

Questions about this service can be mailed to reports@us.ibm.com.