Abstract
Multilingual Neural Machine Translation (MNMT) models leverage many language pairs during training to improve translation quality for low-resource languages by transferring knowledge from high-resource languages. We study the quality of a domain-adapted MNMT model in the medical domain for English-Romanian with automatic metrics and a human error typology annotation which includes terminology-specific error categories. We compare the out-of-domain MNMT with the in-domain adapted MNMT. The in-domain MNMT model outperforms the out-of-domain MNMT in all measured automatic metrics and produces fewer terminology errors.
Original language | English |
---|---|
Title of host publication | Proceedings of the 44th Translating and the Computer (TC44) conference |
Editors | Joss Moorkens, Vilelmini Sosoni |
Place of Publication | Geneva |
Publisher | Editions Tradulex |
Pages | 27-38 |
ISBN (Electronic) | 978-2-9701733-0-4 |
Publication status | Published - Sep 2023 |
Event | Translating and the Computer - European Convention Center Luxembourg (ECCL), Luxembourg, Luxembourg Duration: 24 Nov 2022 → 25 Nov 2022 Conference number: 44 https://asling.org/tc44/ |
Conference
Conference | Translating and the Computer |
---|---|
Abbreviated title | TC44 |
Country/Territory | Luxembourg |
City | Luxembourg |
Period | 24/11/22 → 25/11/22 |
Internet address |
Austrian Fields of Science 2012
- 602051 Translation studies
Keywords
- Neural Machine Translation
- domain adaptation
- automatic metrics
- error annotation
- medical domain
- English-Romanian