Abstract
Multilingual Neural Machine Translation (MNMT) models allow translation across
multiple languages based on a single system. We study the quality of a domain adapted MNMT model in the medical domain for English-Romanian with automatic metrics and a human error typology annotation based on the Multidimensional Quality Metrics (MQM) framework. We further expand the MQM typology to include terminology-specific error categories. We compare the out-of-domain MNMT with the in-domain adapted MNMT on a standard test dataset of abstracts from medical publications. The in-domain MNMT model outperforms the out-of-domain MNMT in all measured automatic metrics, and produces fewer errors. We also manually annotate the reference test dataset to study the quality of the reference translations, and we identify a high number of omissions, additions, and mistranslations. We therefore question the assumed accuracy of existing datasets. Finally, we compare the correlation between the COMET, BERTScore, and chrF automatic metrics with the MQM annotated translations; COMET shows a better correlation with the MQM scores.
multiple languages based on a single system. We study the quality of a domain adapted MNMT model in the medical domain for English-Romanian with automatic metrics and a human error typology annotation based on the Multidimensional Quality Metrics (MQM) framework. We further expand the MQM typology to include terminology-specific error categories. We compare the out-of-domain MNMT with the in-domain adapted MNMT on a standard test dataset of abstracts from medical publications. The in-domain MNMT model outperforms the out-of-domain MNMT in all measured automatic metrics, and produces fewer errors. We also manually annotate the reference test dataset to study the quality of the reference translations, and we identify a high number of omissions, additions, and mistranslations. We therefore question the assumed accuracy of existing datasets. Finally, we compare the correlation between the COMET, BERTScore, and chrF automatic metrics with the MQM annotated translations; COMET shows a better correlation with the MQM scores.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 24th Annual Conference of the European Association for Machine Translation |
| Subtitle of host publication | 12 – 15 June 2023 Tampere, Finland |
| Place of Publication | Tampere |
| Publisher | European Association for Machine Translation |
| Pages | 355–364 |
| Number of pages | 10 |
| ISBN (Print) | 9789520329471 |
| Publication status | Published - 12 Jun 2023 |
| Event | The 24th Annual Conference of the European Association for Machine Translation - Tampere, Tampere, Finland Duration: 12 Jun 2023 → 15 Jun 2023 https://events.tuni.fi/eamt23/ |
Exhibition
| Exhibition | The 24th Annual Conference of the European Association for Machine Translation |
|---|---|
| Abbreviated title | EAMT 2023 |
| Country/Territory | Finland |
| City | Tampere |
| Period | 12/06/23 → 15/06/23 |
| Internet address |
Austrian Fields of Science 2012
- 102001 Artificial intelligence
- 102019 Machine learning
- 602051 Translation studies
Keywords
- Machine translation
- specialised terminology
- Translation Studies
Fingerprint
Dive into the research topics of 'Quality Analysis of Multilingual Neural Machine Translation Systems and Reference Test Translations for the English-Romanian language pair in the Medical Domain'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver