IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Sai, Ananya B.; Nagarajan, Vignesh; Dixit, Tanay; Dabre, Raj; Kunchukuttan, Anoop; Kumar, Pratyush; Khapra, Mitesh M.

Computer Science > Computation and Language

arXiv:2212.10180 (cs)

[Submitted on 20 Dec 2022 (v1), last revised 3 Jul 2023 (this version, v2)]

Title:IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Authors:Ananya B. Sai, Vignesh Nagarajan, Tanay Dixit, Raj Dabre, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

View PDF

Abstract:The rapid growth of machine translation (MT) systems has necessitated comprehensive studies to meta-evaluate evaluation metrics being used, which enables a better selection of metrics that best reflect MT quality. Unfortunately, most of the research focuses on high-resource languages, mainly English, the observations for which may not always apply to other languages. Indian languages, having over a billion speakers, are linguistically different from English, and to date, there has not been a systematic study of evaluating MT systems from English into Indian languages. In this paper, we fill this gap by creating an MQM dataset consisting of 7000 fine-grained annotations, spanning 5 Indian languages and 7 MT systems, and use it to establish correlations between annotator scores and scores obtained using existing automatic metrics. Our results show that pre-trained metrics, such as COMET, have the highest correlations with annotator scores. Additionally, we find that the metrics do not adequately capture fluency-based errors in Indian languages, and there is a need to develop metrics focused on Indian languages. We hope that our dataset and analysis will help promote further research in this area.

Comments:	ACL 2023 long paper
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2212.10180 [cs.CL]
	(or arXiv:2212.10180v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2212.10180

Submission history

From: Ananya B Sai [view email]
[v1] Tue, 20 Dec 2022 11:37:22 UTC (8,894 KB)
[v2] Mon, 3 Jul 2023 14:26:38 UTC (9,897 KB)

Computer Science > Computation and Language

Title:IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation metrics for Indian Languages

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators