Multimodal Machine Translation with Reinforcement Learning

Qian, Xin; Zhong, Ziyi; Zhou, Jieli

Computer Science > Computation and Language

arXiv:1805.02356 (cs)

[Submitted on 7 May 2018]

Title:Multimodal Machine Translation with Reinforcement Learning

Authors:Xin Qian, Ziyi Zhong, Jieli Zhou

View PDF

Abstract:Multimodal machine translation is one of the applications that integrates computer vision and language processing. It is a unique task given that in the field of machine translation, many state-of-the-arts algorithms still only employ textual information. In this work, we explore the effectiveness of reinforcement learning in multimodal machine translation. We present a novel algorithm based on the Advantage Actor-Critic (A2C) algorithm that specifically cater to the multimodal machine translation task of the EMNLP 2018 Third Conference on Machine Translation (WMT18). We experiment our proposed algorithm on the Multi30K multilingual English-German image description dataset and the Flickr30K image entity dataset. Our model takes two channels of inputs, image and text, uses translation evaluation metrics as training rewards, and achieves better results than supervised learning MLE baseline models. Furthermore, we discuss the prospects and limitations of using reinforcement learning for machine translation. Our experiment results suggest a promising reinforcement learning solution to the general task of multimodal sequence to sequence learning.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Multiagent Systems (cs.MA); Multimedia (cs.MM)
Cite as:	arXiv:1805.02356 [cs.CL]
	(or arXiv:1805.02356v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1805.02356

Submission history

From: Jieli Zhou [view email]
[v1] Mon, 7 May 2018 06:12:32 UTC (1,506 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2018-05

Change to browse by:

cs
cs.AI
cs.IR
cs.MA
cs.MM

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xin Qian
Ziyi Zhong
Jieli Zhou

export BibTeX citation

Computer Science > Computation and Language

Title:Multimodal Machine Translation with Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Multimodal Machine Translation with Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators