Abstract
Recent work on non-autoregressive neural machine translation (NAT) that leverages alignment information to explicitly reduce the modality of target distribution has reported comparable performance with counterparts that tackle multi-modality problem by implicitly modeling dependencies. Effectiveness in handling alignment is vital for models that follow this approach, where a token reordering mechanism is typically involved and plays a vital role. We review the reordering capability of the respective mechanisms in recent NAT models, and our experimental results show that their performance is sub-optimal. We propose to learn a non-autoregressive language model (NALM) based on transformer which can be combined with Viterbi decoding to achieve better reordering performance. We evaluate the proposed NALM using the PTB dataset where sentences with words permuted in different ways are expected to have their ordering recovered. Our empirical results show that the proposed method can outperform the state-of-the-art reordering mechanisms under different word permutation settings, with a 2-27 BLEU improvement, suggesting high potential for word alignment in NAT.
Original language | English |
---|---|
Title of host publication | Findings of the Association for Computational Linguistics: EMNLP 2022 |
Editors | Yoav Goldberg, Zornitsa Kozareva, Yue Zhang |
Publisher | Association for Computational Linguistics (ACL) |
Pages | 2327-2333 |
Number of pages | 7 |
Publication status | Published - Dec 2022 |
Event | The 2022 Conference on Empirical Methods in Natural Language Processing - Abu Dhabi, United Arab Emirates Duration: 7 Dec 2022 → 11 Dec 2022 https://2022.emnlp.org/ https://aclanthology.org/events/emnlp-2022/ https://aclanthology.org/volumes/2022.findings-emnlp/ |
Conference
Conference | The 2022 Conference on Empirical Methods in Natural Language Processing |
---|---|
Abbreviated title | EMNLP 2022 |
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 7/12/22 → 11/12/22 |
Internet address |
Scopus Subject Areas
- Computational Theory and Mathematics
- Computer Science Applications
- Information Systems
User-Defined Keywords
- Neural machine translation
- Non-autoregressive alignment
- Word reordering