Efficient Embedded Decoding of Neural Network Language Models in a Machine Translation System

Francisco Zamora-Martinez; Maria Jose Castro-Bleda

doi:10.1142/S0129065718500077

Efficient Embedded Decoding of Neural Network Language Models in a Machine Translation System

Int J Neural Syst. 2018 Nov;28(9):1850007. doi: 10.1142/S0129065718500077. Epub 2018 Feb 22.

Authors

Francisco Zamora-Martinez¹, Maria Jose Castro-Bleda²

Affiliations

¹ 1 R&D Department, das-Nano S. L., Polígono Industrial Talluntxe II, Tajonar 31192, Spain.
² 2 Departamento de Sistemas Informáticos y Computación, Universitat Politècnica de València, València, Spain.

PMID: 29631501
DOI: 10.1142/S0129065718500077

Abstract

Neural Network Language Models (NNLMs) are a successful approach to Natural Language Processing tasks, such as Machine Translation. We introduce in this work a Statistical Machine Translation (SMT) system which fully integrates NNLMs in the decoding stage, breaking the traditional approach based on [Formula: see text]-best list rescoring. The neural net models (both language models (LMs) and translation models) are fully coupled in the decoding stage, allowing to more strongly influence the translation quality. Computational issues were solved by using a novel idea based on memorization and smoothing of the softmax constants to avoid their computation, which introduces a trade-off between LM quality and computational cost. These ideas were studied in a machine translation task with different combinations of neural networks used both as translation models and as target LMs, comparing phrase-based and [Formula: see text]-gram-based systems, showing that the integrated approach seems more promising for [Formula: see text]-gram-based systems, even with nonfull-quality NNLMs.

Keywords: Neural networks; embedded decoding; language modeling; machine translation; statistical machine translation.

MeSH terms

Humans
Models, Statistical
Natural Language Processing*
Neural Networks, Computer*
Translating*
Vocabulary