Machine learning and, in particular, neural networks, have made great advances in the last few years for products that are used by millions of people, most notably in speech recognition, image recognition, and recently in neural machine translation. Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference.