Preliminary Program

Learning principled bilingual mappings of word embeddings while preserving monolingual invariance

Mikel Artetxe¹, Gorka Labaka², Eneko Agirre²
¹University of the Basque Country, ²University of the Basque Country (UPV/EHU)

Abstract

Mapping word embeddings of different languages into a single space has multiple applications. In order to map from a source space into a target space, a common approach is to learn a linear mapping that minimizes the distances between equivalences listed in a bilingual dictionary. In this paper, we propose a framework that generalizes previous work, provides an efficient exact method to learn the optimal linear transformation and yields the best bilingual results in translation induction while preserving monolingual performance in an analogy task.