The longstanding goal of multi-lingual learning has been to develop a
universal cross-lingual model that can withstand the changes in multi-lingual
data distributions. However, most existing models assume full access to the
target languages in advance, whereas in realistic scenarios th