XLM-K: Improving Cross-Lingual Language Model Pre-Training with Multilingual Knowledge

Xiaoze Jiang, Yaobo Liang, Weizhu Chen, Nan Duan

[AAAI-22] Main Track
Abstract: Cross-lingual pre-training has achieved great success using monolingual and bilingual plain text corpora. However, most pre-trained models neglect multilingual knowledge, which is language-agnostic yet provides rich cross-lingual structural alignment. In this paper, we propose XLM-K, a cross-lingual language model that incorporates multilingual knowledge in pre-training. XLM-K augments existing multilingual pre-training with two knowledge tasks, namely the Masked Entity Prediction task and the Object Entailment task. We evaluate XLM-K on MLQA, NER and XNLI. Experimental results clearly demonstrate significant improvements over existing multilingual language models. The results on MLQA and NER exhibit the superiority of XLM-K on knowledge-related tasks. The success on XNLI shows the better cross-lingual transferability obtained by XLM-K. Moreover, we provide a detailed probing analysis to confirm that the desired knowledge is captured in our pre-training regimen.
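The abstract describes two auxiliary knowledge objectives added on top of standard multilingual pre-training. Below is a minimal, hypothetical PyTorch sketch of how such heads could sit alongside the usual masked-language-modeling loss; the encoder representations, the hidden and entity-vocabulary sizes, the pair-concatenation form of the entailment head, and the unweighted loss sum are all illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn as nn

class KnowledgeHeads(nn.Module):
    """Hypothetical auxiliary heads over a shared multilingual encoder's outputs."""

    def __init__(self, hidden_size=768, entity_vocab_size=50000):
        super().__init__()
        # Masked Entity Prediction: classify a masked mention span into an entity vocabulary.
        self.entity_head = nn.Linear(hidden_size, entity_vocab_size)
        # Object Entailment: score whether a (subject, relation) text matches an object
        # description, modeled here as a binary classifier over concatenated representations.
        self.entailment_head = nn.Linear(hidden_size * 2, 2)

    def masked_entity_loss(self, mention_repr, entity_labels):
        logits = self.entity_head(mention_repr)              # (num_mentions, entity_vocab)
        return nn.functional.cross_entropy(logits, entity_labels)

    def object_entailment_loss(self, subj_rel_repr, obj_repr, labels):
        pair = torch.cat([subj_rel_repr, obj_repr], dim=-1)  # (batch, 2 * hidden)
        logits = self.entailment_head(pair)                  # (batch, 2)
        return nn.functional.cross_entropy(logits, labels)


# Toy usage: random tensors stand in for encoder outputs.
heads = KnowledgeHeads()
mention_repr = torch.randn(4, 768)
entity_labels = torch.randint(0, 50000, (4,))
subj_rel_repr = torch.randn(8, 768)
obj_repr = torch.randn(8, 768)
entail_labels = torch.randint(0, 2, (8,))

mlm_loss = torch.tensor(0.0)  # placeholder for the standard multilingual MLM loss
total_loss = (mlm_loss
              + heads.masked_entity_loss(mention_repr, entity_labels)
              + heads.object_entailment_loss(subj_rel_repr, obj_repr, entail_labels))
```

This sketch only illustrates the general pattern of attaching knowledge-supervised heads to a shared encoder; the paper's actual objective formulations and training data are described in the full text.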


Sessions where this paper appears

  • Poster Session 5

    Sat, February 26 12:45 AM - 2:30 AM (+00:00)
    Red 5

  • Poster Session 12

    Mon, February 28 8:45 AM - 10:30 AM (+00:00)
    Red 5