DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization

Ming Zhong; Yang Liu; Yichong Xu; Chenguang Zhu; Michael Zeng

DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization

Ming Zhong, Yang Liu, Yichong Xu, Chenguang Zhu, Michael Zeng

[AAAI-22] Main Track

Keywords
Poster Session 4 @ Red 5, Poster Session 11 @ Red 5, Oral Session 11 @ Red 5, Poster Session 4, Poster Session 11, Oral Session 11

Download Paper

Enter the Virtual Venue

Abstract: Dialogue is an essential part of human communication and cooperation. Existing research mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person interactions in the real world, such as meetings or interviews, are frequently over a few thousand words. There is still a lack of corresponding research and powerful tools to understand and process such long dialogues. Therefore, in this work, we present a pre-training framework for long dialogue understanding and summarization. Considering the nature of long conversations, we propose a window-based denoising approach for generative pre-training. For a dialogue, it corrupts a window of text with dialogue-inspired noise, and guides the model to reconstruct this window based on the content of the remaining conversation. Furthermore, to process longer input, we augment the model with sparse attention which is combined with conventional attention in a hybrid manner. We conduct extensive experiments on five datasets of long dialogues, covering tasks of dialogue summarization, abstractive question answering and topic segmentation. Experimentally, we show that our pre-trained model \DialogLM significantly surpasses the state-of-the-art models across datasets and tasks.

Introduction Video

Sessions where this paper appears

Timezone

Poster Session 4

Red 5

{ "name":"DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization (Poster Session 4)", "description":"", "startDate":"02-25-2022", "endDate":"02-25-2022", "startTime": "09:00", "endTime": "10:45", "location": "Red 5", "timeZone": "US/Pacific", "options":[ "Apple", "Google", "iCal", "Microsoft365", "Outlook.com", "Yahoo" ] }

Poster Session 4
Poster Session 11

Red 5

{ "name":"DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization (Poster Session 11)", "description":"", "startDate":"02-27-2022", "endDate":"02-27-2022", "startTime": "16:45", "endTime": "18:30", "location": "Red 5", "timeZone": "US/Pacific", "options":[ "Apple", "Google", "iCal", "Microsoft365", "Outlook.com", "Yahoo" ] }

Poster Session 11
Oral Session 11

Red 5

{ "name":"DialogLM: Pre-Trained Model for Long Dialogue Understanding and Summarization (Oral Session 11)", "description":"", "startDate":"02-27-2022", "endDate":"02-27-2022", "startTime": "18:30", "endTime": "19:45", "location": "Red 5", "timeZone": "US/Pacific", "options":[ "Apple", "Google", "iCal", "Microsoft365", "Outlook.com", "Yahoo" ] }

Oral Session 11