SGD-X: A Benchmark for Robust Generalization in Schema-Guided Dialogue Systems
Harrison Lee, Raghav Gupta, Abhinav Rastogi, Yuan Cao, Bin Zhang, Yonghui Wu
[AAAI-22] Main Track
Abstract:
Zero/few-shot transfer to unseen services is a critical challenge in task-oriented dialogue research. The Schema-Guided Dialogue (SGD) dataset introduced a paradigm for enabling models to support an unlimited number of services without additional data collection or re-training through the use of schemas. Schemas describe APIs in natural language, which models consume to understand the services they need to support. However, the impact of the choice of language in these schemas on model performance remains unexplored. We address this by releasing SGD-X, a benchmark for measuring the robustness of dialogue systems to linguistic variations in schemas. SGD-X extends the SGD dataset with crowdsourced variants for every schema, where variants are semantically similar yet stylistically diverse. We evaluate two top-performing dialogue state tracking models on SGD-X and observe that neither generalizes well across schema variants, as measured by joint goal accuracy and a novel schema-sensitivity metric. Finally, we present a simple model-agnostic data augmentation method to improve schema robustness and zero-shot generalization to unseen services.
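The sketch below illustrates the two ideas the abstract leans on: a natural-language schema alongside an SGD-X-style paraphrased variant, and a schema-sensitivity-style score computed across variants. The schema fields are simplified assumptions in the spirit of the SGD format, and the metric shown (a coefficient of variation of a score across variants) is a hedged illustration of the idea, not the paper's published definition.

```python
# Minimal sketch (not the authors' released format) of a service schema,
# an SGD-X-style variant, and a schema-sensitivity-style metric.
from statistics import mean, pstdev

# Original schema: natural-language descriptions of a service and its slots.
schema = {
    "service_name": "Restaurants_1",
    "description": "A service for finding and reserving restaurants.",
    "slots": [
        {"name": "cuisine", "description": "Type of food served"},
        {"name": "party_size", "description": "Number of people dining"},
    ],
}

# An SGD-X-style variant: semantically similar, stylistically different wording.
schema_variant = {
    "service_name": "Restaurants_1",
    "description": "Search for restaurants and book tables.",
    "slots": [
        {"name": "cuisine", "description": "Kind of cuisine the diner wants"},
        {"name": "party_size", "description": "How many guests to seat"},
    ],
}

def schema_sensitivity(per_variant_scores):
    """Coefficient of variation of a metric (e.g. joint goal accuracy)
    across schema variants: higher values mean the model's predictions
    depend more on how the schema is worded. This mirrors the paper's
    idea; the exact formula here is an assumption."""
    mu = mean(per_variant_scores)
    if mu == 0:
        return 0.0
    return pstdev(per_variant_scores) / mu

# Example: one model's JGA evaluated against five schema variants.
print(schema_sensitivity([0.71, 0.64, 0.69, 0.58, 0.66]))
```

A robust model would score similarly on `schema` and `schema_variant`, driving the sensitivity toward zero; the data augmentation method the abstract mentions trains on multiple wordings so the model learns to rely on schema semantics rather than surface phrasing.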
Sessions where this paper appears
- Poster Session 4: Fri, February 25, 5:00 PM - 6:45 PM (+00:00), Red 5
- Poster Session 11: Mon, February 28, 12:45 AM - 2:30 AM (+00:00), Red 5