On the Relation between Distributionally Robust Optimization and Data Curation (Student Abstract)

Agnieszka Słowik; Leon Bottou

[AAAI-22] Student Abstract and Poster Program - FINALIST

Abstract: Machine learning systems based on minimizing average error have been shown to perform inconsistently across notable subsets of the data, an inconsistency that a low average error over the entire dataset does not expose. In consequential social and economic applications, where data represent people, this can lead to discrimination against underrepresented gender and ethnic groups. Distributionally Robust Optimization (DRO) seemingly addresses this problem by minimizing the worst expected risk across subpopulations. We establish theoretical results that clarify the relation between DRO and the optimization of the same loss averaged on an adequately weighted training dataset. A practical implication of our results is that neither DRO nor curating the training set should be construed as a complete solution for bias mitigation.
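
The two objectives contrasted in the abstract can be illustrated with a minimal sketch. It is not the authors' code or their theoretical result: the function names, the toy losses, and the group labels are all hypothetical, and the subpopulations are assumed to be known, disjoint groups. It shows only how an average risk, a worst-group (DRO-style) risk, and a group-weighted risk are computed from the same per-example losses.

```python
# Illustrative sketch only; all names and numbers below are hypothetical.

def group_risks(losses, groups):
    """Mean loss per subpopulation; groups[i] is the group label of example i."""
    totals, counts = {}, {}
    for loss, g in zip(losses, groups):
        totals[g] = totals.get(g, 0.0) + loss
        counts[g] = counts.get(g, 0) + 1
    return {g: totals[g] / counts[g] for g in totals}


def average_risk(losses):
    """Standard objective: mean loss over the whole dataset."""
    return sum(losses) / len(losses)


def worst_group_risk(losses, groups):
    """DRO-style objective: the largest expected loss over the subpopulations."""
    return max(group_risks(losses, groups).values())


def weighted_risk(losses, groups, weights):
    """Group risks averaged with the given weights (assumed nonnegative, summing to 1)."""
    risks = group_risks(losses, groups)
    return sum(weights[g] * risks[g] for g in risks)


# Toy data: group "b" is underrepresented and has much higher loss.
losses = [0.1, 0.2, 0.1, 0.2, 0.9, 1.1]
groups = ["a", "a", "a", "a", "b", "b"]

print(average_risk(losses))              # low average hides the gap
print(worst_group_risk(losses, groups))  # worst-group risk exposes it
print(weighted_risk(losses, groups, {"a": 0.0, "b": 1.0}))
```

For a fixed model, weighting groups by their share of the data recovers the average risk, while putting all weight on the worst group recovers the worst-group risk. This elementary observation is only a starting point for the relation the abstract describes, which concerns the optimization of these objectives rather than their evaluation.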

Sessions where this paper appears

Poster Session 6

Sat, February 26 8:45 AM - 10:30 AM (+00:00)

Blue 4

Poster Session 10

Sun, February 27 4:45 PM - 6:30 PM (+00:00)

Blue 4
