Using Random Perturbations to Mitigate Adversarial Attacks on NLP Models

Abigail Swenor

[AAAI-22] Undergraduate Consortium

Keywords
Poster Session 1 @ Red 5, Poster Session 5 @ Red 5, Poster Session 1, Poster Session 5

Abstract: Deep learning models have excelled at many problems in Natural Language Processing, but they remain highly vulnerable to adversarial attacks. We address this vulnerability with random perturbations such as spelling correction, synonym substitution, or dropping a word. These perturbations are applied to random words in random sentences to defend NLP models against adversarial attacks. Our defense methods return attacked models to their original accuracy within statistical significance.
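
The perturbation step described in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation: the function names `perturb_word` and `perturb_text`, the toy `SYNONYMS` and `SPELLING` tables, and the `num_sentences` parameter are all assumptions made for illustration. A real defense would presumably use a proper spell checker and a thesaurus, and would pass the perturbed text to the model being defended.

```python
import random

# Toy lookup tables, purely illustrative (assumptions, not from the paper).
SYNONYMS = {"good": ["great", "fine"], "bad": ["poor", "awful"]}
SPELLING = {"gooood": "good", "baad": "bad"}


def perturb_word(words, index, rng):
    """Apply one randomly chosen perturbation to words[index]; return a new list."""
    word = words[index]
    choice = rng.choice(["spell", "synonym", "drop"])
    if choice == "spell":
        # Spelling correction: replace the word if the toy table knows a fix.
        replacement = [SPELLING.get(word, word)]
    elif choice == "synonym":
        # Synonym substitution: pick a random synonym if one is listed.
        replacement = [rng.choice(SYNONYMS[word])] if word in SYNONYMS else [word]
    else:
        # Drop the word entirely.
        replacement = []
    return words[:index] + replacement + words[index + 1:]


def perturb_text(sentences, num_sentences, rng):
    """Perturb one random word in each of num_sentences randomly chosen sentences."""
    result = [s.split() for s in sentences]
    non_empty = [i for i, w in enumerate(result) if w]
    for i in rng.sample(non_empty, min(num_sentences, len(non_empty))):
        result[i] = perturb_word(result[i], rng.randrange(len(result[i])), rng)
    return [" ".join(w) for w in result]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
review = ["the movie was gooood", "the acting was bad", "i would watch it again"]
perturbed = perturb_text(review, num_sentences=2, rng=rng)
print(perturbed)
```

Because the sentences, words, and perturbation types are all drawn at random, an attacker cannot know in advance which of the adversarially modified words will survive to reach the model.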

Sessions where this paper appears

Poster Session 1

Thu, February 24 4:45 PM - 6:30 PM (+00:00)

Red 5

Poster Session 5

Sat, February 26 12:45 AM - 2:30 AM (+00:00)

Red 5
