When AI Difficulty Is Easy: The Explanatory Power of Predicting IRT Difficulty
Fernando Martínez-Plumed, David Castellano-Falcón, Carlos Monserrat, José Hernández-Orallo
[AAAI-22] Main Track
Abstract:
One of challenges of artificial intelligence as a whole is robustness. Many issues such as adversarial examples, out of distribution performance, Clever Hans phenomena, and the wider areas of AI evaluation and explainable AI, have to do with the following question: Did the model fail because it is a hard instance or because something else? In this paper we address this question with a generic method for estimating IRT-based instance difficulty for a wide range of AI domains covering several areas, from supervised feature-based classification to automated reasoning. We show how to estimate difficulty systematically using off-the-shelf machine learning regression models. We illustrate the usefulness of this estimation for a range of applications.
Introduction Video
Sessions where this paper appears
-
Poster Session 4
Fri, February 25 5:00 PM - 6:45 PM (+00:00)
Blue 2
-
Poster Session 7
Sat, February 26 4:45 PM - 6:30 PM (+00:00)
Blue 2
-
Oral Session 4
Fri, February 25 6:45 PM - 8:00 PM (+00:00)
Blue 2