Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit
Nikolai Karpov, Qin Zhang
[AAAI-22] Main Track
Abstract:
Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. In this paper, we give efficient algorithms for {\em pure exploration} in MNL-bandit. Our algorithms achieve {\em instance-sensitive} pull complexities. We also complement the upper bounds by an almost matching lower bound.
Introduction Video
Sessions where this paper appears
-
Poster Session 5
Sat, February 26 12:45 AM - 2:30 AM (+00:00)
Blue 1
-
Poster Session 10
Sun, February 27 4:45 PM - 6:30 PM (+00:00)
Blue 1