Instance-Sensitive Algorithms for Pure Exploration in Multinomial Logit Bandit

Nikolai Karpov, Qin Zhang

[AAAI-22] Main Track
Abstract: Motivated by real-world applications such as fast fashion retailing and online advertising, the Multinomial Logit Bandit (MNL-bandit) is a popular model in online learning and operations research, and has attracted much attention in the past decade. In this paper, we give efficient algorithms for {\em pure exploration} in MNL-bandit. Our algorithms achieve {\em instance-sensitive} pull complexities. We also complement the upper bounds by an almost matching lower bound.

Introduction Video

Sessions where this paper appears

  • Poster Session 5

    Sat, February 26 12:45 AM - 2:30 AM (+00:00)
    Blue 1
    Add to Calendar

  • Poster Session 10

    Sun, February 27 4:45 PM - 6:30 PM (+00:00)
    Blue 1
    Add to Calendar