Offline Interactive Recommendation with Natural-Language Feedback

Ruiyi Zhang, Tong Yu, Yilin Shen, Hongxia Jin

[AAAI-22] Main Track
Abstract: Interactive recommendation with natural-language feedback can provide richer user feedback and has demonstrated advantages over traditional recommender systems. However, the classical online paradigm involves iteratively collecting experience via interaction with users, which is expensive and risky. We propose an offline interactive recommendation to exploit arbitrary experience collected by multiple unknown devices (policies). Direct policy learning with such fixed experience suffers from the distribution shift. To tackle this issue, we leverage a neural correction estimator via adversarial training, which can be seamlessly incorporated into the standard RL objective. Empirical results on real-world datasets demonstrate the effectiveness of our proposed framework.

Introduction Video

Sessions where this paper appears

  • Poster Session 2

    Fri, February 25 12:45 AM - 2:30 AM (+00:00)
    Red 2
    Add to Calendar

  • Poster Session 9

    Sun, February 27 8:45 AM - 10:30 AM (+00:00)
    Red 2
    Add to Calendar