Papers
Showing papers for
.
Curiosity-Driven Exploration via Latent Bayesian Surprise
Pietro Mazzaglia, Ozan Catal, Tim Verbelen, Bart Dhoedt
[AAAI-22] Main Track
Reward-Weighted Regression Converges to a Global Optimum
Miroslav Strupl, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, Jürgen Schmidhuber
[AAAI-22] Main Track
Invariant Action Effect Model for Reinforcement Learning
Zhengmao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang
[AAAI-22] Main Track
Chaining Value Functions for Off-Policy Learning
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt
[AAAI-22] Main Track
Learning by Competition of Self-Interested Reinforcement Learning Agents
Stephen Chung
[AAAI-22] Main Track
Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs
Ezgi Korkmaz
[AAAI-22] Main Track
Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability
Aviv Tamar, Daniel Soudry, Ev Zisselman
[AAAI-22] Main Track
Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits
Guojun Xiong, Jian Li, Rahul Singh
[AAAI-22] Main Track
Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson
[AAAI-22] Main Track
Creativity of AI: Automatic Symbolic Option Discovery for Facilitating Deep Reinforcement Learning
Mu Jin, Zhihao Ma, Kebing Jin, Hankz Hankui Zhuo, Chen Chen, Chao Yu
[AAAI-22] Main Track
Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti, Mattia Mancassola, Marcello Restelli
[AAAI-22] Main Track
What About Inputing Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator
Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang
[AAAI-22] Main Track
Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang, Jie Wang, Qi Zhou, Bin Li, Houqiang Li
[AAAI-22] Main Track
Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization
Pierre Liotet, Francesco Vidaich, Alberto Maria Metelli, Marcello Restelli
[AAAI-22] Main Track
Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Roy Zohar, Shie Mannor, Guy Tennenholtz
[AAAI-22] Main Track
Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning
Giseung Park, Sungho Choi, Youngchul Sung
[AAAI-22] Main Track
Q-Ball: Modeling Basketball Games Using Deep Reinforcement Learning
Chen Yanai, Adir Solomon, Gilad Katz, Bracha Shapira, Lior Rokach
[AAAI-22] Main Track
Conjugated Discrete Distributions for Distributional Reinforcement Learning
Björn Lindenberg, Jonas Nordqvist, Karl-Olof Lindahl
[AAAI-22] Main Track
Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Xiaobai Ma, David Isele, Jayesh K. Gupta, Kikuo Fujimura, Mykel J. Kochenderfer
[AAAI-22] Main Track
Learning Robust Policy Against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
Yufei Kuang, Miao Lu, Jie Wang, Qi Zhou, Bin Li, Houqiang Li
[AAAI-22] Main Track
Spline-PINN: Approaching PDEs without Data Using Fast, Physics-Informed Hermite-Spline CNNs
Nils Wandel, Michael Weinmann, Michael Neidlin, Reinhard Klein
[AAAI-22] Main Track
Exploring Safer Behaviors for Deep Reinforcement Learning
Enrico Marchesini, Davide Corsi, Alessandro Farinelli
[AAAI-22] Main Track
Learning Expected Emphatic Traces for Deep RL
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt
[AAAI-22] Main Track