Papers
      Showing papers for
      .
      
    
  Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs
Ezgi Korkmaz
                
                  [AAAI-22] Main Track
                
                
                
            Blockwise Sequential Model Learning for Partially Observable Reinforcement Learning
Giseung Park, Sungho Choi, Youngchul Sung
                
                  [AAAI-22] Main Track
                
                
                
            Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization
Pierre Liotet, Francesco Vidaich, Alberto Maria Metelli, Marcello Restelli
                
                  [AAAI-22] Main Track
                
                
                
            Locality Matters: A Scalable Value Decomposition Approach for Cooperative Multi-Agent Reinforcement Learning
Roy Zohar, Shie Mannor, Guy Tennenholtz
                
                  [AAAI-22] Main Track
                
                
                
            Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability
Aviv Tamar, Daniel Soudry, Ev Zisselman
                
                  [AAAI-22] Main Track
                
                
                
            Spline-PINN: Approaching PDEs without Data Using Fast, Physics-Informed Hermite-Spline CNNs
Nils Wandel, Michael Weinmann, Michael Neidlin, Reinhard Klein
                
                  [AAAI-22] Main Track
                
                
                
            Invariant Action Effect Model for Reinforcement Learning
Zhengmao Zhu, Shengyi Jiang, Yu-Ren Liu, Yang Yu, Kun Zhang
                
                  [AAAI-22] Main Track
                
                
                
            Reward-Weighted Regression Converges to a Global Optimum
Miroslav Strupl, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, Jürgen Schmidhuber
                
                  [AAAI-22] Main Track
                
                
                
            Chaining Value Functions for Off-Policy Learning
Simon Schmitt, John Shawe-Taylor, Hado van Hasselt
                
                  [AAAI-22] Main Track
                
                
                
            Learning Robust Policy Against Disturbance in Transition Dynamics via State-Conservative Policy Optimization
Yufei Kuang, Miao Lu, Jie Wang, Qi Zhou, Bin Li, Houqiang Li
                
                  [AAAI-22] Main Track
                
                
                
            Unsupervised Reinforcement Learning in Multiple Environments
Mirco Mutti, Mattia Mancassola, Marcello Restelli
                
                  [AAAI-22] Main Track
                
                
                
            Q-Ball: Modeling Basketball Games Using Deep Reinforcement Learning
Chen Yanai, Adir Solomon, Gilad Katz, Bracha Shapira, Lior Rokach
                
                  [AAAI-22] Main Track
                
                
                
            Conjugated Discrete Distributions for Distributional Reinforcement Learning
Björn Lindenberg, Jonas Nordqvist, Karl-Olof Lindahl
                
                  [AAAI-22] Main Track
                
                
                
            Curiosity-Driven Exploration via Latent Bayesian Surprise
Pietro Mazzaglia, Ozan Catal, Tim Verbelen, Bart Dhoedt
                
                  [AAAI-22] Main Track
                
                
                
            Reinforcement Learning Augmented Asymptotically Optimal Index Policy for Finite-Horizon Restless Bandits
Guojun Xiong, Jian Li, Rahul Singh
                
                  [AAAI-22] Main Track
                
                
                
            Exploring Safer Behaviors for Deep Reinforcement Learning
Enrico Marchesini, Davide Corsi, Alessandro Farinelli
                
                  [AAAI-22] Main Track
                
                
                
            Creativity of AI: Automatic Symbolic Option Discovery for Facilitating Deep Reinforcement Learning
Mu Jin, Zhihao Ma, Kebing Jin, Hankz Hankui Zhuo, Chen Chen, Chao Yu
                
                  [AAAI-22] Main Track
                
                
                
            Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Mingfei Sun, Sam Devlin, Katja Hofmann, Shimon Whiteson
                
                  [AAAI-22] Main Track
                
                
                
            Recursive Reasoning Graph for Multi-Agent Reinforcement Learning
Xiaobai Ma, David Isele, Jayesh K. Gupta, Kikuo Fujimura, Mykel J. Kochenderfer
                
                  [AAAI-22] Main Track
                
                
                
            Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic
Zhihai Wang, Jie Wang, Qi Zhou, Bin Li, Houqiang Li
                
                  [AAAI-22] Main Track
                
                
                
            Learning by Competition of Self-Interested Reinforcement Learning Agents
Stephen Chung
                
                  [AAAI-22] Main Track
                
                
                
            What About Inputing Policy in Value Function: Policy Representation and Policy-Extended Value Function Approximator
Hongyao Tang, Zhaopeng Meng, Jianye Hao, Chen Chen, Daniel Graves, Dong Li, Changmin Yu, Hangyu Mao, Wulong Liu, Yaodong Yang, Wenyuan Tao, Li Wang
                
                  [AAAI-22] Main Track
                
                
                
            Learning Expected Emphatic Traces for Deep RL
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt
                
                  [AAAI-22] Main Track