Papers
Showing papers for
.
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
Yecheng Jason Ma, Andrew Shen, Osbert Bastani, Dinesh Jayaraman
[AAAI-22] Main Track
Theoretical Guarantees of Fictitious Discount Algorithms for Episodic Reinforcement Learning and Global Convergence of Policy Gradient Methods
Xin Guo, Anran Hu, Junzi Zhang
[AAAI-22] Main Track
Differentially Private Regret Minimization in Episodic Markov Decision Processes
Sayak Ray Chowdhury, Xingyu Zhou
[AAAI-22] Main Track
Admissible Policy Teaching through Reward Design
Kiarash Banihashem, Adish Singla, Jiarui Gan, Goran Radanovic
[AAAI-22] Main Track
Constraint Sampling Reinforcement Learning: Incorporating Expertise For Faster Learning
Tong Mu, Georgios Theocharous, David Arbour, Emma Brunskill
[AAAI-22] Main Track
BScNets: Block Simplicial Complex Neural Networks
Yuzhou Chen, Yulia R. Gel, H. Vincent Poor
[AAAI-22] Main Track
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
Yijie Guo, Qiucheng Wu, Honglak Lee
[AAAI-22] Main Track
SimSR: Simple Distance-Based State Representations for Deep Reinforcement Learning
Hongyu Zang, Xin Li, Mingzhong Wang
[AAAI-22] Main Track
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings
Matthew Shunshi Zhang, Murat A. Erdogdu, Animesh Garg
[AAAI-22] Main Track
Achieving Zero Constraint Violation for Constrained Reinforcement Learning via Primal-Dual Approach
Qinbo Bai, Amrit Singh Bedi, Mridul Agarwal, Alec Koppel, Vaneet Aggarwal
[AAAI-22] Main Track
Learning Parameterized Task Structure for Generalization to Unseen Entities
Anthony Liu, Sungryull Sohn, Mahdi Qazwini, Honglak Lee
[AAAI-22] Main Track
AutoGCL: Automated Graph Contrastive Learning via Learnable View Generators
Yihang Yin, Qingzhong Wang, Siyu Huang, Haoyi Xiong, Xiang Zhang
[AAAI-22] Main Track
Multi-Agent Reinforcement Learning with General Utilities via Decentralized Shadow Reward Actor-Critic
Junyu Zhang, Amrit Singh Bedi, Mengdi Wang, Alec Koppel
[AAAI-22] Main Track
Offline Reinforcement Learning as Anti-Exploration
Shideh Rezaeifar, Robert Dadashi, Nino Vieillard, Léonard Hussenot, Olivier Bachem, Olivier Pietquin, Matthieu Geist
[AAAI-22] Main Track
Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes
Guillermo Infante, Anders Jonsson, Vicenç Gómez
[AAAI-22] Main Track
Robust Adversarial Reinforcement Learning with Dissipation Inequation Constraint
Peng Zhai, Jie Luo, Zhiyan Dong, Lihua Zhang, Shunli Wang, Dingkang Yang
[AAAI-22] Main Track
Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation
Evgenii Nikishin, Romina Abachi, Rishabh Agarwal, Pierre-Luc Bacon
[AAAI-22] Main Track
A Generalized Bootstrap Target for Value-Learning, Efficiently Combining Value and Feature Predictions
Anthony GX-Chen, Veronica Chelu, Blake A. Richards , Joelle Pineau
[AAAI-22] Main Track
A Provably-Efficient Model-Free Algorithm for Infinite-Horizon Average-Reward Constrained Markov Decision Processes
Honghao Wei, Xin Liu, Lei Ying
[AAAI-22] Main Track
Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning
Haoran Xu, Xianyuan Zhan, Xiangyu Zhu
[AAAI-22] Main Track
Meta Label Propagation for Few-Shot Semi-Supervised Learning on Graphs
Kaize Ding, Jianling Wang, James Caverlee, Huan Liu
[AAAI-22] Main Track
Graph-Wise Common Latent Factor Extraction for Unsupervised Graph Representation Learning
Thilini Cooray, Ngai-Man Cheung
[AAAI-22] Main Track
Context-Specific Representation Abstraction for Deep Option Learning
Marwa Abdulhai, Dong-Ki Kim, Matthew Riemer, Miao Liu, Gerald Tesauro, Jonathan P. How
[AAAI-22] Main Track
Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-Optimal Demonstrations
Zhuangdi Zhu, Kaixiang Lin, Bo Dai, Jiayu Zhou
[AAAI-22] Main Track