Feasible Constraint Policy Optimization for Safe Reinforcement Learning
AAMAS 2026 The 25th International Conference on Autonomous Agents and Multiagent Systems
We introduce Feasible Constraint Policy Optimization (FCPO), which seamlessly combines penalty and trust region methods to address policy feasibility while ensuring stability and performance.