alignment

Sheaf-Theoretic Reward Spaces: A New Approach to Safe AI

When we train AI systems using human feedback, we typically