agents
- The Human Must Remain the Control Surface
- Prompt Injection Is an Operational Risk, Not a Prompting Problem
governance
human-controlled-agents
oasis-claw
practical-ai-safety-stack
prompt-injection
["ai-safety"
- The Human Must Remain the Control Surface
- Prompt Injection Is an Operational Risk, Not a Prompting Problem
ai
- Attention, But Make It Type-Safe
- Navigating the Safety Manifold: Why Black Holes are Safer Than Walls
- Gluing Rewards Together: How Math Solves Paradoxes
- From Proofs to Programs to... Text?
- Building an Auditable AI: A Complete Walkthrough
ai-safety
- The Shape of Good Behavior: Why AI Needs More Than Just a Number
- The Structure of Clear Thinking — An Introduction
attention
category-theory
high-dimensional-reward-spaces
- The Shape of Good Behavior: Why AI Needs More Than Just a Number
- Navigating the Safety Manifold: Why Black Holes are Safer Than Walls
- Gluing Rewards Together: How Math Solves Paradoxes
Human-Controlled Agents
Mathematics
Nature Knows Best
ontological-induction
reinforcement-learning
research
- Attention, But Make It Type-Safe
- Navigating the Safety Manifold: Why Black Holes are Safer Than Walls
- Gluing Rewards Together: How Math Solves Paradoxes
- From Proofs to Programs to... Text?
- Building an Auditable AI: A Complete Walkthrough
Safety
series-intro
sheaf-theory
structure_of_clear_thinking
- The Structure of Clear Thinking — An Introduction
- From Proofs to Programs to... Text?
- Building an Auditable AI: A Complete Walkthrough
The Practical AI Safety Stack
The Shape of Good Behavior
The Structure of Clear Thinking
- Why Your LLM Hallucinates (And How Category Theory Can Help)
- Attention, But Make It Type-Safe
- From Proofs to Programs to... Text?
- Building an Auditable AI: A Complete Walkthrough
- Compiling Programs Into Attention