Inference model safety
High-inference cost models like o3 might be a boon for AI safety:
- More reasoning is done in chain-of-thought, which is inspectable!
- Mech interp is more promising, as base models will be smaller!
- Running frontier models will be more expensive, reducing deployment overhang!