High-inference cost models like o3 might be a boon for AI safety:

  • More reasoning is done in chain-of-thought, which is inspectable!
  • Mech interp is more promising, as base models will be smaller!
  • Running frontier models will be more expensive, reducing deployment overhang!