Publications

Papers, essays, and in-depth writing

2025-11-15

AI Safety Undervalues Founders

Why the AI safety field needs to invest more in entrepreneurial talent and organization-building, not just technical research.

LessWrong · AI Safety Field-Building
2025-01-30

Our Cosmic Potential

A reflection on humanity's vast cosmic potential and the existential risks we must navigate to realize it.

Substack · Philosophy
2025-01-23

Implications of the inference scaling paradigm for AI safety

A summary of how the advent of inference-scaling models like OpenAI o1 changes the landscape of AI safety.

LessWrong · AI Safety Prioritization
2025-01-09

MATS mentor selection

A description of the MATS Program's mentor selection process.

LessWrong · AI Safety Field-Building
2025-09-15

Why I funded PIBBSS

An explanation of PIBBSS.ai's value to AI safety field-building.

LessWrong · AI Safety Field-Building
2024-05-23

Talent Needs of Technical AI Safety Teams

The primary archetypes of technical talent needed for AI safety.

LessWrong · AI Safety Field-Building
2023-05-03

How MATS addresses “mass movement building” concerns

Recently, many AI safety movement-building programs have been criticized for attempting to grow the field too rapidly. At MATS, we think these concerns are real and important, and we support efforts to mitigate them.

LessWrong · AI Safety Field-Building
2023-03-02

Aspiring AI safety researchers should ~argmax over AGI timelines

Many people seem to be entering the AI safety ecosystem, acquiring a belief in short timelines and high P(doom), and immediately dropping everything to work on AI safety agendas that might pay off in short-timeline worlds. However, many of these people may not have a sufficient “toolbox” or enough research experience to have much marginal impact in short-timeline worlds.

LessWrong · AI Safety Field-Building
2022-12-26

Air-gapping evaluation and support

I think evaluation and support mechanisms should be somewhat “air-gapped,” or isolated, in their information-gathering and decision-making processes.

LessWrong · Field-Building Philosophy
2022-12-04

Probably good projects for the AI safety ecosystem

A list of projects that are probably good for the AI safety ecosystem, at least according to me.

LessWrong · AI Safety Field-Building
2022-06-30

Selection processes for subagents

People sometimes talk about the human mind containing “subagents”. I wrote about some possible processes that might favor multi-agent architectures in neural networks.

LessWrong · AI Safety
2022-04-10

Is Fisherian Runaway Gradient Hacking?

Fisherian runaway is an insightful example of the path-dependence of local search, where an easily acquired and apparently useful proxy goal can be so strongly favored that disadvantageous traits emerge as side effects.

LessWrong AI Safety