Selected Essays: ML Alignment and Theory Scholars
My most basic north star is harm reduction, especially that of people who are alive today and who live all around the world, including and especially the Global South. I do think that especially with new developments like Claude Mythos, the issue of scalable oversight essentially subsumes most other technical problems in AI safety in NP-complete fashion.