Blog posts

Two red-team critiques of METR's research on long tasks

Blog posts

Can superhuman AI help us improve?

Blog posts

Making Mathematical MONSTERS

Blog posts

Axiomatic jigsaw puzzles: probability