AI scaled

RLHF + reasoning models (test-time compute scaling)

Reasoning models that scale test-time compute have shifted part of AI capability from training time to inference time. By spending more computation per query, smaller models can solve problems in mathematics, coding, and science that were previously intractable for them, while keeping inference costs competitive.
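To make the idea of trading inference compute for accuracy concrete, here is a minimal sketch of one simple form of test-time scaling: sampling several answers and taking a majority vote. This specific technique is an illustration chosen here, not something the summary above names. It assumes a binary-answer task and independent samples that are each correct with the same probability; real model samples are correlated, so treat the numbers as an upper-bound intuition only. The function name `majority_vote_accuracy` is hypothetical.

```python
from math import comb


def majority_vote_accuracy(p_correct: float, n_samples: int) -> float:
    """Probability that a strict majority of n independent samples is correct.

    Assumptions (illustrative only): a binary-answer task, and independent
    samples that are each correct with probability p_correct.
    """
    if n_samples < 1 or n_samples % 2 == 0:
        raise ValueError("n_samples must be a positive odd number to avoid ties")
    if not 0.0 <= p_correct <= 1.0:
        raise ValueError("p_correct must be between 0 and 1")

    # Smallest number of correct samples that forms a strict majority.
    threshold = n_samples // 2 + 1

    # Sum the binomial probabilities of getting at least `threshold` correct.
    return sum(
        comb(n_samples, k) * p_correct**k * (1 - p_correct) ** (n_samples - k)
        for k in range(threshold, n_samples + 1)
    )


if __name__ == "__main__":
    # More samples per query means more inference compute.
    for n in (1, 3, 9, 27):
        print(n, round(majority_vote_accuracy(0.6, n), 3))
```

Under these assumptions, a model that is right 60% of the time per sample becomes more reliable as the number of samples grows, which is the sense in which extra inference compute can substitute for a larger model. If the per-sample accuracy is below 50%, voting makes the result worse, so the approach only helps when the base model is already better than chance.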

What to watch next

Watch for three developments: whether scaling laws for reasoning continue to hold beyond current limits; the emergence of multi-step verifiable reasoning that enables formal proof verification; and the development of interpretable chain-of-thought that shows whether a model's output reflects genuine reasoning or pattern matching.

Key sub-ideas & techniques

Current frontier

Key people

Startups & labs to watch