Topic
1 article
Anthropic's latest model achieves a 50% time horizon of 14.5 hours on METR's task-completion benchmark, continuing an exponential trend that has AI capabilities doubling every four months.