Topic
1 article
OpenAI's GPT-5.4 becomes the first general-purpose model to score 83% on GDPVal — a benchmark measuring AI's ability to perform real economic work — while introducing native computer-use capabilities for autonomous agent workflows.