<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>AI Benchmarks on AIBriefCentral</title><link>https://aibriefcentral.com/tags/ai-benchmarks/</link><description>Recent content in AI Benchmarks on AIBriefCentral</description><generator>Hugo -- gohugo.io</generator><language>en-us</language><lastBuildDate>Sat, 07 Mar 2026 00:58:01 +0000</lastBuildDate><atom:link href="https://aibriefcentral.com/tags/ai-benchmarks/index.xml" rel="self" type="application/rss+xml"/><item><title>OpenAI Launches GPT-5.4, First AI to Outperform Humans at Computer Control</title><link>https://aibriefcentral.com/2026/03/openai-launches-gpt-5.4-first-ai-to-outperform-humans-at-computer-control/</link><pubDate>Sat, 07 Mar 2026 00:58:01 +0000</pubDate><guid>https://aibriefcentral.com/2026/03/openai-launches-gpt-5.4-first-ai-to-outperform-humans-at-computer-control/</guid><description>What Happened On Thursday, March 5, 2026, OpenAI announced the release of GPT-5.4, available in three versions: the standard model, GPT-5.4 Thinking (with enhanced reasoning capabilities), and GPT-5.4 Pro (high-performance version). The release represents what OpenAI calls &amp;ldquo;our most capable and efficient frontier model for professional work.&amp;rdquo;
The standout achievement is GPT-5.4&amp;rsquo;s performance on the OSWorld-Verified benchmark, where it scored 75% compared to human performance of 72.4%. This benchmark tests a model&amp;rsquo;s ability to navigate desktop environments using only screenshots and keyboard/mouse actions, essentially measuring how well AI can operate a computer like a human would.</description></item></channel></rss>