OpenAI's GPT-5 rivals human performance across diverse professions

October 31, 2025

RichardSmith

On Thursday, OpenAI introduced GDPval, a new benchmark that evaluates how its AI models stack up against human professionals across a range of industries. The benchmark is an early step toward gauging whether OpenAI's systems can outperform humans at economically valuable work, a core objective in the company's pursuit of artificial general intelligence (AGI).

According to OpenAI, both GPT-5 and Anthropic's Claude Opus 4.1 demonstrate output quality nearing that of industry specialists.

These findings do not imply that AI is about to replace human workers, but they give OpenAI a way to track progress toward that threshold. Despite some CEOs' predictions of widespread AI disruption within a few years, OpenAI acknowledges that GDPval currently assesses only a fraction of the tasks professionals perform in their real jobs.

GDPval evaluates performance across nine major sectors of the U.S. economy, including healthcare, finance, manufacturing, and government, and covers 44 occupations ranging from software engineering to journalism.

For GDPval-v0, experienced professionals compared AI-generated reports with reports written by other professionals. In one sample task, investment bankers were asked to produce a competitor landscape for the last-mile delivery industry, and their work was then compared with AI-generated versions. OpenAI calculated each model's "win rate" against the human outputs across all occupations.

GPT-5-high, an enhanced version of GPT-5, was rated as matching or exceeding expert output 40.6% of the time, while Claude Opus 4.1 reached 49% on the same measure. OpenAI suggests Claude's higher score may reflect stronger visual presentation rather than a substantive advantage.
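The win-rate calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not OpenAI's grading code: the function name `win_or_tie_rate`, the verdict labels, the sample judgments, and the choice to give every occupation equal weight in the average are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical grader verdicts, invented for illustration only.
# Each entry is (occupation, verdict), where the verdict records how the
# AI deliverable compared with the human expert's deliverable.
SAMPLE_JUDGMENTS = [
    ("investment banker", "win"),
    ("investment banker", "tie"),
    ("investment banker", "loss"),
    ("investment banker", "loss"),
    ("software engineer", "tie"),
    ("software engineer", "loss"),
    ("software engineer", "loss"),
    ("software engineer", "loss"),
]

VALID_VERDICTS = {"win", "tie", "loss"}


def win_or_tie_rate(judgments):
    """Return the mean, over occupations, of the share of tasks where the
    AI output was judged as good as or better than the expert's output."""
    counts = defaultdict(lambda: [0, 0])  # occupation -> [wins_or_ties, total]
    for occupation, verdict in judgments:
        if verdict not in VALID_VERDICTS:
            raise ValueError(f"unknown verdict: {verdict!r}")
        if verdict in ("win", "tie"):
            counts[occupation][0] += 1
        counts[occupation][1] += 1
    if not counts:
        raise ValueError("no judgments supplied")
    # Assumption: each occupation contributes equally to the overall score.
    per_occupation = [good / total for good, total in counts.values()]
    return sum(per_occupation) / len(per_occupation)


if __name__ == "__main__":
    rate = win_or_tie_rate(SAMPLE_JUDGMENTS)
    print(f"Win-or-tie rate: {rate:.1%}")
```

Counting ties alongside wins mirrors the "matched or exceeded" wording of the reported figures. Averaging per occupation first keeps an occupation with many graded tasks from dominating the overall score.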

OpenAI acknowledges GDPval-v0's narrow focus—currently testing only research report generation—and plans future iterations assessing broader workplace interactions.

OpenAI's chief economist, Dr. Aaron Chatterji, told TechCrunch that these results indicate professionals can increasingly delegate routine tasks to AI, freeing them for higher-value work.

Tejal Patwardhan, who leads evaluations at OpenAI, points to rapid progress: GPT-4o scored just 13.7% fifteen months ago, while GPT-5 nearly triples that figure, a trajectory she expects to continue.
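The "nearly triples" comparison can be checked with simple arithmetic. The sketch below assumes the GPT-5 figure being compared is the 40.6% reported for GPT-5-high earlier in the article; the variable names are illustrative.

```python
# Scores in percent, as quoted in the article.
gpt_4o_score = 13.7
gpt_5_high_score = 40.6  # assumption: the figure behind "nearly triples"

improvement_ratio = gpt_5_high_score / gpt_4o_score
print(f"GPT-5-high scores about {improvement_ratio:.2f}x the GPT-4o result")
```

The ratio comes out just under 3, consistent with the claim.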

Benchmarks such as AIME 2025 and GPQA Diamond still dominate AI assessment, but many models are approaching saturation on these academic tests. GDPval reflects a growing emphasis on practical, industry-relevant evaluation, though OpenAI will need more comprehensive testing to demonstrate conclusively that its models perform at a human level across professional domains.
