ElevenLabs Sets New Speech-to-Text Benchmark; Google Gemini Follows with Broad Capabilities

Home

News

March 17, 2026

KennethJones

136

Artificial Analysis has released the latest version of its speech-to-text benchmark, AA-WER v2.0. The findings highlight ElevenLabs and Google as clear leaders in audio transcription performance.

When measured by the core word error rate (WER), ElevenLabs' Scribe v2 achieved the top spot with an impressively low 2.3% error rate. Close behind was Google's Gemini3Pro at 2.9%. It's worth noting that Google did not fine-tune Gemini for transcription; this result stems purely from its robust multimodal general capabilities.

Other leading models showed the following results:

Mistral Voxtral Small: Took third place with a 3.0% error rate.

Google Gemini3Flash: Delivered a solid performance with a 3.1% error rate.

OpenAI Whisper Large v3: The most widely-used open-source model placed in the middle of the pack with a 4.2% error rate.

Lowest performers: Alibaba's Qwen3ASR Flash (5.9%), Amazon's Nova2Omni (6.0%), and Rev AI (6.1%) rounded out the bottom of the rankings.

In the dedicated AA-AgentTalk benchmark for voice assistant commands, the leaderboard remained consistent. ElevenLabs' Scribe v2 and Google's Gemini3Pro maintained their lead with error rates of 1.6% and 1.7% respectively, proving highly reliable for short, direct voice interactions.

Anthropic Study Links Polished AI Content to Reduced Human Thinking When you see AI instantly produce a well-structured, logically clear piece of code or document, are you tempted to trust it without a second thought? According to AIbase, the leading AI company Anthropic recently published a research report titled "A

UK Government Departments Clash Over Energy Needs for AI Data Centers The UK government is grappling with a major challenge: advancing clean energy while aiming to become a global leader in artificial intelligence. Yet serious inconsistencies appear between the departments responsible for these goals. The Department fo

Cyberspace Administration of China mandates tagging of AI-generated and fictional short videos The Cyberspace Administration of China has rolled out a comprehensive plan to standardize short video content labeling, mandating that platforms offer six required tags—including "AI-generated content"—ushering in a new era of mandatory transparency

Related Special Topic Recommendations

Comic Creation

Top AI Auto-Colorization Tools for Manga: Apply Flat Colors with Zero Consistency Errors

Discover the 2026 best AI auto-colorization tools for manga at XIX.AI. Our curated list features top-rated, game-changing solutions that apply flat colors with zero consistency errors, boosting your productivity. Explore free vs paid comparisons, real-world tests, and weekly updated rankings to find your perfect match. Unlock your AI edge today.

10 tools

xix.ai

writing

Top AI Fiction Profile Creators: Generate Consistent Character Motivations and Fatal Flaws

Discover the 2026 best AI fiction profile creators for crafting deep characters. XIX.AI's curated list features top-rated, game-changing tools that generate consistent motivations and fatal flaws. Compare free vs paid options with real-world tests. Unlock your storytelling potential now.

10 tools

xix.ai

Business

Top AI Pricing Optimization Software: Track Competitors & Auto-Adjust Store Prices

Discover the 2026 best AI pricing optimization software on XIX.AI. Our curated list features top-rated, game-changing tools that track competitors and auto-adjust your store prices for maximum profit. Compare free vs paid options with real-world tests. Unlock your pricing edge now.

10 tools

xix.ai

code

Best AI Code Reviewers: Automate Clean Code Compliance & Refactor Legacy Repo Files

Discover the 2026 best AI code reviewers on XIX.AI. Our curated list features top-rated, game-changing tools for automating clean code compliance and refactoring legacy repo files. Compare free vs paid options with real-world tests and weekly updated rankings. Unlock your AI edge today.

10 tools

xix.ai

Text-to-speech

Top AI TTS Apps for Dyslexia: Support Learning and Reading Efficiency for Students

Discover the 2026 latest top-rated AI TTS apps curated for dyslexia support. Our expert rankings compare free vs paid tools, highlighting powerful features for enhanced reading efficiency and learning. Explore must-try, game-changing solutions to unlock student potential. Start your journey at XIX.AI.

10 tools

xix.ai

Comic Creation

Top AI Generators for Shonen Manga: Create High-Octane Action Sequences & Energy Effects

Discover the 2026 best AI generators for Shonen manga at XIX.AI. Our top-rated, curated list features powerful tools for creating high-octane action sequences and dynamic energy effects. Compare free vs paid options with real-world tests. Unlock your creative potential and start crafting epic manga today!

15 tools

xix.ai

Comments (1)

0/500

Please login first

LiamWalker

May 14, 2026 at 8:00:20 AM EDT

Just tried ElevenLabs' API and the accuracy is insane for my podcast clips! Gemini being close behind means we're finally getting real competition in this space. Can't wait to see prices drop as they fight it out. 🎧