Google’s latest AI model report lacks key safety details, experts say
April 27, 2025
Christopher Thomas
On Thursday, weeks after launching its latest and most advanced AI model, Gemini 2.5 Pro, Google released a technical report detailing the results of its internal safety assessments. However, experts have criticized the report for its lack of detail, making it challenging to fully understand the potential risks associated with the model.
Technical reports are crucial in the AI world, offering insights—even if they're sometimes unflattering—that companies might not usually share publicly. These reports are generally viewed by the AI community as genuine efforts to foster independent research and enhance safety evaluations.
Google's approach to safety reporting differs from some of its competitors. The company only publishes technical reports once a model moves beyond the "experimental" phase. Moreover, Google omits certain "dangerous capability" evaluation results from these reports, saving them for a separate audit.
Despite this, several experts expressed disappointment with the Gemini 2.5 Pro report to TechCrunch, pointing out its scant coverage of Google's proposed Frontier Safety Framework (FSF). Google unveiled the FSF last year, aiming to pinpoint future AI capabilities that might lead to "severe harm."
"This report is very sparse, contains minimal information, and was released weeks after the model was already made public," Peter Wildeford, co-founder of the Institute for AI Policy and Strategy, told TechCrunch. "It's impossible to verify if Google is living up to its public commitments and thus impossible to assess the safety and security of their models."
Thomas Woodside, co-founder of the Secure AI Project, acknowledged the release of the report for Gemini 2.5 Pro but questioned Google's dedication to providing timely supplemental safety evaluations. He noted that Google last published dangerous capability test results in June 2024, for a model announced in February of that year.
Adding to the concerns, Google has not yet released a report for Gemini 2.5 Flash, a smaller, more efficient model announced last week. A spokesperson informed TechCrunch that a report for Flash is "coming soon."
"I hope this is a promise from Google to start publishing more frequent updates," Woodside told TechCrunch. "Those updates should include the results of evaluations for models that haven’t been publicly deployed yet, since those models could also pose serious risks."
While Google was among the first AI labs to propose standardized reports for models, it's not alone in facing criticism for a lack of transparency. Meta released a similarly brief safety evaluation for its new Llama 4 open models, and OpenAI chose not to publish any report for its GPT-4.1 series.
Adding to the pressure are the assurances Google has given regulators that it would maintain high standards in AI safety testing and reporting. Two years ago, Google told the U.S. government it would publish safety reports for all "significant" public AI models "within scope," and it later made similar commitments to other countries, pledging "public transparency" around AI products.
Kevin Bankston, a senior adviser on AI governance at the Center for Democracy and Technology, described the trend of sporadic and vague reports as a "race to the bottom" on AI safety.
"Combined with reports that competing labs like OpenAI have reduced their safety testing time before release from months to days, this meager documentation for Google’s top AI model tells a troubling story of a race to the bottom on AI safety and transparency as companies rush their models to market," he told TechCrunch.
Google has stated that, although not detailed in its technical reports, it conducts safety testing and "adversarial red teaming" for models before their release.
Updated 4/22 at 12:58 p.m. Pacific: Modified language around the technical report’s reference to Google’s FSF.