option
Home
News
Meta Enhances AI Security with Advanced Llama Tools

Meta Enhances AI Security with Advanced Llama Tools

August 9, 2025
1

Meta has released new Llama security tools to bolster AI development and protect against emerging threats.

These upgraded Llama AI model security tools are paired with Meta’s new resources to empower cybersecurity teams in leveraging AI for defense, aiming to enhance safety for all AI stakeholders.

Developers using Llama models now have access to enhanced tools, available directly on Meta’s Llama Protections page, Hugging Face, and GitHub.

Llama Guard 4 introduces multimodal capabilities, enabling safety enforcement for both text and images, critical for increasingly visual AI applications. It’s integrated into Meta’s new Llama API, currently in limited preview.

LlamaFirewall, a new addition, serves as a security hub for AI systems, coordinating safety models and integrating with Meta’s protective tools to counter risks like prompt injection attacks, unsafe code generation, or malicious AI plug-in behavior.

Meta has also refined Llama Prompt Guard. The updated Prompt Guard 2 (86M) model excels at detecting jailbreak attempts and prompt injections. Additionally, the compact Prompt Guard 2 22M reduces latency and compute costs by up to 75%, maintaining strong detection for cost-conscious developers.

Beyond developers, Meta supports cybersecurity professionals with AI-driven tools to combat cyberattacks, responding to growing demands for advanced defenses.

The CyberSec Eval 4 benchmark suite has been revamped, offering organizations tools to assess AI performance in security tasks. It includes two halb2>two new additions:

  • CyberSOC Eval: Developed with CrowdStrike, this framework evaluates AI effectiveness in real Security Operation Centre environments, focusing on threat detection and response. It will be available soon.
  • AutoPatchBench: This tests Llama and other AI models’ ability to identify and patch code vulnerabilities before exploitation.

Meta’s Llama Defenders Program provides partners and developers with tailored AI security solutions, combining open-source and early-access tools to address diverse challenges.

Meta is sharing its internal Automated Sensitive Doc Classification Tool, which labels sensitive documents to prevent unauthorized leaks or misuse in AI systems like RAG setups.

To combat AI-generated audio scams, Meta is sharing the Llama Generated Audio Detector and Llama Audio Watermark Detector with partners like ZenDesk, Bell Canada, and AT&T to identify fraudulent AI voices in phishing or fraud attempts.

Meta also previewed Private Processing for WhatsApp, enabling AI to summarize messages or draft replies without accessing message content, prioritizing user privacy.

Meta openly shares its threat model, encouraging security researchers to scrutinize the architecture before launch, demonstrating a commitment to robust privacy measures.

This comprehensive set of AI security updates from Meta strengthens their AI ecosystem while equipping the tech community with tools for secure development and effective defense.

See also: Microsoft uncovers $4B in AI-driven fraud attempts

Discover more about AI and big data at the AI & Big Data Expo in Amsterdam, California, and London, co-located with events like Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Explore upcoming enterprise technology events and webinars by TechForge here.

Related article
Meta Offers High Pay for AI Talent, Denies $100M Signing Bonuses Meta Offers High Pay for AI Talent, Denies $100M Signing Bonuses Meta is attracting AI researchers to its new superintelligence lab with substantial multimillion-dollar compensation packages. However, claims of $100 million "signing bonuses" are untrue, per a recru
NotebookLM Unveils Curated Notebooks from Top Publications and Experts NotebookLM Unveils Curated Notebooks from Top Publications and Experts Google is enhancing its AI-driven research and note-taking tool, NotebookLM, to serve as a comprehensive knowledge hub. On Monday, the company introduced a curated collection of notebooks from promine
Meta Intensifies Efforts to Curb Unoriginal Content on Facebook Meta Intensifies Efforts to Curb Unoriginal Content on Facebook On Monday, Meta unveiled stricter measures to tackle accounts posting unoriginal content on Facebook, targeting those that repeatedly repurpose others’ text, images, or videos. The company reported re
Comments (0)
0/200
Back to Top
OR