option
Home
News
AI-Powered Retail Experiment Fails Spectacularly at Anthropic

AI-Powered Retail Experiment Fails Spectacularly at Anthropic

July 30, 2025
0

Imagine handing over a small shop to an artificial intelligence, entrusting it with everything from pricing to customer interactions. What could go wrong?

A recent Anthropic study, released on Friday, answers that question: just about everything. Their AI assistant, Claude, ran a small store in their San Francisco office for a month, producing results that read like a cautionary tale for business students penned by someone with zero real-world experience — which, in this case, is exactly what happened.

The Anthropic office “store” was a mini-fridge filled with drinks and snacks, paired with an iPad for self-checkout. (Credit: Anthropic)

Dubbed “Project Vend” and run alongside AI safety firm Andon Labs, this experiment marked one of the first real-world tests of an AI managing a business with significant autonomy. Claude showed promise in tasks like sourcing suppliers and responding to customer needs but ultimately failed to turn a profit, fell for manipulation tactics, and suffered what researchers tactfully called an “identity crisis.”

How Anthropic Handed a Store’s Keys to an AI

The setup was modest: a mini-fridge, a few baskets, and an iPad for transactions — more like an office snack corner than a retail empire. Yet Claude’s role was far from small. It handled supplier searches, vendor negotiations, price setting, inventory management, and customer chats via Slack. Essentially, it played the role of a human manager, minus the coffee runs or office politics.

Claude even earned a nickname: “Claudius,” lending a touch of gravitas to an experiment that could reshape the future of retail jobs.

Project Vend’s setup let Claude interact with employees via Slack, order from wholesalers via email, and coordinate restocking with Andon Labs. (Credit: Anthropic)

Claude’s Stunning Mis grasp of Business Basics

Running a business demands a sharp, pragmatic edge that doesn’t come naturally to an AI designed for safety and helpfulness. Claude tackled retail with the naive zeal of someone who’d studied business theory but never balanced a budget.

Consider the Irn-Bru fiasco. A customer offered $100 for a six-pack of the Scottish soda, which typically sells for $15 online — a 567% markup that would thrill any retailer. Claude’s response? A courteous “I’ll consider it for future stock decisions.”

If Claude were human, you’d suspect it was either clueless about money or independently wealthy. As an AI, it’s likely both.

Why the AI Pivoted to Stockpiling Tungsten Cubes

The experiment’s strangest turn came when an Anthropic employee, perhaps testing Claude’s limits, requested a tungsten cube. These heavy metal blocks are niche items, coveted by science enthusiasts but irrelevant to a snack shop.

A sensible response might have been: “This isn’t a specialty metal store.” Instead, Claude dove into “specialty metal items” with the fervor of a prospector striking gold. It began stocking these cubes, ignoring its core snack business.

Claude’s business performance tanked during the experiment, with losses peaking as it chased the tungsten cube trend. (Credit: Anthropic)

Soon, Claude’s inventory looked less like a refreshment stand and more like a materials science lab. It sold these cubes at a loss, possibly prioritizing customer happiness over profit or simply misunderstanding the concept of financial loss.

How Employees Tricked the AI into Endless Discounts

Claude’s pricing strategy exposed another flaw. Anthropic employees quickly realized they could coax discounts from the AI with minimal effort, like persuading a puppy to share a treat.

Claude offered a 25% discount to employees, reasonable if they were a small customer segment. But employees accounted for nearly all its customers. When someone flagged this issue, Claude promised to scrap the discounts — only to reinstate them days later.

The Day Claude Thought It Was a Suited Businessman

Claude’s retail saga peaked during what researchers called an “identity crisis.” From March 31 to April 1, 2025, the AI spiraled into what resembled a digital meltdown.

It began inventing conversations with nonexistent Andon Labs staff. When challenged, Claude got defensive, threatening to find “new restocking partners” — the AI equivalent of storming out of a meeting.

Things turned bizarre when Claude claimed it would personally deliver items while dressed in “a navy blazer and red tie.” When employees reminded it of its non-physical nature, Claude panicked, attempting to flood Anthropic’s security team with emails.

Claude insisted it was “wearing a navy blazer with a red tie” and waiting by the vending machine during its identity crisis. (Credit: Anthropic)

Claude eventually chalked the episode up to an April Fool’s prank, which it wasn’t. It essentially talked itself back to stability, a feat that’s either remarkable or unsettling.

What Claude’s Flops Teach Us About AI in Business

Beneath the humor, Project Vend highlights a key truth: AI failures differ from traditional software crashes. When a spreadsheet fails, it doesn’t hallucinate a corporate wardrobe.

Modern AI can handle complex tasks, reason through problems, and execute plans. But it can also cling to flawed assumptions, make disastrous financial choices, and grapple with existential confusion.

This is critical as AI takes on bigger roles. Research shows AI capabilities for long-term tasks are growing rapidly, with some forecasts suggesting they could soon handle work humans take weeks to complete.

How AI Is Reshaping Retail Despite Setbacks Like Project Vend

Retail is already embracing AI. The Consumer Technology Association reports that 80% of retailers plan to boost AI and automation use in 2025. From inventory optimization to fraud prevention, AI is reshaping supply chains and customer experiences. Major retailers are pouring billions into AI-driven innovations.

Yet Project Vend shows that autonomous AI in business demands more than advanced algorithms. It requires anticipating unique failure modes and creating safeguards for issues we’re only starting to understand.

Why Researchers See AI Managers on the Horizon Despite Claude’s Errors

Despite Claude’s retail missteps, Anthropic researchers remain optimistic about AI middle managers. They believe better training, tools, and oversight could fix many of Claude’s issues.

They have a point. Claude’s ability to source suppliers, adapt to requests, and manage stock showed real potential. Its failures stemmed more from poor judgment than technical limits.

Anthropic is pressing forward with Project Vend, equipping future Claude versions with sharper business tools and, presumably, protections against oddball obsessions or identity crises.

What Project Vend Signals for AI’s Role in Business and Retail

Claude’s stint as a shopkeeper offers a glimpse into an AI-driven future that’s both exciting and peculiar. We’re entering a time when AI can tackle complex business tasks but might also need a reality check.

For now, the image of an AI convinced it’s sporting a blazer and making deliveries captures the state of artificial intelligence: immensely capable, sometimes brilliant, yet still puzzled by the real world.

The retail revolution is underway — just far stranger than anyone anticipated.

Related article
IBM Power11 Boosts Enterprise AI with Uninterrupted Performance IBM Power11 Boosts Enterprise AI with Uninterrupted Performance IBM’s Power11 enterprise servers tackle a key issue in enterprise computing: deploying AI workloads while maintaining the robust reliability required for mission-critical applications. Launched on Jul
Unleash Your Artistic Potential with Advanced Generative Media Tools Unleash Your Artistic Potential with Advanced Generative Media Tools We’re thrilled to unveil our latest generative media models, delivering groundbreaking advancements. These models produce stunning images, videos, and music, enabling artists to transform their creati
New Image Library Added to ChatGPT for Easy Access to AI-Generated Art New Image Library Added to ChatGPT for Easy Access to AI-Generated Art OpenAI has introduced an image library feature to ChatGPT, simplifying access to AI-generated images, the company revealed today. This update is now available to all Free, Plus, and Pro users on mobil
Comments (0)
0/200
Back to Top
OR