option
Home
News
Anthropic's New AI Model Operates Computers Like Humans, Errors Included

Anthropic's New AI Model Operates Computers Like Humans, Errors Included

May 9, 2025
38

Anthropic

Have you ever dreamed of an AI that can seamlessly interact with your computer, just like a human would? Well, that dream is now a reality, thanks to Anthropic's latest innovation. On Tuesday, they unveiled the new generation of their Claude AI model, named Claude 3.5 Sonnet, which can operate a computer with surprising finesse. Currently in beta mode, this AI is available for developers to experiment with through an API.

Anthropic proudly labels Claude 3.5 Sonnet as the "first frontier AI model to offer computer use in public beta." This means developers can program it to perform a variety of tasks on a computer, such as viewing the screen, maneuvering the cursor, clicking buttons, and even typing on a virtual keyboard. The goal? To replicate the way we interact with our computers every day.

Now, while this new AI is still in the experimental phase, it's not without its hiccups. It can be a bit clumsy and error-prone at times. But that's exactly why Anthropic released it in beta—to gather valuable feedback from developers and refine the model over time.

Why Should We Care About AI Using Computers?

Anthropic has a clear answer to that question: "A vast amount of modern work happens via computers." By enabling AIs to interact with software the same way humans do, they unlock a plethora of new applications that current AI assistants can't handle.

How Can Developers and Users Benefit?

Instead of creating specific tools for each task, Anthropic is teaching Claude general computer skills. This allows the AI to utilize a wide range of standard software programs designed for humans. Developers can harness this capability to automate repetitive tasks, build and test software, and even conduct research.

Several companies are already leveraging Claude 3.5 Sonnet's computer skills, including Asana, Canva, Cognition, DoorDash, Replit, and The Browser Company. For instance, Replit is using these capabilities to enhance its Replit Agent product.

How Did They Train Claude to Use Computers?

Training Claude to navigate a computer involved a lot of trial and error, according to Anthropic. The process requires the AI to understand and interpret images of the computer screen, then decide which actions to take based on what it sees. Claude 3.5 Sonnet accomplishes this by analyzing screenshots, counting pixels to precisely move the cursor, and issuing mouse commands.

How Well Is Claude Performing?

In the OSWorld benchmarking tests, which assess AI models' ability to use computers, Claude 3.5 Sonnet achieved a score of 14.9%. While this is significantly lower than the 70%-75% human-level performance, it's nearly double the 7.7% scored by the next best AI model in the same category.

Despite these promising results, Claude's computer use is still in its infancy. It can't yet perform more complex tasks like dragging windows or zooming into the screen. Additionally, because it relies on screenshots, it might miss certain actions and notifications.

Anthropic remains optimistic, stating, "We expect that computer use will rapidly improve to become faster, more reliable, and more useful for the tasks our users want to complete." They also emphasize that as the technology evolves, it will become more accessible to those with less software development experience, all while maintaining strict safety measures.

Claude 3.5 Sonnet is now accessible to everyone. Developers can start building applications with the computer-use beta on the Anthropic API, Amazon Bedrock, and Google Cloud's Vertex AI.

Related article
AI-Powered Summary: A Complete Guide to Summarizing YouTube Videos AI-Powered Summary: A Complete Guide to Summarizing YouTube Videos In today's fast-paced world, the ability to quickly process and understand information is more important than ever. YouTube, with its endless array of videos, is a treasure trove of knowledge, but who has the time to watch every video from start to finish? This guide will show you how to use AI tool
AI Revolutionizes Ultrasound for Point-of-Care Assessments AI Revolutionizes Ultrasound for Point-of-Care Assessments Artificial intelligence is shaking up the world of healthcare, and ultrasound technology is riding that wave of change. This article dives into how AI is transforming point-of-care ultrasound (POCUS) assessments, making them more accessible, efficient, and accurate. From smoothing out the kinks in i
Machine Learning Cheat Sheets: Essential AI Quick Reference Guide Machine Learning Cheat Sheets: Essential AI Quick Reference Guide In the dynamic world of technology, where AI and cloud computing are driving innovation, staying updated and ready is crucial. Whether you're discussing strategies with a colleague, crafting educational content, or gearing up for an interview, having quick access to key information can make all the
Comments (0)
0/200
Back to Top
OR