Reddit Sues Anthropic for AI Data Extraction Practices
Reddit is alleging that Anthropic has developed its Claude AI models using data from Reddit's user base, doing so without authorization or compensation.
Anyone who accesses Reddit, including automated web crawlers, must agree to the platform's user terms. These terms explicitly state that content cannot be extracted and utilized for commercial purposes without a formal agreement. Reddit contends that Anthropic’s bots have been engaged in precisely this activity for years, harvesting vast quantities of discussions and posts to train and enhance Claude.
Adding intrigue to the lawsuit is its direct challenge to Anthropic’s public image. Anthropic has cultivated a reputation as an ethical and trustworthy AI developer, positioning itself as the industry's "white knight." The legal filing, however, dismisses these assertions as little more than "empty marketing gimmicks."
For example, Reddit cites a statement from July 2024 in which Anthropic claimed it had ceased its bots from crawling Reddit. The lawsuit labels this claim as "false," presenting logs that purportedly show Anthropic's bots attempting to access the site over one hundred thousand times in the subsequent months.
Beyond corporate disputes, this issue touches directly on user privacy. When a Reddit user deletes a post or comment, they expect it to be removed entirely. Reddit has established official licensing agreements with other major AI firms, such as Google and OpenAI, which include technical safeguards to ensure deleted user content is also erased from the AI companies' systems.
According to the lawsuit, Anthropic has no such agreement and has declined to enter into one. This suggests that if Claude was trained on a post you later deleted, that information may remain embedded in its knowledge base, effectively bypassing your decision to remove it. The lawsuit even includes a screenshot in which Claude concedes it lacks the capability to determine whether the Reddit data it was trained on has since been deleted by a user:

So, what is Reddit seeking? While they are requesting damages to cover increased server costs and lost licensing revenue, their demands extend beyond financial compensation. They are petitioning the court for an injunction to compel Anthropic to immediately cease using any Reddit-derived data.
Additionally, Reddit aims to prevent Anthropic from selling or licensing any product developed with that data. This would effectively require a judge to remove Claude from the market.
This legal confrontation raises a critical question: Does content that is "publicly available" on the internet grant corporations the right to freely harvest and monetize it? Reddit's position is a definitive "no," and the outcome of this case could redefine the rules governing AI development moving forward.
See also: Tackling hallucinations: MIT spinout teaches AI to admit when it’s clueless
Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.
Explore other upcoming enterprise technology events and webinars powered by TechForge here.
Related article
Kakao Mobility outlines Level 4 autonomous driving roadmap for physical AI
Kakao Mobility is planning to develop Level 4 autonomous driving technologies internally as part of its physical AI strategy.
At the 2026 World IT Show conference in Seoul's COEX, Kim Jin-kyu — vice president and head of Kakao Mobility's Physical AI
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo
Related Special Topic Recommendations
Comments (0)
0/500
Reddit is alleging that Anthropic has developed its Claude AI models using data from Reddit's user base, doing so without authorization or compensation.
Anyone who accesses Reddit, including automated web crawlers, must agree to the platform's user terms. These terms explicitly state that content cannot be extracted and utilized for commercial purposes without a formal agreement. Reddit contends that Anthropic’s bots have been engaged in precisely this activity for years, harvesting vast quantities of discussions and posts to train and enhance Claude.
Adding intrigue to the lawsuit is its direct challenge to Anthropic’s public image. Anthropic has cultivated a reputation as an ethical and trustworthy AI developer, positioning itself as the industry's "white knight." The legal filing, however, dismisses these assertions as little more than "empty marketing gimmicks."
For example, Reddit cites a statement from July 2024 in which Anthropic claimed it had ceased its bots from crawling Reddit. The lawsuit labels this claim as "false," presenting logs that purportedly show Anthropic's bots attempting to access the site over one hundred thousand times in the subsequent months.
Beyond corporate disputes, this issue touches directly on user privacy. When a Reddit user deletes a post or comment, they expect it to be removed entirely. Reddit has established official licensing agreements with other major AI firms, such as Google and OpenAI, which include technical safeguards to ensure deleted user content is also erased from the AI companies' systems.
According to the lawsuit, Anthropic has no such agreement and has declined to enter into one. This suggests that if Claude was trained on a post you later deleted, that information may remain embedded in its knowledge base, effectively bypassing your decision to remove it. The lawsuit even includes a screenshot in which Claude concedes it lacks the capability to determine whether the Reddit data it was trained on has since been deleted by a user:

So, what is Reddit seeking? While they are requesting damages to cover increased server costs and lost licensing revenue, their demands extend beyond financial compensation. They are petitioning the court for an injunction to compel Anthropic to immediately cease using any Reddit-derived data.
Additionally, Reddit aims to prevent Anthropic from selling or licensing any product developed with that data. This would effectively require a judge to remove Claude from the market.
This legal confrontation raises a critical question: Does content that is "publicly available" on the internet grant corporations the right to freely harvest and monetize it? Reddit's position is a definitive "no," and the outcome of this case could redefine the rules governing AI development moving forward.
See also: Tackling hallucinations: MIT spinout teaches AI to admit when it’s clueless
Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.
Explore other upcoming enterprise technology events and webinars powered by TechForge here.
Barry Diller: Trust in Sam Altman irrelevant as AGI nears
Barry Diller, the billionaire media titan, does not believe OpenAI CEO Sam Altman is untrustworthy, despite recent reports suggesting otherwise. Speaking at the Wall Street Journal's "Future of Everything" conference this week, Diller defended Altman
YouTube expands AI deepfake detection to politicians, government officials, and journalists
On Tuesday, YouTube announced it is expanding its deepfake detection technology to a select group of government officials, political candidates, and journalists. The tool identifies AI-generated likenesses and lets pilot participants request the remo





Home






