Workflow
Reddit user posts
icon
Search documents
Reddit sues Perplexity for scraping of posts, expanding user data battle with AI industry
CNBCยท 2025-10-23 04:41
Core Viewpoint - Reddit has filed a lawsuit against Perplexity for allegedly scraping user posts to train its AI model, highlighting ongoing tensions between content owners and the AI industry [1][3]. Group 1: Allegations and Defendants - Reddit claims that Perplexity, along with three other entities, illegally extracted its copyrighted content by disguising their identities and web scrapers [2]. - The defendants named in the lawsuit include Oxylabs, AWMProxy, and SerpApi, which Reddit accuses of assisting in the data collection process [1][2]. Group 2: Industry Context - This lawsuit is part of a broader trend where content owners are taking legal action against AI firms for using copyrighted material without permission to train large language models [3]. - Reddit has previously initiated a similar lawsuit against AI startup Anthropic, indicating its proactive stance in protecting its content [3]. Group 3: Reddit's Position and Strategy - Reddit's Chief Legal Officer stated that AI companies are engaged in an "arms race for quality human content," leading to a "data laundering" economy where scrapers steal data to sell to clients [4]. - The lawsuit claims that user posts from Reddit have become a primary source for AI-generated answers on Perplexity, with citations increasing forty-fold after a cease-and-desist letter was sent [5]. Group 4: Licensing Agreements - Reddit has been leveraging its data pool by allowing access only through AI-related licensing agreements, having signed deals with OpenAI and Alphabet's Google [6]. - Perplexity argues that it does not train AI models on Reddit content but merely summarizes public discussions, claiming that licensing agreements are therefore unnecessary [6][7]. Group 5: Financial Implications - AI licensing deals with Google and OpenAI reportedly account for nearly 10% of Reddit's revenue, emphasizing the financial significance of data licensing for the company [8].