Data Scraping
Reddit sues Perplexity AI over ‘industrial-scale’ data scraping
New York Post· 2025-10-23 20:11
Core Viewpoint
- Reddit is suing Perplexity AI and three other companies for allegedly scraping posts from its platform on an industrial scale, claiming copyright violations and unfair competition [1][4][12].

Group 1: Allegations and Legal Actions
- Reddit accuses Perplexity AI of using data scrapers to unlawfully obtain content from its site and is seeking unspecified damages [4][5].
- The lawsuit also targets data-scraping partners Oxylabs UAB, AWMProxy, and SerpApi, which Reddit describes as entities employing dubious tactics to access its data [5][6].
- Reddit filed a similar complaint against Anthropic in June, indicating a pattern of legal action against companies it believes are infringing on its rights [4].

Group 2: Responses from Defendants
- Perplexity AI has denied the allegations, claiming that Reddit is engaging in extortion [8][13].
- SerpApi and Oxylabs have also rejected the claims, saying they stand by their business practices and are prepared to defend themselves against the lawsuit [10][11].

Group 3: Impact on AI Development
- Reddit's content is increasingly cited as a primary source for AI-generated responses, and the platform notes that its posts have become crucial for training AI chatbots [11].
- After Reddit sent a cease-and-desist letter, Perplexity's use of Reddit's content reportedly increased "forty-fold" [12].
Reddit Sues Perplexity Over Alleged Data Scraping
PYMNTS.com· 2025-10-22 21:27
Core Points
- Reddit has filed a lawsuit against Perplexity AI and three data-scraping firms, alleging unauthorized harvesting of its content [1][3].
- The lawsuit highlights the increasing tensions between online platforms and AI developers over data usage [1][4].

Company Actions
- The lawsuit names Perplexity AI, Oxylabs UAB, AWMProxy, and SerpApi as defendants, accusing them of collecting and reselling Reddit data without consent [3].
- Reddit's Chief Legal Officer stated that the lawsuit reflects a broader challenge in the industry, emphasizing the competition for high-quality human-generated text [4].

Industry Context
- Reddit's data has become essential for training generative AI models, and the company already has paid licensing agreements with OpenAI and Google [5].
- The lawsuit is part of Reddit's strategy to assert ownership over its data amid the AI industry's demand for training resources [6].

Legal Implications
- The case could set a precedent for how U.S. courts interpret the legality of web-scraped content used in AI model training [7].
- Legal experts suggest that the lawsuit is part of a larger trend of disputes shaping data governance and compliance in the tech industry [7].
X @Bloomberg
Bloomberg· 2025-10-22 17:28
Reddit sued Perplexity AI and three other companies over alleged data scraping from the discussion site without permission, a sign of the growing demand and value of original data in the AI industry https://t.co/yHIjbZCg2f ...