Workflow
ClaudeBot
icon
Search documents
数据“中毒”会让AI“自己学坏”
Ke Ji Ri Bao· 2025-08-19 00:18
Core Insights - The article discusses the risks of data poisoning in AI systems, highlighting how malicious interference can lead to incorrect AI learning and potentially dangerous outcomes in various sectors like transportation and healthcare [1][2]. Group 1: Data Poisoning Risks - Data poisoning can occur when misleading data is fed into AI systems, causing them to develop incorrect understandings and make erroneous judgments [1][2]. - A notable example of data poisoning is the case of Microsoft's chatbot Tay, which was forced offline within hours of launch due to being manipulated by users [2]. - The rise of AI web crawlers has led to concerns about the collection of toxic data, which can result in copyright infringement and the spread of false information [3]. Group 2: Copyright and Defensive Measures - Creators are increasingly concerned about their works being used without permission, leading to legal actions like the lawsuit from The New York Times against OpenAI for copyright infringement [4]. - Tools like Glaze and Nightshade have been developed to protect creators' works by introducing subtle alterations that confuse AI models, effectively turning their own creations into "poison" for AI training [4]. - Cloudflare has introduced "AI Maze" to trap AI crawlers in a loop of meaningless data, consuming their resources and time [4]. Group 3: Decentralized Defense Strategies - Researchers are exploring decentralized technologies as a defense against data poisoning, with methods like federated learning allowing models to learn locally without sharing raw data [5][6]. - Blockchain technology is being integrated into AI defense systems to provide traceability and accountability in model updates, enabling the identification of malicious data sources [6]. - The combination of federated learning and blockchain aims to create more resilient AI systems that can alert administrators to potential data poisoning threats [6].
AI全面战争,从爬虫毁灭互联网开始
Hu Xiu· 2025-03-24 14:13
这是第一次,全世界最大的网络基础设施公司之一,Cloudflare,开始用魔法打败魔法,用AI来对抗AI爬虫。 这事有意思的程度,足以载入AI发展史册。这是一次AI领域的全面战争。 你可能现在还有很多疑惑,Cloudflare是什么,AI爬虫是什么,AI迷宫又是什么,这个事到底有意思在哪。 作为这一切的开始,我想先跟你讲一个故事,一个在今年1月份,发生在一个仅有7人的乌克兰公司的故事。 这个公司叫做Triplegangers,做的业务特别简单,就是卖人的3D数字模型。 AI全面战争,从爬虫毁灭互联网开始 昨天看到一个非常有意思的事情。 Triplegangers专注于销售"人体的数字孪生"模型素材,这些高清3D模型照片来自真实人类扫描,价值巨大。 创始人Tomchuk对自己公司的业务一直很满意,公司虽然不大,但这是他最喜欢的事情。 这个网站一共有65000个产品页面,每个产品的页面至少放着三张高清照片。 每一张图片都细致地标注了年龄、肤色、纹身甚至伤疤。 但是,就在一个普通的周六早上, 这种平静被一场风暴骤然打破。 Tomchuk收到了一条紧急通知:公司的网站崩溃了,因为受到了大量的DDoS攻击。 他懵了,因 ...