爬虫技术
Search documents
有判头!3 人非法爬取知名平台 8 亿条核心数据
程序员的那些事· 2025-11-18 00:44
数字化时代,数据价值日益凸显。爬虫技术凭借高效的数据抓取能力与优质的数据产出,成 为众多互联网企业的得力工具。然而,其广泛应用也伴随着一系列法律风险。 在上海市普陀区人民检察院办理的一起案件中,8亿行餐饮商超数据、超300万次地图数据遭 非法爬取。检察机关通过高质效办案,成功破解取证与定性难题。 2025年8月28日,经普陀区人民检察院提起公诉,法院以 陈某某犯提供侵入计算机信息系统 月不等,同时适用缓刑,并处罚金三万元至一万元不等 。 来源:上海普陀检察微信公众号 核心数据被"暗流"爬取 风控系统拉响警报 2023年2月初,某知名地图软件公司在日常风控监测中发现异常: 有不明用户正持续爬取其数 据库中的地理坐标类数据,单日窃取量高达100万-400万条!该数据涵盖商家信息、服务介 绍、用户点评、排行及地理位置等核心内容。 程序罪,李某某、吴某某犯非法获取计算机信息系统数据罪,分别判处有期徒刑三年至六个 公司虽迅速启动拦截机制,但对方伪造的请求特征高度模拟真实用户行为。同年5月,该公司 向公安机关报案。进一步侦查发现,另一关联互联网平台公司的数据同样被非法爬取。2023 年6月,嫌疑人李某某、吴某某先后被抓获 ...
个人信息被“开盒”用于营销
Xin Hua Ri Bao· 2025-10-20 04:20
Core Viewpoint - The article highlights the emergence of an illegal industry involving the development and sale of "violent customer acquisition" software that exploits user data from short video platforms, leading to significant privacy violations and potential criminal activities [2][5]. Group 1: Illegal Software Development - The accused individuals, including Wei and Tan, developed software to illegally obtain user data from short video platforms, with low development costs and high efficiency [3][4]. - The software was designed to bypass platform security measures, allowing for rapid data extraction, including user IDs, phone numbers, and location information [3][6]. Group 2: Market Dynamics and User Impact - The software was sold in subscription models, with prices ranging from 40 to 140 yuan, targeting internet marketers and individuals involved in illegal activities [4][5]. - The illegal acquisition of user data has led to increased targeted scams and has negatively impacted legitimate businesses by disrupting their customer acquisition efforts [5][8]. Group 3: Legal Proceedings and Challenges - The case has highlighted difficulties in prosecuting such crimes, including evidence collection and tracing the criminal chain due to the anonymity of online transactions [5][6]. - The court's focus was on whether the software constituted a tool for invading computer systems, emphasizing that the method of data acquisition was illegal, regardless of the information's accessibility [6][7]. Group 4: Recommendations for Data Protection - Experts suggest that platforms need to enhance their defenses against "crawler" technologies and improve internal controls to prevent data leaks [7][8]. - There is a call for platforms to adhere to the principle of "minimum necessity" in data collection, ensuring that only essential user data is gathered to mitigate privacy risks [8].