数据标注
Search documents
给AI当老师是种什么体验
Xin Lang Cai Jing· 2026-01-10 09:09
Core Insights - The rapid development of artificial intelligence (AI) is heavily reliant on data annotation technology, with data annotators acting as "teachers" for AI systems [1] - The demand for high-quality datasets is increasing, leading to data annotation becoming a new career for young people [1] Group 1: Data Annotation Role - Data annotators are responsible for labeling various video and image information to ensure AI systems can accurately interpret data [1] - The work involves correcting AI-generated classification tags, which helps reduce the error rate of AI models over time [1] Group 2: Application in Industries - Annotated data is crucial for applications in intelligent driving, enhancing vehicle recognition of roads, obstacles, and traffic signs [1] - The output from data annotators supports nationwide smart applications, improving navigation accuracy and safety in automated driving [1] Group 3: Personal Development and Future Aspirations - Data annotators are continuously learning and adapting to new technologies to enhance their skills and improve the quality of their annotations [1] - There is a collective aspiration among data annotators to refine their work, ensuring AI systems better understand human needs and provide more effective solutions [1]
医疗数据“上架”,成果转化“上车”
Xin Hua Ri Bao· 2026-01-02 19:57
本报讯(记者盛文虎)近日,省肿瘤医院申报的"肿瘤围手术期麻醉镇痛数据集",获得江苏省数据知识产 权登记证书,并在省数交所上架。该院也成为全省首家数据价值变现的省级三甲医院。 "沉睡"在存储器中的医疗数据,变成了明码标价的资产,这笔"生意"的背后,是医院、政府和企业的一 次"联合破冰"——南京市玄武区知识产权局担纲"数据红娘",从制度层面化解医院的后顾之忧,江苏传 古科技扮演"技术保姆"角色,帮助医院将数据资产转化成"可买卖"的产品,省数交所发挥"流通枢纽"功 能,确保交易安全合规。 玄武区知识产权局相关负责人介绍,早在2024年,该区就邀请省肿瘤医院参加科学数据知识产权研讨 会,由此开启了长达一年半的"马拉松式"服务。从对接专业数据服务商,到组织专家团队"把脉会诊", 从优化方案、系统注册,到技术攻关、登记拿证,玄武区手把手服务属地单位,助力医院"跑通"了全流 程。 有了这张"身份证",省肿瘤医院的医疗数据变成了拥有知识产权的产品,医护人员可借此将数据转让成 果纳入科研转化体系。这些数据既是学术研究和课堂教学的素材,也是药企急需的商品。 "完成知识产权登记只是开始,让数据产生价值才是硬道理。"省肿瘤医院相关负 ...
甘肃首批持证残疾人AI训练师结业
Xin Lang Cai Jing· 2026-01-01 19:47
本报讯 (记者康劲)在兰州市"残疾人AI就业工坊"里,键盘敲击声绵密而平稳。屏幕上,一张张道路 图片中,车辆、行人的轮廓被精准勾勒出来。47岁的许小刚目光专注,熟练地进行着数据标注,"不用 出门,一双手、一台电脑在家就能创造价值,这为我的人生打开了全新的一扇窗。" 元旦前夕,甘肃省残疾人AI训练师(数据标注)培训班顺利结业,经过为期20天的系统学习,25名学 员掌握了从图像、文本到前沿3D点云标注的技能,其中首批19名学员成功考取了全国通用人工智能训 练师资格证书。 人工智能训练师自2020年被纳入国家职业分类目录后,因其对专注力、逻辑思维能力要求高,对体力要 求相对较低的特点,成为残疾人实现高质量就业的理想选择之一。本次培训聚焦人工智能产业链的基础 关键环节——数据标注,这正是让AI模型学会"看"和"理解"的基础工作。 "这不仅仅是机械劳动。"31岁的杨智文在培训中实现了技能跃升。他解释道,"我们标注的每一辆车、 每一个障碍物,都在帮助自动驾驶系统更准确地感知环境,从而保障未来的出行安全。" 甘肃省残疾人职业教育和就业服务中心相关负责人表示,这是全省残疾人职业技能培训向数字化、高端 化转型的重要探索,学员们 ...
时薪上千,大模型公司抢985文科生给AI当老师
吴晓波频道· 2025-12-09 00:29
点击上图▲立即购票 惊艳、尖叫和思考,都会出现在这场AI大秀上!12月28日在厦门,吴老师将通过一 场名为"AI闪耀中国"的科技人文秀,把他在2025年所看到的最新的人工智能成果用 视觉化的创意展现给大家。本次活动由吴晓波频道、优酷、七维动力、东南卫视联合 " 这是个需要高质量人文社科人才的岗位,因为只有最善于思考人与世界关系的人类,才能教会 AI 怎么更好的做一个人。 " 文 /巴九灵(微信公众号:吴晓波频道) 这篇文章开始之前,先邀请大家猜猜下面这份招聘要求对应的是什么岗位。 主办,自11月27日起,我们推出"半价票限量秒杀"活动,每日10张,先到先得, 【点击此处,立即购票】 吧 揭晓答案:这份看起来要求不低的工作,招聘的是AI数据标注员。在BOSS直聘上,这个岗位月薪最高接近两万元;部分岗位直接注明"重点大学 本硕博优先"。 通俗地说,数据标注员就是AI的老师,负责对文本、图像、音频等原始数据进行分类、标记或注释,从而教会机器识别、理解并学习人类世界的 逻辑和知识。 2020年起,"人工智能训练师"正式被纳入国家职业分类目录,"数据标注员"是其中的重要工种之一。据国家数据局,截至今年9月底,我国7个数 ...
探索跨境“来数加工”,东莞竞逐高端数据标注新赛道
2 1 Shi Ji Jing Ji Bao Dao· 2025-12-05 06:27
Core Insights - The establishment of the Dongguan Data Annotation Industrial Park marks a significant step in enhancing the data annotation industry, which is crucial for AI model training and applications in advanced fields like autonomous driving [1][2] - Dongguan is positioning itself as a hub for high-end data annotation, leveraging its industrial strengths and aiming to attract over 50 data companies and create more than 30 high-quality datasets within three years [2][6] - The data annotation industry is evolving from labor-intensive processes to high-tech, knowledge-intensive applications, with a growing demand for skilled data annotators [3][4] Industry Overview - Data annotation is essential for AI systems, with data, algorithms, and computing power being the three core elements [1] - The industry is transitioning from simple manual annotation to complex, high-value applications, particularly in industrial manufacturing, which is currently a national shortfall [2][4] - The demand for high-quality, specialized data annotation is increasing, especially with the rise of large AI models and the need for precise, efficient data processing [4][5] Regional Development - Dongguan is actively developing its AI application pilot base and data industry cluster, focusing on high-quality data annotation to extract value from vast industrial data [1][6] - The Dongguan Data Annotation Industrial Park is supported by significant investments and partnerships with major companies like Baidu and China Telecom, aiming to create a comprehensive data annotation ecosystem [6][8] - The region benefits from a rich talent pool, with approximately 176,500 university students and over 20,000 graduates in AI and big data fields annually [7] Strategic Initiatives - The park aims to provide substantial support to enterprises through rent reductions and talent subsidies, fostering collaboration with local industries [5][6] - The establishment of specialized data annotation bases by Baidu and China Telecom is set to enhance the capabilities of local companies in high-end data annotation [6][8] - The introduction of advanced technologies and platforms for data annotation is expected to create a differentiated, intelligent, and high-level data annotation capacity in Dongguan [8]
山西大同:书写推动高质量发展的“三张答卷”
Ren Min Ri Bao· 2025-11-23 22:52
Group 1: Energy Transformation - Datong is undergoing a significant transformation from a traditional energy reliance to a diversified, clean, and efficient energy system, maintaining an annual coal production of over 150 million tons [2] - The city has established 14 intelligent coal mines, with advanced production capacity exceeding 85%, and has completed ultra-low emission upgrades for 9 coal-fired power plants [2][3] - Renewable energy and new energy installed capacity in Datong has surpassed 10 million kilowatts, accounting for over 56% of the total, positioning it as a leader in Shanxi Province [3] Group 2: Digital Infrastructure Development - Datong is rapidly developing its computing power industry, with a total investment exceeding 70 billion yuan in the computing power ecosystem, and operational servers reaching 745,000 [5] - The city has established a national-level data labeling base, achieving a labeling accuracy of 100% and generating a data labeling industry output value of 750 million yuan [5][6] - The electricity consumption of computing centers in Datong has reached 4.38 billion kilowatt-hours from January to September this year, with an expected annual consumption exceeding 6 billion kilowatt-hours [5] Group 3: Cultural Revitalization - Datong is blending historical preservation with modern innovation, attracting 1.52 million visitors during the National Day and Mid-Autumn Festival, showcasing its cultural heritage [7][8] - The city has hosted various cultural activities, including drone performances and symphonic concerts, enhancing its cultural atmosphere and engaging tourists [8] - Datong's rich historical background, including the Yungang Grottoes and Hanging Monastery, contributes to its unique cultural identity and confidence [8][9]
东北三省共建数据标注产业集群
Liao Ning Ri Bao· 2025-11-23 00:48
Core Insights - The Northeast region of China aims to establish a globally competitive data annotation industry cluster through collaboration among Liaoning, Jilin, and Heilongjiang provinces, focusing on innovation and high-quality development [1][2] - Data annotation is identified as a critical process in AI training, transforming raw data into usable formats, likened to refining crude oil into gasoline [1] Group 1: Industry Development - A high-quality development seminar for the data annotation industry was held in Shenyang, emphasizing the need for a collective brand for "Northeast Data Annotation" [1] - The region plans to create a data annotation solution consortium to enhance collaborative development and innovation within the industry [2] Group 2: Current Industry Status - Shenyang has annotated over 8323 TB of data, developed 134 high-quality datasets, and engaged in 76 large model applications, contributing to national and industry standards [1] - The data annotation industry in Shenyang has seen the establishment of 65 companies, employing over 11,800 people, with an industry scale of approximately 2.59 billion yuan [1] Group 3: Future Plans - The Northeast region will adopt a professional, intelligent, and international approach to build a distinctive and complementary regional industrial cluster [2] - The data annotation solution consortium will integrate resources to provide comprehensive, high-value solutions for national clients, addressing various industry upgrade needs [2]
建设高质量数据集,江苏势在必行、必须先行
Xin Hua Ri Bao· 2025-11-06 08:16
Core Insights - The "2025 National High-Quality Data Set and Data Annotation Industry Supply and Demand Matching Conference" held in Nanjing successfully attracted over 500 companies and resulted in more than 90 collaborations with a transaction value exceeding 900 million yuan [1] - Jiangsu province aims to leverage its rich data resources to enhance the construction of high-quality data sets, which is essential for seizing opportunities in artificial intelligence development [1][2] - The definition of high-quality data sets varies across industries, but they must meet the training needs of AI large models [2] Industry Overview - Jiangsu has established 321 high-quality data sets across key sectors such as healthcare, transportation, industry, energy, and cultural tourism, with a total data scale exceeding 93PB [1] - The province has implemented a "1+N" policy framework to optimize the environment for artificial intelligence development, focusing on collaboration between supply and demand enterprises [2][7] Challenges in Data Annotation - Data annotation is crucial for AI development, requiring specialized knowledge and skills, particularly in complex fields like medical data [3][4] - The industry faces challenges such as insufficient data supply and a lack of skilled data annotators, which hinder the progress of large models in niche areas [4] Cost Considerations - The high costs associated with data storage and processing are significant challenges for companies, with many high-quality data sets being discarded due to storage expenses [5][6] - Companies are exploring solutions like establishing cold storage centers in less developed regions to reduce costs associated with data storage [5] Financial Support and Standards - The data industry is knowledge and capital-intensive, with a significant portion of costs tied to acquiring raw data [6] - Financial institutions are encouraged to provide support for data collection and annotation, potentially through innovative financing models [6] - The establishment of standards for high-quality data sets is underway, with guidelines and quality assessment protocols being developed to address current challenges [6]
3位00后,估值700亿
3 6 Ke· 2025-10-28 12:09
Core Insights - Mercor, an AI recruitment startup, has raised $250 million in new funding, achieving a valuation of $10 billion, which is five times its previous valuation of $2 billion earlier this year [1][3] - Founded in 2023 by three college dropouts, Mercor has developed a large professional talent network and has seen its annual recurring revenue grow from $1 to $500 million in just 17 months [1][3] Company Overview - Mercor specializes in AI-driven recruitment, utilizing AI to screen resumes and match candidates to job positions quickly [3][5] - The company has expanded its services to include data annotation and large model evaluation, leveraging its extensive network of 30,000 experts [3][9] - The startup's revenue has quadrupled since the turmoil at Scale AI, a competitor, leading to an influx of Scale's former employees and clients [13][14] Business Model and Revenue - Mercor's annual recurring revenue reached $70 million by February, driven by its new business in large model evaluation [3][9] - The company manages a network of experts who can earn significant daily wages, with total earnings exceeding $1.5 million daily [9][10] - The new funding will be allocated to expanding the talent network, enhancing the matching system, and improving delivery speed [3][4] Competitive Landscape - Mercor's main competitor, Scale AI, faced challenges after being acquired by Meta, which led to concerns about data neutrality and client trust [13][14] - The controversy surrounding Scale AI has inadvertently benefited Mercor, resulting in a significant increase in its revenue and client base [14][15] Future Prospects - Mercor's AI-driven recruitment model has positioned it as a key player in the large model evaluation space, filling a critical gap in the industry [15][16] - The company aims to continue leveraging its talent network to support the growing demand for high-quality data and expert feedback in AI model development [16]
泰安打造全流程数据标注生态圈
Da Zhong Ri Bao· 2025-10-27 03:26
Group 1 - The article highlights the emergence of Xiaohongshu as a platform for young people to explore diverse interests, supported by precise data annotation and review processes [1] - Shandong Feilixin Digital Technology Co., Ltd. is collaborating with major companies like Tencent and Alibaba to enhance content operations through its technological advantages [1] - Data annotation involves labeling raw data such as images, text, audio, and video, enabling machine learning models to understand and learn from the data [1] Group 2 - The data annotation industry in Tai'an has developed a solid foundation, characterized by a leading enterprise, two major annotation clusters, and a complete industrial chain [2] - Taiying Technology is identified as a leader in the digital middle and back-office operation service industry, rapidly expanding its presence in the data annotation market [2] - Tai'an has gathered over 30 data annotation companies, forming a comprehensive industrial chain from upstream data collection and governance to downstream AI training and application [2]