大语言模型
Search documents
英伟达被起诉,用盗版训练大模型成行业潜规则?
Xin Lang Cai Jing· 2026-02-08 09:51
Core Viewpoint - Nvidia is facing a collective lawsuit regarding copyright infringement related to the use of data from "shadow libraries" for training its AI models, specifically the NeMo Megatron framework, which allegedly includes copyrighted works without permission [3][18]. Group 1: Lawsuit Details - The lawsuit was filed by five authors who claim Nvidia used a dataset from illegal "shadow libraries" to develop its next-generation language model [3][18]. - Nvidia submitted a motion on January 31, 2026, arguing that the plaintiffs failed to provide sufficient evidence of infringement and asserting that its actions fall under "fair use" [4][18]. - A hearing is scheduled for April 2, 2026, to review Nvidia's motion [4]. Group 2: Competitive Pressure - Internal records indicate that Nvidia faced competitive pressure from OpenAI, prompting it to acquire millions of pirated books from shadow libraries to showcase its technology at the 2023 developer conference [19][20]. - The lawsuit highlights that Nvidia provided tools and scripts to clients to facilitate the downloading of pirated datasets [19]. Group 3: Data Sources - Nvidia's NeMo Megatron models were reportedly trained on The Pile dataset, which includes a subset called Books3 sourced from the shadow library Bibliotik, containing approximately 190,000 books [21][22]. - Nvidia is accused of directly collaborating with the largest shadow library, Anna's Archive, to access millions of pirated books, totaling around 500TB of data [24][22]. Group 4: Industry Context - The rise of AI has led to increased litigation over training data copyright issues, with other companies like OpenAI, Anthropic, and Meta also facing similar lawsuits [20][28]. - The competitive landscape has intensified, with Nvidia's need for high-quality training data driving it to engage with shadow libraries, which offer easier access to vast amounts of data [21][27]. Group 5: Legal Precedents - Previous cases have seen significant settlements, such as Anthropic agreeing to pay at least $1.5 billion to settle a copyright infringement lawsuit, potentially setting a record for copyright damages [20][28]. - Courts have ruled on the fair use of copyrighted works for AI training, with some cases determining that using such works can be considered fair use under certain conditions [29][30].
苏炜杰获2026「统计学诺奖」考普斯奖,14年来首位华人得主
机器之心· 2026-02-07 04:09
机器之心编辑部 在时隔 14 年之后,有着「统计学诺贝尔奖」之称的考普斯奖(COPSS Presidents' Award),又一次迎来了华人得主。 2026 年考普斯奖颁给了「北大校友、现宾夕法尼亚大学副教授苏炜杰」。 奖项委员会给他的评语是 ,「为大语言模型的多项应用建立了严格的统计基础;在隐私保护数据分析方面取得突破性进展,并成功应用于 2020 年美国人口普查; 设计了 AI 顶级会议的同行评审机制,并于 ICML 2026 正式落地;在凸优化领域开展了奠基性研究;以及在深度学习的数学理论与高维统计推断方面作出了广泛而 深远的贡献。」 作为国际统计学和数据科学领域的最高荣誉,考普斯奖的地位相当于数学界的菲尔兹奖,每年只颁发给一位年龄在 40 岁以下的统计学家 。该奖项由五大顶级统计 学会(国际数理统计学会 IMS、美国统计学会 ASA、加拿大统计学会 SSC 及美国东西部生物统计学会 ENAR 与 WNAR)共同评选,旨在表彰对统计学理论、方 法或应用做出杰出贡献的学者。 在历史上,考普斯奖的获得者几乎都是后来定义了该领域的宗师级人物。 统计学是华人的优势学科,曾有多位华人获得考普斯奖,包括近期回国的 ...
蔡崇信复盘阿里AI:“早”做,不等于领先
3 6 Ke· 2026-02-07 02:22
Core Insights - Alibaba's Chairman, Joe Tsai, acknowledged that the company started working on Transformer models in 2019 but failed to allocate sufficient resources for their development until the launch of Tongyi Qianwen in 2023, marking a significant entry into the AI race [1][5][24] Group 1: Adoption - The first key point emphasized by Tsai is that AI must be used in practical scenarios to generate real value, not just developed as models [6][7] - The Tongyi App is crucial in Alibaba's AI strategy, serving not only as a user interface but also as a test for the AI's capabilities in real-world applications [8][11] - The unique characteristics of the Chinese market, such as the lower acceptance of enterprise software payment models compared to the U.S., necessitate alternative paths for AI adoption, making the Tongyi App a vital attempt to ensure real usage of models [9][10] Group 2: Scale - Tsai pointed out that AI investment is shifting focus from training to inference, with major tech companies increasing their capital expenditures from $60-80 billion to $120-150 billion annually [12][12] - Inference is identified as the main battleground for AI costs, as it is a daily requirement for users and businesses, unlike training which occurs less frequently [13][14] - The ability to handle high concurrency and maintain stability under load is crucial for scaling AI models, with Alibaba opting to deploy models on its own cloud infrastructure to control performance and throughput [15][16] Group 3: Open Source - Tsai advocates for open source as a practical choice rather than an idealistic one, driven by the commercial landscape and market conditions in China [17][18] - The primary value of open source is not cost but sovereignty, allowing companies and developers to have full control over their models [18][20] - Alibaba's strategy involves making Tongyi Qianwen open source while encouraging users to utilize Alibaba Cloud for training and inference, creating a commercial loop where infrastructure usage generates revenue [22][23]
代表建议重塑上海高考选拔模式,不让理工人才被“刷”在门外
Di Yi Cai Jing· 2026-02-07 01:52
Core Viewpoint - The article emphasizes the urgent need for reform in the high school academic evaluation system in Shanghai to enhance the focus on science education, particularly physics, in light of the increasing demand for STEM talent amid the ongoing Fourth Industrial Revolution [1][2]. Group 1: Current Education System Challenges - The current "3+3" model of the Shanghai high school academic level examination limits the differentiation of science subjects, particularly physics, which is crucial for modern engineering and technology [2][3]. - The implementation of a grading system has compressed the distinction in natural science subjects, affecting the admission ranking of top students compared to core subjects like Chinese, Mathematics, and English [2][3]. Group 2: Recommendations for Reform - The proposal includes establishing "Natural Science Foundation" as a core subject, making physics a mandatory subject with equal or near-equal weight to core subjects [4][5]. - It suggests reforming the evaluation and grading system to include enhanced grading methods, such as providing raw score references for key subjects during admissions [4][5]. - The examination design should be optimized to include more comprehensive application questions and experimental design tasks, allowing for a deeper assessment of critical thinking and scientific inquiry skills [5].
机构研究系列:03 江苏银行——从区域银行到系统重要性银行的跨越之路
Xin Lang Cai Jing· 2026-02-06 23:47
Core Viewpoint - Jiangsu Bank, as a systemically important bank in China, is undergoing significant strategic evolution to adapt to the changing banking landscape, focusing on high-quality development while maintaining its scale advantage [2][4][49]. Group 1: Background and Purpose - Jiangsu Bank is the largest legal person bank in Jiangsu Province and plays a crucial role in the development of urban commercial banks and the Chinese banking industry [2]. - The research aims to analyze Jiangsu Bank's strategic evolution since its establishment in 2007, focusing on its current strategic layout and future development direction [2]. Group 2: Strategic Evolution Process - **Foundation and Scale Breakthrough Period (2007-2015)**: Jiangsu Bank was established through the merger of ten urban commercial banks, focusing on resource integration and local market penetration, achieving total assets of over 1 trillion yuan by 2014 [3]. - **Structural Optimization and Innovation Breakthrough Period (2016-2020)**: The bank went public in 2016, shifting its focus from scale expansion to quality and efficiency, establishing a "four modernization" development vision [4][5]. - **High-Quality Development and Strategic Deepening Period (2021-Present)**: Jiangsu Bank has set five strategic goals for high-quality development, emphasizing value creation, customer service, and political integrity [6][7]. Group 3: Current Strategic Statements - **Top-Level Strategic Positioning**: Jiangsu Bank's mission is to enhance people's quality of life through innovative financial services, with a vision of becoming a leading bank characterized by intelligence, specialization, internationalization, and comprehensiveness [8]. - **Five Strategic Goals**: The bank aims to be the most valuable bank, a service leader, an intelligent innovator, an employee-satisfied bank, and a politically robust bank, with each goal supporting the others [9][10]. Group 4: Business Development Strategies - **Corporate Business**: Jiangsu Bank focuses on strengthening corporate business, particularly in manufacturing and infrastructure, achieving a corporate deposit balance of 14,197 billion yuan, a 22.20% increase year-on-year [11]. - **Retail Business**: The bank aims to expand its retail business, with retail AUM exceeding 1.59 trillion yuan, marking a historic high [12]. - **Financial Market Business**: Jiangsu Bank is enhancing its financial market capabilities, with financial investment assets reaching 18,833 billion yuan, a 23.38% increase [13]. - **Digital Financial Development**: The bank is accelerating its digital transformation, with significant advancements in AI applications and digital financial services [14]. - **Regional Layout**: Jiangsu Bank maintains a strong presence in Jiangsu and extends its services to major economic regions [15][16]. - **ESG Strategy**: The bank integrates ESG principles into its operations, actively participating in sustainable finance initiatives [17]. Group 5: Comparative Analysis - **Comparison with Similar Banks**: Jiangsu Bank's asset scale is 3.95 trillion yuan, ranking it among the top urban commercial banks, with a focus on balanced development across various financial sectors [29][30]. - **Strengths and Weaknesses**: Jiangsu Bank has notable advantages in asset quality and regional presence but faces challenges in retail business proportion and internationalization [37][41][42]. Group 6: Future Strategic Directions - **"15th Five-Year" Planning**: Jiangsu Bank is preparing for its next strategic phase, focusing on clearer positioning and practical development goals, with expected revenue growth rates of 6.9% to 7.6% from 2025 to 2027 [44][45]. - **Key Development Areas**: The bank aims to enhance its technology finance services, green finance initiatives, and wealth management capabilities, with projected growth in these areas [46].
互联网大厂抢人,年薪最高128万
21世纪经济报道· 2026-02-06 14:52
Core Viewpoint - The article discusses the intense competition among major internet companies, particularly Tencent, in attracting top AI talent through high salaries and innovative scholarship programs, highlighting the industry's talent scarcity and the strategic investments being made in AI research and development [1][4]. Group 1: Talent Acquisition Strategies - Tencent is actively recruiting AI talent with high salaries for various positions, such as over 750,000 yuan for user operation roles and nearly 1,000,000 yuan for AI application engineers [1]. - The "Qingyun Plan" is Tencent's initiative aimed at attracting top technical students globally, similar to ByteDance's Top Seed talent program [1]. - The "Qingyun Scholarship" offers significant financial incentives, including 500,000 yuan per recipient, to support students in AI and computer science fields [2]. Group 2: Investment in Research and Development - Tencent's R&D expenditure reached a record high of 22.82 billion yuan in Q3 2025, with a total of 61.983 billion yuan spent in the first three quarters of 2025 [4]. - The company emphasizes the importance of computational resources for top PhD students, providing cloud heterogeneous computing resources as part of the scholarship [4]. Group 3: Recruitment of Established Talent - Tencent is also accelerating the recruitment of established AI experts, as evidenced by the hiring of prominent figures like Pang Tianyu and Yao Shunyu, who have significant academic and industry experience [5]. - The establishment of new departments within Tencent, such as AI Infra and AI Data, aims to enhance its capabilities in large model research and development [5]. Group 4: Academic Collaboration and Knowledge Sharing - Tencent launched its technical blog to share research findings, marking a step towards increasing its academic influence and transparency in AI technology [6].
金智维拟港股上市 中国证监会要求补充说明股权变动等事项
Zhi Tong Cai Jing· 2026-02-06 12:42
Core Viewpoint - The China Securities Regulatory Commission (CSRC) has issued supplementary material requirements for four companies, including Jinzhihui, which is preparing for an IPO on the Hong Kong Stock Exchange, with specific requests for clarification on equity changes and shareholder situations [1][2]. Group 1: Regulatory Requirements - Jinzhihui is required to provide detailed explanations regarding its equity changes, including historical capital increases, share transfer prices, and compliance with capital contribution obligations [1]. - The company must clarify whether there have been any instances of shareholding representation and the progress of incentive grant record changes as of December 2025 [1]. - Jinzhihui is also asked to confirm the permanent residency status of its actual controller and provide updates on changes in partnership and shareholder identification [1][2]. Group 2: Business Model and Market Position - Jinzhihui specializes in providing AI digital employee solutions and enterprise-level intelligent agent solutions, aiding companies in their digital transformation through proprietary AI technologies [2]. - The company has achieved a leading market position in the AI digital employee solutions sector in China, holding the top market share from 2022 to 2024, particularly excelling in the financial services industry [3]. - As of June 30, 2025, Jinzhihui has deployed over 1.8 million AI digital employees across various sectors, serving more than 1,300 high-quality clients [3].
新股消息 | 金智维拟港股上市 中国证监会要求补充说明股权变动等事项
智通财经网· 2026-02-06 12:34
Group 1 - The China Securities Regulatory Commission (CSRC) has issued supplementary material requirements for four companies, including Jinzhihui, which is preparing for a listing on the Hong Kong Stock Exchange [1] - Jinzhihui is required to clarify its equity changes, shareholder status, and provide legal opinions from lawyers regarding its capital contributions and any potential issues [1][2] - The company specializes in providing AI digital employee solutions and enterprise-level intelligent agent solutions, aiding businesses in their digital transformation [2][3] Group 2 - Jinzhihui has maintained a leading market position in the AI digital employee solutions market in China from 2022 to 2024, holding the top market share for three consecutive years [3] - The company has deployed over 1.8 million AI digital employees across various industries, serving more than 1,300 high-quality clients as of June 30, 2025 [3]
Nature:首个能写综述论文的开源AI模型来了,大幅减少科研“幻觉”,堪比人类专家
生物世界· 2026-02-06 04:26
Core Viewpoint - The article discusses the development of OpenScholar, an AI assistant designed specifically for researchers to synthesize scientific literature accurately and efficiently, addressing the issue of "hallucination" in existing large language models [2][5][21]. Group 1: OpenScholar Overview - OpenScholar is a retrieval-augmented language model that can intelligently retrieve relevant paragraphs from 45 million open-access papers and generate comprehensive review papers with accurate citations [5][7]. - The model's citation accuracy is comparable to that of human experts and surpasses mainstream models like GPT-4o in multiple tests [5][11]. Group 2: Functionality and Workflow - OpenScholar operates through a three-step process: retrieval of relevant content, generation of answers with citations, and self-feedback for iterative improvement [7][9]. - The system is built on a dedicated data store (OpenScholar DataStore) that allows for transparent and reproducible research [7][21]. Group 3: Evaluation and Performance - The ScholarQABench benchmark was developed to assess AI systems' reliability in synthesizing scientific literature, featuring nearly 3,000 expert-written questions across various fields [12][13]. - OpenScholar demonstrated impressive results in the benchmark, outperforming GPT-4o in citation accuracy and overall usefulness, with human experts favoring OpenScholar's responses over those of GPT-4o [16][18][19]. Group 4: Implications for Research - The introduction of OpenScholar signifies a significant advancement in the application of AI in scientific research, potentially transforming literature reviews from a burdensome task into an efficient exploration process [21][23]. - Future developments may enhance OpenScholar's capabilities, making it a true collaborator for researchers, allowing them to focus more on innovation rather than information filtering [23].
RGF薪酬观察2026:中国大陆篇(英文版)
Sou Hu Cai Jing· 2026-02-06 03:17
今天分享的是:RGF薪酬观察2026:中国大陆篇(英文版) 报告共计:45页 RGF 2026中国大陆薪酬观察报告核心总结 《RGF 2026中国大陆薪酬观察报告》基于2025年1月至2026年1月期间50余万名候选人数据,聚焦五大行业薪酬趋势、就业市场动态及招聘核心挑战,为企 业薪酬策略制定与职场人士职业规划提供关键参考。报告中所指年度基本薪资均不含津贴、股票、奖金等额外福利。 2025年就业市场呈现"前三季度调整、第四季度复苏"的态势。招聘需求在前三季度温和收缩后,第四季度同比激增31%,彰显企业对2026年的战略信心;人 才供给全年保持稳健增长,第三季度增幅达32%,反映求职者信心与市场结构调整并行;面试接受率自年中起持续改善,第三、四季度分别同比上升14%和 7%,体现企业招聘意愿的强化。 薪酬趋势方面,2026年市场平均薪资增幅较2025年略有上升,折射经济与就业市场的稳定性。高需求岗位薪资涨幅显著,医疗行业销售、消费品行业新零售 相关岗位外部招聘薪资涨幅或达20%,自动驾驶领域算法专家更是有望达到30%。薪酬相关挑战中,"新聘员工薪资高于现有员工"的内部薪资倒挂问题最为 突出,占比33%,其次是薪 ...