大语言模型
Search documents
大模型的2025:6个关键洞察
腾讯研究院· 2025-12-23 08:33
Core Insights - The article discusses a significant paradigm shift in the field of large language models (LLMs) in 2025, moving from "probabilistic imitation" to "logical reasoning" driven by the maturity of verifiable reward reinforcement learning (RLVR) [2][3] - The author emphasizes that the potential of LLMs has only been explored to less than 10%, indicating vast future development opportunities [3][25] Group 1: Technological Advancements - In 2025, RLVR emerged as the core new phase in training LLMs, allowing models to autonomously generate reasoning traces by training in environments with verifiable rewards [7][8] - The increase in model capabilities in 2025 was primarily due to the exploration and release of the "stock potential" of RLVR, rather than significant changes in model parameter sizes [8][9] - The introduction of the o1 model at the end of 2024 and the o3 model in early 2025 marked a qualitative leap in LLM capabilities [9] Group 2: Nature of Intelligence - The author argues that LLMs should be viewed as "summoned ghosts" rather than "evolving animals," highlighting a fundamental difference in their intelligence compared to biological entities [10][11] - The performance of LLMs exhibits a "sawtooth" characteristic, excelling in advanced fields while struggling with basic common knowledge [12][13] Group 3: New Applications and Interfaces - The emergence of Cursor represents a new application layer for LLMs, focusing on context engineering and optimizing prompt design for specific verticals [15] - The introduction of Claude Code (CC) demonstrated the core capabilities of LLM agents, operating locally on user devices and accessing private data [17][18] - The concept of "atmospheric programming" allows users to create powerful programs using natural language, democratizing programming skills [20][21] Group 4: Future Directions - The article suggests that the future of LLMs will involve a shift towards visual and interactive interfaces, moving beyond text-based interactions [24] - The potential for innovation in the LLM space remains vast, with many ideas yet to be explored, indicating a continuous evolution in the industry [25]
招商基金吴松凯:积极破解三大矛盾,推动财富管理可持续发展
Zhong Guo Jin Rong Xin Xi Wang· 2025-12-23 07:40
转自:新华财经 基于上述思考,吴松凯介绍了招商基金构建长期竞争力的具体实践。在顶层设计上,公司在投顾业务创 立之初便确立了独立的考核体系,着重关注客户盈利体验、复购率等,引导团队真正站在客户视角。在 客户服务层面,招商基金已构建起一套多层次、高频更新的客户画像体系,为开展精准化与个性化服务 奠定了数据基础。在科技应用方面,公司较早布局智能投顾,认为科技不仅是提升运营效率、降低成本 的工具,更是实现高质量、个性化服务的有效手段。 吴松凯特别谈到,生成式AI等技术的突破是近年来科技赋能财富管理领域显著的变量,有望从根本上 改变行业的服务模式。他分析说,过去专业而有温度的深度服务受限于高水平理财师的个人服务边界与 成本,大多仅能覆盖高净值客户。随着大语言模型等技术的发展,未来高质量的个性化投教与陪伴将变 得更具可得性,能够普惠至更广泛的客户群体。 吴松凯认为,对个体机构而言,主动推动解决行业共性矛盾的过程,本身就是构建自身长期竞争力的过 程。面对费率下行与模式重构的挑战,坚定长期主义、深度践行客户立场、积极将前沿科技转化为服务 能力,将是财富管理机构实现可持续发展、赢得未来的坚实路径。 在公募基金费率改革持续深化、 ...
中国工商银行刘承岩:2026年,企业进入大规模智能产品化新阶段
Xin Lang Cai Jing· 2025-12-23 06:50
Core Insights - The 22nd China International Financial Forum was held in Shanghai on December 19-20, focusing on building an intelligent financial ecosystem in the digital economy era [1][3] - Liu Chengyan, a senior fintech expert from the Industrial and Commercial Bank of China, emphasized that 2025 will be the year of intelligent agents, marking a new phase in large-scale intelligent productization with the release of major models like GPT-5 and Qianwen-3 [1][3] Group 1: AI and Intelligent Agents - Companies need to advance their AI+ initiatives by transitioning IT architecture from cloud-native to intelligent-native, integrating computing power, data, algorithms, strategies, and applications into a cohesive framework [1][3] - The bank has established an intelligent agent platform accessible to all employees, promoting widespread AI innovation across the organization [1][3] Group 2: Challenges in Implementation - Six key challenges must be addressed for the high-quality application of intelligent agents by 2026: 1. **Computing Power**: Focus on heterogeneous computing power integration, training and inference unification, and resource pooling [2][4] 2. **Algorithms**: Develop enterprise-specific models through the integration of large and small models, creating a model matrix and baseline for iterative evolution [2][4] 3. **Data Capabilities**: Build knowledge engineering, context engineering, and prompt engineering capabilities, while establishing a governance system for enterprise-level knowledge sets [2][4] 4. **Intelligent Agents**: The platform must possess memory capabilities and adhere to methodologies for constructing native intelligent agents [2][4] 5. **Security**: An integrated security system covering model, data, and network security is crucial, especially for customer-facing applications [2][4] 6. **Talent Development**: Accelerate the training of new types of talent such as computing power engineers, knowledge engineers, algorithm engineers, intelligent agent engineers, and prompt engineers [2][4]
三季度收入超5000万美元、70%来自海外,中国AI独角兽拟港股上市
Sou Hu Cai Jing· 2025-12-23 04:21
Core Insights - MiniMax, a domestic AI model unicorn, has received approval from the China Securities Regulatory Commission and passed the Hong Kong Stock Exchange hearing, planning to go public in January 2026 [2] - Founded in November 2021, MiniMax focuses on general artificial intelligence (AGI) and has differentiated itself from competitors by pursuing a "model + product" dual approach [2] - The company has raised significant funding, including nearly $390 million in a Series C round, achieving a post-money valuation exceeding $4 billion [2] - MiniMax's revenue for the first nine months of 2025 is projected to reach $53.44 million, showing substantial growth from previous years [3] Financial Performance - Revenue for 2023, 2024, and the first nine months of 2025 is reported as $3.46 million, $30.52 million, and $53.44 million respectively [3] - The company has incurred significant losses, with net losses of $73.73 million in 2022, $269.25 million in 2023, and projected losses of $512.01 million in 2025 [4] - Adjusted net losses from 2022 to the first nine months of 2025 are $12.15 million, $89.07 million, $244.24 million, and $186.28 million respectively [4] Product and Market Strategy - MiniMax operates with a dual focus on model development and product offerings, including large language models and video generation models [5] - The company has launched several products, with over 71% of its revenue coming from C-end subscriptions in the first nine months of 2025 [5] - MiniMax's overseas revenue accounts for over 70% of total revenue, with North America, Southeast Asia, and Europe as key markets [5] User Engagement and Growth - MiniMax's AI products have served over 212 million individual users and more than 100,000 enterprise clients across over 200 countries [18] - The average monthly active users increased from 3.15 million in 2023 to 27.64 million in the first nine months of 2025 [18] - The number of paying users grew from 119,800 in 2023 to over 1.77 million by the first nine months of 2025 [18] Competitive Landscape - MiniMax's Talkie application has shown significant growth, with revenue contributions increasing from 21.9% in 2023 to 63.7% in 2024 [9] - The company faces competition in the AI companion space, necessitating continuous product iteration and compliance with regulatory standards [11] - MiniMax's Hailuo AI has also emerged as a strong revenue contributor, with $17.46 million in revenue in the first nine months of 2025 [12] Investment and Leadership - Major investors include Alibaba, Tencent, and MiHoYo, with Alibaba holding a 15.04% stake [18] - Key leadership includes non-executive directors from Alibaba and MiHoYo, indicating strong strategic oversight [19]
深扒特斯拉ICCV的分享,我们找到了几个业内可能的解决方案......
自动驾驶之心· 2025-12-23 00:53
Core Insights - The article discusses Tesla's end-to-end autonomous driving solution, highlighting the challenges and innovative solutions developed to address them [3] Group 1: Challenges and Solutions - Challenge 1: Curse of dimensionality, requiring breakthroughs in both input and output layers to enhance computational efficiency and decision accuracy [4] - Solution: UniLION, a unified autonomous driving framework based on linear group RNN, efficiently processes multi-modal data and eliminates the need for intermediate perception and prediction results [4][7] - UniLION's key features include a unified 3D backbone network and the ability to handle various tasks simultaneously, achieving significant performance metrics such as 75.4% NDS and 73.2% mAP in detection tasks [11] Group 2: Interpretability and Safety - Challenge 2: The need for interpretability and safety guarantees in autonomous driving systems, which traditional models struggle to provide [12] - Solution: DrivePI, a unified spatial-aware 4D multi-modal large language model (MLLM) framework that integrates visual and language inputs to enhance system interpretability and safety [13][14] - DrivePI demonstrates superior performance in 3D occupancy prediction and trajectory planning, significantly reducing collision rates compared to existing models [13][17] Group 3: Evaluation - Challenge 3: The complexity of evaluating autonomous driving systems due to the unpredictability of human driving behavior and diverse interaction scenarios [18] - Solution: GenieDrive, a world model framework that uses 4D occupancy representation to generate physically consistent multi-view video sequences, enhancing the evaluation environment for autonomous systems [21][22] - GenieDrive achieves a 7.2% improvement in mIoU for 4D occupancy prediction and reduces FVD metrics by 20.7%, establishing new performance benchmarks [21][27] Group 4: Integrated Ecosystem - The three innovations—UniLION, DrivePI, and GenieDrive—form a synergistic ecosystem that enhances perception, decision-making, and evaluation in autonomous driving [30][31] - This integrated approach addresses key challenges in the industry, paving the way for safer, more reliable, and efficient autonomous driving systems, ultimately accelerating the transition to L4/L5 level autonomy [31]
腾讯从OpenAI、字节抢人才,加速AI破局
Tai Mei Ti A P P· 2025-12-22 09:49
近日,腾讯官宣人事任命和组织调整两条重要消息,在AI圈内引起广泛关注。 文 | 新质动能,作者 | 沛林,编辑 | 沐风 27岁的前OpenAI研究员姚顺雨(Vinces Yao)正式出任腾讯"CEO/总裁办公室"首席AI科学家,同时身兼 AI Infra部与大语言模型部的负责人,向总裁刘炽平和技术工程事业群总裁卢山双线汇报。 而姚顺雨,这位以研究"AI智能体"和提出"AI进入下半场"观点闻名的年轻人,会成为那个为腾讯找到破 局点的人吗? 技术明星的"中场"思考 事实上,姚顺雨在今年9月就已入职,这次只是正式公开这一消息。 有趣的是,3个月前,关于"腾讯上亿年薪聘请前OpenAI研究员"的传言已在行业内流传,腾讯官方出面 辟谣。如今看来,当时辟谣的焦点或许是薪酬数额,而人才落地的本身,早已悄然进行。 翻开姚顺雨的履历,是一个标准的"精英故事": 安徽理科探花、清华姚班、普林斯顿博士、师从GPT-1核心作者。他的学术轨迹,几乎与AI智能体研究 的关键进展同步。 在OpenAI期间,他深度参与的研究项目,旨在解决AI从"聊天"走向"做事"的范式转变:通过"思考—行 动—观察"的循环,让大模型学会使用工具;用"思维 ...
AI 语音输入法,正在偷偷挤走「键盘」
3 6 Ke· 2025-12-22 09:03
Core Insights - The article discusses the evolution of input methods, particularly the shift from traditional keyboard typing to voice input, highlighting the advantages of using AI-driven applications for voice-to-text conversion [3][5][18]. Group 1: Voice Input Technology - The emergence of AI applications has significantly improved voice-to-text functionality, making it more efficient and user-friendly compared to traditional input methods [5][18]. - Typeless is identified as a leading voice input tool that excels in understanding user intent and providing formatted outputs, thus enhancing the overall user experience [9][11][14]. Group 2: User Experience and Efficiency - Users report a marked increase in efficiency when using voice input, as it allows for more natural communication without the constraints of typing [23][26]. - The ability of Typeless to adapt its output based on the context of the application being used is a notable feature, allowing for a more tailored interaction [16][18]. Group 3: Market Dynamics and Concerns - There are concerns regarding the potential for larger companies to develop similar or superior voice input technologies, which could threaten the existence of third-party tools like Typeless [20][21]. - The competitive landscape is further complicated by the presence of free local models that may offer sufficient functionality, raising questions about the long-term value proposition of paid services like Typeless [21][19]. Group 4: Future of Input Methods - The article posits that the traditional keyboard may become less relevant as voice input technologies continue to evolve and gain acceptance, potentially leading to a paradigm shift in how users interact with devices [23][26]. - The integration of voice input capabilities at the operating system level could redefine user interactions, making voice the primary mode of communication with technology [29].
大模型的2025:6个关键洞察,来自OpenAI创始人、AI大神“AK”
3 6 Ke· 2025-12-22 04:22
Core Insights - The report by Andrej Karpathy highlights a significant paradigm shift in the field of large language models (LLMs) from "probabilistic imitation" to "logical reasoning" in 2025, driven by the maturation of Reinforcement Learning with Verifiable Rewards (RLVR) [1][2] - The industry is at a critical juncture, transitioning from "simulating human intelligence" to "pure machine intelligence," with a focus on how to make AI think efficiently rather than just competing on computational power [2][4] Group 1: Technological Advancements - RLVR has emerged as the core new phase in LLM training, allowing models to autonomously generate reasoning traces by training in environments with verifiable rewards [4][5] - The year 2025 saw a significant extension in the training cycles of LLMs, with the ability to optimize for longer reasoning traces and increased "thinking time," leading to qualitative leaps in model capabilities [5][6] Group 2: Nature of Intelligence - Karpathy argues that LLMs should be viewed as "summoned ghosts" rather than "evolving animals," indicating a fundamental difference in the nature of AI intelligence compared to biological intelligence [6][7] - The performance of LLMs exhibits a "zigzag" characteristic, excelling in specialized areas while struggling with basic common knowledge, reflecting their unique intelligence structure [8] Group 3: New Applications and Interfaces - The emergence of applications like Cursor signifies a new layer in LLM usage, focusing on context engineering and optimizing the orchestration of multiple LLM calls for specific vertical domains [9][10] - The introduction of Claude Code (CC) demonstrates the potential of LLM agents to operate locally on user devices, accessing private data and providing a new paradigm of AI interaction [10][11] Group 4: Programming and Development - The concept of "vibe coding" has gained traction, allowing individuals to create powerful programs using natural language, thus democratizing programming skills beyond trained professionals [11][12] - The shift towards atmosphere programming is expected to transform the software development ecosystem, making coding more accessible and flexible for everyday users [12][13] Group 5: Future Prospects - Despite the rapid advancements, the industry has only tapped into less than 10% of the potential of LLMs, indicating vast opportunities for future exploration and innovation [14][15] - The report emphasizes the need for foundational work to continue alongside the rapid development of LLM technologies, suggesting a sustained period of transformation ahead [14][15]
科技日报:不要向AI让出你的“语言权”
Ke Ji Ri Bao· 2025-12-22 00:25
AI可以极大程度地解放生产力,尤其在科研领域,它可以让繁冗的数据分析等工作大幅提速,让科研 人员将更多精力投入到创造性环节,进而提高科研效率。但对于AI向普通生活的入侵,尤其对其经由 语言系统"侵蚀大脑"的趋势,则需保持批判性审视的态度。 我们每个人都应持有一份清醒和自觉,和AI保持一定距离,始终把它圈定在"工具角色"的范围之内。我 们可以善用它在检索和整合上的优长,甚至将其作为参考启发、润色校正的助手,但不能逾越"雷池", 让其直接代笔成为"脑替"。 未来,AI势必会接手人类更多的琐碎事务,成为工作、生活的参与者,但不应成为"主导者"。具体到语 言应用上,我们不应向AI交出自己的"语言权"。让技术的归技术、文化的归文化,或许才是我们的正确 选择。 (文章来源:科技日报) AI的普及是迅速的,"开源"之后,AI生成的文本大量流入社会生活。无论是常用文书,还是社交平台、 网络电商的文本文案,文风都为之一变。 这是一种什么样的语言呢?人们一定很难忘记初见它时的惊讶:结构严谨,语法完美,逻辑流畅,辞采 斐然,术语翻飞,比喻排比迭出……但这种没有瑕疵的语言慢慢传递出一种冰冷的"工业味":模式化、 套路化,如同一种新型 ...
壁仞科技拟全球发售2.48亿股 引入启明创投、南方基金等基石投资者
Zhi Tong Cai Jing· 2025-12-21 23:23
Core Viewpoint - The company, Wallen Technology (06082), is set to launch an IPO from December 22 to December 29, 2025, aiming to issue 248 million H-shares, with a price range of HKD 17.00 to HKD 19.60 per share, and plans to start trading on January 2, 2026 [1]. Group 1: Company Overview - Wallen Technology develops General-Purpose Graphics Processing Unit (GPGPU) chips and intelligent computing solutions based on GPGPU technology, providing essential computing power for artificial intelligence (AI) [1]. - The company's solutions integrate self-developed GPGPU-based hardware and proprietary BIRENSUPA software platform, supporting AI model training and inference across a wide range of applications from cloud to edge [1]. - Wallen Technology's GPGPU-based solutions demonstrate strong performance and efficiency in pre-training, post-training, and inference of large language models (LLMs), giving the company a competitive edge in the domestic market [1]. Group 2: Market Demand and Financials - The rapid development of AI, particularly LLMs and generative AI, has led to an increasing demand for computing solutions among enterprises to meet their surging needs for computing power and AI applications [2]. - The company has developed a specialized technology product, an intelligent computing solution consisting of a hardware system based on its GPGPU architecture and the BIRENSUPA computing software platform, which can be delivered in large-scale intelligent computing clusters [2]. - In 2023, the company's intelligent computing solutions began generating revenue, with 14 and 12 clients contributing RMB 336.8 million and RMB 58.9 million in revenue for the fiscal years ending December 31, 2024, and June 30, 2025, respectively [2]. Group 3: IPO and Use of Proceeds - Assuming an offering price of HKD 18.30 per share and no exercise of the over-allotment option, the net proceeds from the global offering are estimated to be approximately HKD 4.3506 billion [2]. - Approximately 85% of the proceeds will be allocated for future research and development of the company's intelligent computing solutions, 5% for commercialization, and 10% for working capital and general corporate purposes [2]. Group 4: Cornerstone Investors - The company has entered into cornerstone investment agreements, with cornerstone investors agreeing to subscribe for shares amounting to USD 372.5 million under certain conditions [3]. - Notable cornerstone investors include 3W Fund Management Limited, Qiming Venture Partners, and various insurance and investment firms [3].