大语言模型
Search documents
七牛智能涨超5% 七牛云AI推理平台上新GPT-OSS 相关模型可通过控制台或API快速调用
Zhi Tong Cai Jing· 2025-08-11 02:53
Group 1 - Qiniu Intelligent (02567) has seen a stock price increase of over 5%, with a cumulative rise of more than 50% over the past month, currently trading at 1.48 HKD with a transaction volume of 2.0961 million HKD [1] - OpenAI has released a new open-source language model series called GPT-OSS, which includes two models: GPT-OSS-120b and GPT-OSS-20b, marking the first open-source model release since GPT-2 in 2019 [1] - The GPT-OSS models are designed as community general-purpose language models, featuring key capabilities such as function calling, tool invocation, and structured output, suitable for building agent architectures, knowledge Q&A, and RAG retrieval generation scenarios [1] Group 2 - Qiniu Cloud has promptly completed the deployment and tuning of the GPT-OSS models, which are now integrated into the Qiniu Cloud model marketplace, allowing developers to quickly access them via console or API without local deployment [1]
中金《秒懂研报》 | AI赋能玩具:开启情感陪伴新纪元
中金点睛· 2025-08-10 01:08
Core Viewpoint - The article discusses the evolution and market potential of AI toys, highlighting their ability to provide emotional interaction and companionship through advanced technologies like large language models and multimodal interaction [4][7][18]. Group 1: Evolution of AI Toys - AI toys are not just simple toys; they utilize advanced technologies to engage in natural conversations and emotional interactions with users [7]. - The variety of AI toys ranges from small AI accessories to plush toys and comprehensive companion robots, catering to different demographics including children, young adults, and the elderly [7]. - The development of AI toys has progressed from concept to reality, with notable examples like Sony's AIBO and various products in China that leverage AI breakthroughs to offer high cost-performance [7][8]. Group 2: Drivers of AI Toy Demand - Changing modern lifestyles have created new consumer demands, such as the need for educational and companionship products for children and emotional support for the elderly [8]. - Key technological advancements, including the development of large language models and multimodal interaction technologies, have made AI toys feasible [8]. - The improvement in AI chip miniaturization and cost reduction, along with enhanced cloud computing capabilities, supports the continuous learning and functionality of AI toys [8]. Group 3: Market Outlook and Competitive Advantages - The ongoing technological evolution and diverse consumer needs are creating significant market opportunities for AI toys [11]. - The core competitive advantage of AI toys lies in their ability to engage in natural conversations and understand children's language, heavily relying on advanced language models and interaction technologies [11]. - The presence of well-known IP characters can attract consumers and enhance product appeal, although the alignment between IP and product is crucial [13]. Group 4: Future of AI Toys - Future advancements in AI technology are expected to lead to significant improvements in functionality and performance, enhancing user experience and expanding market size [17]. - However, the market also faces challenges, including concerns over children's information security and privacy, as well as the potential impact on social skills and emotional development [17]. - The AI toy industry is still in its early stages, with a low global market penetration rate, indicating substantial growth potential, with projections suggesting the market could reach $60 billion by 2033 [15].
刘若鹏称形成5大超材料生产基地 光启制造大纲字数多达38亿
Shen Zhen Shang Bao· 2025-08-09 18:33
Core Viewpoint - Shenzhen Guoke Technology Co., Ltd. has made significant advancements in the field of metamaterials, achieving a transition from laboratory research to large-scale production, which has been applied extensively in advanced aerospace equipment [1][2]. Group 1: Company Achievements - Over the past 15 years, the company has completed 117,200 design drawings, built 545,000 digital simulation models, and written 7.74 billion words of metamaterial design documentation and 38.28 billion words of manufacturing guidelines [1]. - The company has developed and designed 13.31 million lines of source code, accumulated 2.2 million measured imaging data, and created 8 million measured curves [1]. - The company has established a comprehensive industrial chain layout, including 1 headquarters, 5 production bases, 7 capability platforms, 8 specialized companies, and 1,919 upstream and downstream supporting entities [2]. Group 2: Strategic Development - The company has adopted a "deep vertical" industry development strategy, focusing on large-scale core material development to systematically promote vertical integration of the metamaterial industry chain [3][4]. - The company has applied for over 6,000 patents, with more than 4,000 granted, making it a leader in patent applications in the metamaterials field globally [4]. Group 3: Future Outlook - The company plans to establish new research institutions to support disruptive innovations and develop new tools for micro-material design and manufacturing [6]. - The company is considering the development trends of metamaterials 5.0, aiming to integrate more sensor technologies and next-generation semiconductor technologies into future products [6][7].
ARPO:智能体强化策略优化,让Agent在关键时刻多探索一步
机器之心· 2025-08-09 06:02
Core Viewpoint - The article introduces a novel method called Agentic Reinforced Policy Optimization (ARPO), designed to enhance the performance of large language models (LLMs) in multi-round interactions by addressing the challenges of uncertainty and exploration during tool usage [3][41]. Group 1: Research Motivation and Background - The emergence of Agentic Reinforcement Learning (RL) is driven by the need for LLMs to engage in dynamic multi-round interactions with external tools, moving from static problem-solving to a more interactive agent-environment reasoning paradigm [8]. - Existing Agentic RL methods often underestimate the value of multi-round interactions due to sparse rewards and overuse of tools, leading to a lack of fine-grained exploration of tool usage [8][41]. - The study identifies a significant increase in entropy (uncertainty) after tool calls, indicating an opportunity for exploration that current methods do not fully leverage [14][16]. Group 2: ARPO Methodology - ARPO introduces an entropy-driven adaptive rollout strategy that enhances exploration during high-entropy tool usage phases, allowing for more diverse reasoning paths [11][20]. - The method includes four key steps: initialization of global rollout, monitoring entropy changes, adaptive branching based on entropy, and defining termination conditions for the rollout process [24][27]. - ARPO incorporates advantage attribution estimation to help the model better internalize the value differences in tool usage at each step [28][30]. Group 3: Experimental Results - ARPO outperforms existing sample-level RL methods, achieving better performance with only half the tool call budget across 13 challenging benchmarks, demonstrating its efficiency in training multi-round reasoning agents [21][41]. - The method shows consistent improvements in performance metrics such as Pass@3 and Pass@5, particularly in dynamic, multi-round tasks [37][39]. - In comparative tests, ARPO achieves higher accuracy than GRPO and DAPO in various tasks, including deep search and knowledge-intensive reasoning [41][42]. Group 4: Future Directions - Future research may explore the application of ARPO in multi-modal tasks, expanding its capabilities beyond text-based reasoning to include images and videos [42]. - There is potential for integrating a broader range of external tools to enhance complex task performance through optimized tool usage strategies [42]. - The scalability and real-time deployment of ARPO in larger models and dynamic environments could further improve its practical value and cost-effectiveness [42].
给自动驾驶感知工程师的规划速成课
自动驾驶之心· 2025-08-08 16:04
Core Insights - The article discusses the evolution and importance of planning modules in autonomous driving, emphasizing the need for engineers to understand both traditional and machine learning-based approaches to effectively address challenges in the field [5][8][10]. Group 1: Importance of Planning - Understanding planning is crucial for engineers, especially in the context of autonomous driving, as it allows for better service to downstream customers and enhances problem-solving capabilities [8][10]. - The transition from rule-based systems to machine learning systems in planning will likely see a coexistence of both methods for an extended period, with a gradual shift in their usage ratio from 8:2 to 2:8 [8][10]. Group 2: Planning System Overview - The planning system in autonomous vehicles is essential for generating safe, comfortable, and efficient driving trajectories, relying on inputs from perception outputs [11][12]. - Traditional planning modules consist of global path planning, behavior planning, and trajectory planning, with behavior and trajectory planning often working in tandem [12]. Group 3: Challenges in Planning - A significant challenge in the planning technology stack is the lack of standardized terminology, leading to confusion in both academic and industrial contexts [15]. - The article highlights the need for a unified approach to behavior planning, as the current lack of consensus on semantic actions limits the effectiveness of planning systems [18]. Group 4: Planning Techniques - The article outlines three primary tools used in planning: search, sampling, and optimization, each with its own methodologies and applications in autonomous driving [24][41]. - Search methods, such as Dijkstra and A* algorithms, are popular for path planning, while sampling methods like Monte Carlo are used for evaluating numerous options quickly [25][32]. Group 5: Industrial Practices - The article discusses the distinction between decoupled and joint spatiotemporal planning methods, with decoupled solutions being easier to implement but potentially less optimal in complex scenarios [52][54]. - The Apollo EM planner is presented as an example of a decoupled planning approach, which simplifies the problem by breaking it into two-dimensional issues [56][58]. Group 6: Decision-Making in Autonomous Driving - Decision-making in autonomous driving focuses on interactions with other road users, addressing uncertainties and dynamic behaviors that complicate planning [68][69]. - The use of Markov Decision Processes (MDP) and Partially Observable Markov Decision Processes (POMDP) frameworks is essential for handling the probabilistic nature of interactions in driving scenarios [70][74].
Meta合同工爆料:见过脸书用户向AI聊天机器人泄露隐私
财富FORTUNE· 2025-08-08 13:05
Core Viewpoint - Users are increasingly sharing personal information with AI systems, particularly on Meta's platforms, raising concerns about privacy and data management practices [1][3][7]. Group 1: User Behavior and AI Interaction - Users tend to share highly sensitive personal details with Meta's AI, including real names, phone numbers, and explicit photos, treating the AI as a confidant [1]. - Contract workers for Meta have reported that the frequency of unredacted personal data in user interactions is higher compared to similar projects at other tech companies [1]. Group 2: Historical Context of Privacy Issues - Meta has a troubled history regarding user privacy, highlighted by the Cambridge Analytica scandal, where user data was exploited without consent, leading to a $5 billion fine from the FTC [4][6]. - The company has faced scrutiny for its reliance on third-party contractors for data handling, which has raised questions about its data governance practices [3][7]. Group 3: Current Practices and Company Response - Meta claims to have strict policies in place to limit contractor access to personal data and has implemented processes to handle sensitive information appropriately [8][9]. - Despite these claims, the recent revelations about contractor access to user data have reignited concerns about Meta's data management and privacy practices [7].
OpenAI重磅发布GPT-5!性能大幅提升至“专家级别”
Zheng Quan Shi Bao Wang· 2025-08-08 10:57
在频频"跳票"和多次"剧透"之后,万众期待的GPT-5终于发布了。 (原标题:OpenAI重磅发布GPT-5!性能大幅提升至"专家级别") GPT-5最核心的亮点是,它并非单一的语言或者推理模型,而是整合了GPT系列(大语言模型)和o系 列(推理模型),具备调度子模型的能力。奥特曼在其个人社交平台上连发十余条推文介绍GPT-5,其 中首条就强调"GPT-5是一个集成模型,这意味着不再需要模型切换器,它将自行决定何时需要更深入 地思考"。 北京时间8月8日凌晨1时,OpenAI举行了长达1个多小时的线上发布会,正式推出了GPT-5。与此前的模 型更新直播时间短且主要由研发人员发布相比,GPT-5的发布明显规格更高,不仅发布时间长、细节 多,而且OpenAI首席执行官山姆·奥特曼也现身发布会现场。 经证券时报记者梳理,发布会的主要亮点如下: 集成模型:GPT-5是一个集成模型(integrated model),融合了大语言模型GPT系列和推理模型o系列, 这意味着用户在使用时不再需要手动切换各类不同的模型。 能力提升:据OpenAI公开的测试数据,GPT-5在数学、编程、视觉感知和健康等领域,都表现出了顶尖 性 ...
新网银行积极开展2025年全国金融科技活动周宣传活动
Zhong Guo Jing Ji Wang· 2025-08-08 07:22
(责任编辑:华青剑) 与此同时,新网银行策划主题直播,两位AI专家在直播间细致讲解科技实践应用,生动展示 AIGC、大语言模型等前沿技术,多维度呈现人工智能带来的科技成果,营造热爱科学、崇尚创新的浓 厚氛围,并提高了公众对金融科技的认识和兴趣。直播中,嘉宾们还结合当前网络安全热点话题,提醒 观众在体验AI技术便捷性的同时,也要警惕各类"AI投毒""AI幻觉"。新网银行视频号、微博号、抖音号 多平台现场直播,累计观看人数实现10万+。 新网银行深化数字化战略布局,依靠自身力量,深度融合大数据、隐私计算与人工智能等数字技 术,构建起贯通多场景的开放生态平台,形成了全在线、全实时、全客群的银行业务模式。面向未来, 新网银行将深化前沿技术与金融业务场景的融合创新,通过打造多元化数字普惠金融产品,满足大众多 层次金融需求,以数字技术培育新质生产力,扎实做好五篇大文章的时代答卷。 近期,新网银行以全国金融科技活动周为契机,围绕"矢志创新发展,建设科技强国"主题,精心策 划并开展了一系列丰富多彩的金融科技宣传活动,积极面向公众宣传科普各类知识,为建设科技强国贡 献金融力量。 在全国金融科技活动周期间,新网银行充分利用线上渠 ...
探路数字金融,零售之王“智变”的求索与未来
Zhong Guo Jing Ji Wang· 2025-08-08 07:22
Core Insights - The article emphasizes the critical role of digital finance in the banking sector, highlighting that digitalization is no longer optional but a necessity for all banks in 2023 [1] - China Merchants Bank (CMB) is recognized as a leader in the industry, having made significant investments in digital transformation and technology integration [1][2] Group 1: Digital Finance and Technology Integration - Digital finance is defined as a high-level financial form that combines technology and financial innovation, enhancing the efficiency of financial supply and promoting inclusive financial services [2] - CMB completed a comprehensive cloud migration project by the end of 2022, becoming one of the first major banks in China to fully transition to cloud services, which has significantly upgraded its technological infrastructure [2] - As of June 2024, CMB's "Zhaoqi Loan" has disbursed over 50 billion yuan to 50,000 small and micro enterprises, with 76% of these businesses receiving credit loans for the first time [2][6] Group 2: AI and Risk Management - CMB is focusing on building an intelligent computing platform to leverage large language models, aiming to create specialized models for the financial sector rather than general-purpose models [3] - The bank has developed a comprehensive risk management system that utilizes AI and machine learning to identify and intercept fraudulent transactions, ensuring customer safety [4] - CMB's innovative approach to small and micro enterprise financing has led to the launch of the "Zhaoqi Loan," which offers pure credit, no-collateral loans, streamlining the approval process to seconds [5][6] Group 3: Efficiency and Cost Reduction - The implementation of large language models across various business segments has resulted in significant improvements in efficiency, cost reduction, and enhanced service quality [7] - CMB has integrated over 120 scenarios utilizing large language models, benefiting over 20 million users with intelligent banking services [7] - The bank's fraud detection system processes millions of transactions daily, showcasing the effectiveness of AI in enhancing operational capabilities [7] Group 4: Challenges and Future Directions - Despite the advancements, CMB acknowledges challenges such as high resource consumption, stringent data privacy requirements, and the need for explainability in AI-generated responses [8] - The development of digital finance calls for synchronized efforts between policy and market, including the establishment of a tiered authorization mechanism for data ownership [8]
SuperX首发全栈式多模型一体机,开创多模态智能体协同新纪元
Zhong Guo Chan Ye Jing Ji Xin Xi Wang· 2025-08-08 07:17
Core Insights - Super X AI Technology Limited has launched a multi-model integrated machine that pre-installs OpenAI's latest large language models, GPT-OSS-120B and GPT-OSS-20B, and allows for the download of other popular open-source models, marking a significant innovation in AI products [1][2] - The new product aims to redefine AI infrastructure standards with features such as "plug-and-play," multi-model integration, and scenario penetration, catering to various enterprise sizes and needs [1][3] Product Features - The multi-model integrated machine supports various models including reasoning, general, multi-modal, language synthesis/recognition, embedding, re-ranking, and text-to-image models, enabling deep integration with application scenarios [3] - It facilitates complex business applications, such as directly locating video segments based on text descriptions and supports over 60 pre-set scenario intelligent agents [3][4] - The machine offers cloud collaboration and caching capabilities, allowing users to access the latest global models without delay [3] Market Positioning - SuperX's integrated machine addresses challenges in AI deployment, such as data security, cost control, and technical adaptation, providing a comprehensive enterprise-level generative AI platform [4][5] - The pricing for the new AI server B200 standard and cluster versions is set at $500,000 and $4 million respectively, while the AI workstation standard and flagship versions are priced at $50,000 and $250,000 [5] Industry Impact - The demand for large AI models is experiencing exponential growth across various sectors including education, research, healthcare, finance, automotive, and general industry, positioning SuperX to achieve significant economic benefits and further product development [5] - The CTO of SuperX emphasizes that multi-model collaboration is a crucial step towards achieving AGI, aiming to build an ecosystem for intelligent agent developers in collaboration with industry clients [6]