Inference computing power - filings, earnings calls, financial reports, news - Reportify

Inference computing power

Computer Industry Weekly: From Changes in Domestic Computing Power to LPU! A Preview of the New DS Model - 20260228

Shenwan Hongyuan Securities· 2026-02-28 12:13

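Industry research / industry commentary. Related research: "A Full Review of Large-Model Updates at Home and Abroad over the Spring Festival! Biren Technology In-Depth Report Released! Computer Industry Weekly 20260216-20260220", 2026/02/23; "Will Models Devour Software? Computer Industry Weekly 20260202-20260206", 2026/02/07. Securities analysts: Huang Zhonghuang, A0230519110001, huangzh@swsresearch.com; Hong Yizhen, A0230519060003, hongyz@swsresearch.com; Liu Yang, A0230513050006, liuyang2@swsresearch.com. Research support: Cui Hang, A0230524080005, cuihang@swsresearch.com; Cao Zheng, A0230525040002, caozheng@swsresearch.com; Chen Qinghua, A0230525100001, chenqh@swsresearch.com; Luo Yuqi, A0230124070004, luoyq@swsresearch.com. Contact: Wang Kaiyuan, A0230125030001, wangky@swsresearch.com. 2026-02 ...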

Nvidia(US:NVDA)

Pure inference chips

Computer Industry Weekly 20260223-20260227: From Changes in Domestic Computing Power to LPU! A Preview of the New DS Model! - 20260228

Shenwan Hongyuan Securities· 2026-02-28 11:01

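Industry research / industry commentary. 2026-02-28. From Changes in Domestic Computing Power to LPU! A Preview of the New DS Model! Rating: Positive. Computer Industry Weekly 20260223-20260227. Related research: "A Full Review of Large-Model Updates at Home and Abroad over the Spring Festival! Biren Technology In-Depth Report Released! Computer Industry Weekly 20260216-20260220", 2026/02/23; "Will Models Devour Software? Computer Industry Weekly 20260202-20260206", 2026/02/07. Securities analysts: Huang Zhonghuang, A0230519110001, huangzh@swsresearch.com; Hong Yizhen, A0230519060003, hongyz@swsresearch.com; Liu Yang, A0230513050006, liuyang2@swsresearch.com. Research support: Cui Hang, A0230524080005, cuihang@swsresearch.com; Cao Zheng, A0230525040002, caozheng@swsresearch.com; Chen Qinghua, A0230525100001, chenqh@swsresearch.com; Luo Yuqi, A0230124070004, luoyq@ ...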

Nvidia(US:NVDA)

Pure inference chips

Domestic computing-power chips

Zhou Hongyi's Latest Remarks!

Zhong Guo Ji Jin Bao· 2026-02-27 07:29

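On "how enterprises and individuals can quickly adopt AI," Zhou Hongyi said the current problem is that everyone is using AI assistants or treating AI as a search engine, so how can individuals build their own private agents? The lesson from OpenClaw is to keep things simple. "Only when agents become more specialized and deliver value directly to enterprises will enterprises be willing to pay for them," Zhou stressed. [Introduction] Zhou Hongyi, member of the CPPCC National Committee and founder of 360: will focus on AI-empowered security and related directions. China Fund News reporter Lu Ling. Zhou Hongyi, member of the CPPCC National Committee and founder of 360, said in a group media interview on the afternoon of February 26 that during this year's national Two Sessions he will focus on AI-empowered security, how AI can be deployed in China, and how enterprises and individuals can quickly adopt AI. "Take Anthropic as an example: AI programming and AI vulnerability discovery can solve many security problems that could not be solved before, so I suggest paying attention to AI agents," Zhou said. According to him, 360 has built dozens of types and tens of thousands of AI security agents, which can find software vulnerabilities and defend against hacker agents from other countries. On "how AI can be deployed in China," Zhou said computing power must be divided into training computing power and inference computing power: training computing power may still have some room to grow in scale, whereas the room for inference computing power is unlimited. "So I hope localities will lean toward inference computing power when developing computing power. In terms of national industrial policy, chip policy should not chase only Nvidia's high-end training chips; inference chips ...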

360 Security Technology (SH:601360)

AI security agents

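Unknown institution: OpenClaw Goes Viral, the AI Closed Loop Advances Further, and Demand for Inference Computing Power Keeps Rising - 20260224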

Unknown institution· 2026-02-24 03:50

OpenClaw goes viral, the AI closed loop advances further, and demand for inference computing power keeps rising. Recommended focus: Intellifusion (云天励飞), #core edge-side inference + core government-side (G-end) local deployment business. 1. OpenClaw is not an ordinary AI tool; it is more like an intelligent robot that can set up a business end to end. Traditional AI tools such as ChatGPT are mostly single-function, and different tools share no memory with one another. For example, you write content with one tool and do SEO with another; the tools do not talk to each other and can only complete fragmented ...

Artificial Intelligence

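Future Intelligent Manufacturing Bureau | "One Cent per Million Tokens": Inference GPUs Drive the Second Half of Large-Model Development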

Xin Hua Cai Jing· 2026-02-02 08:51

Core Insights
- The AI industry is shifting from a "training-driven" phase to an "inference-driven" phase, with inference computing power becoming the core element in commercializing AI [1][2]
- Sunrise, a domestic AI chip company, has launched its new-generation inference GPU, the Qihang S3, with a target of "one cent per million tokens" [1][5]
- Over the next decade, inference infrastructure is expected to become the foundation of China's AI era, which makes cost-effective and scalable inference capacity essential [1][9]

Group 1: Inference Computing Power
- Inference computing power is essential to the practical application of AI; predictions indicate that by 2026 inference will account for 66% of AI computing, surpassing training for the first time [2][4]
- The shift toward inference-driven AI is crucial for making AI services in the real economy more efficient [2][3]

Group 2: Sunrise's Innovations
- Sunrise is the first company in China to focus on inference GPUs. It developed its first chip, the Qihang S1, in 2018 and has since released the Qihang S2 and Qihang S3, which are optimized for large-model inference [3][5]
- The Qihang S3 aims for a more than tenfold improvement in inference cost-effectiveness; current costs are approximately 0.57 yuan per million tokens, better than the market average [5][6]

Group 3: Industry Challenges and Solutions
- The industry faces low resource utilization, insufficient adaptation efficiency, and complex operations, with GPU idle rates above 40% under traditional architectures [6][8]
- Sunrise is working with partners on a system-level inference solution that optimizes hardware and software together to address these challenges and improve computing efficiency [6][8]

Group 4: Market Potential and Future Trends
- Demand for inference tokens is expected to grow exponentially, creating a significant market opportunity for specialized inference GPUs [6][9]
- Lower inference costs are projected to drive a large increase in AI applications; estimates suggest that a 50% cost reduction could trigger widespread adoption [8][9]

Artificial Intelligence

Zhou Hongyi Teases That 360 Will Launch a "Short Drama Agent": Enter a Script to Generate a Blockbuster Animated Drama

Zheng Quan Shi Bao Wang· 2026-01-26 09:12

Core Insights
- Zhou Hongyi, founder of 360 Group, predicts that in 2026 the world will enter the "ten-billion-agent" era, and that China is well positioned to seize this strategic opportunity [1][4]
- 360 Group is set to launch a "short drama agent" that lets users generate large-scale animated dramas from scripts, significantly lowering the barrier to content creation [1][2]

Group 1: AI Evolution and Market Dynamics
- Zhou Hongyi characterizes 2024 as a year focused on large models and 2025 as a transition period. Large models, mostly in the form of "chatbots," struggle to address complex business problems directly [1]
- Zhou's "five-force model" comprises "electricity - computing power - intelligence + human power - productivity," and stresses that converting general computing power into specialized intelligence is crucial for practical applications [1]
- The industry often confuses "training computing power" with "inference computing power"; demand for the latter is expected to grow exponentially as agents are applied to complex tasks such as short drama production and education [2]

Group 2: Transformation of Internet and Business Models
- The rise of agents will fundamentally change how people interact with software and the internet, splitting the internet into two types: one for human use and one for agents [3]
- Traditional e-commerce will shift from "humans finding goods" to an agent-based model in which agents handle the entire transaction process, so that more transactions take place between agents than between humans and screens [3]
- The agent economy will require new trust and settlement systems, with advances in identity verification, transaction security, and automated settlement that draw on technologies such as blockchain and smart contracts [3]

Group 3: China's Strategic Position
- China has robust electrical infrastructure, a complete industrial system, and a strong open-source model ecosystem, which position it to capitalize on the ten-billion-agent era [4]
- Zhou calls on companies to foster an "AI-native" culture that turns individuals who embrace AI into "super individuals," while maintaining safety standards to mitigate the risks of collective intelligence [4]

360 Security Technology (SH:601360)

Ten-billion-agent era

Short drama agent

Over $10 Billion! OpenAI Signs a Major AI Chip Order

Xinhuanet Finance· 2026-01-16 03:34

Core Viewpoint
- OpenAI and Cerebras are collaborating to deploy a 750 MW wafer-scale system, which will become the world's largest high-speed AI inference platform by 2028, with a project value exceeding $10 billion [1]

Group 1: Collaboration and Market Demand
- The partnership between OpenAI and Cerebras signals strong market demand for inference computing power and the growing importance tech giants place on inference speed [1]
- Cerebras, founded in 2015, aims to build the fastest AI inference and training platform; its CS-2 and CS-3 systems are already used in fields such as medical research and cryptography [4]

Group 2: Technological Advancements
- Cerebras' system integrates massive computing power, memory, and bandwidth into a single giant chip, eliminating the traditional hardware bottlenecks that limit inference speed [4]
- Large language models running on Cerebras technology can respond up to 15 times faster than those on GPU systems for code and voice chat tasks [4]

Group 3: Industry Trends
- Speed has historically played a crucial role in technology adoption: advances in processor frequency and internet connectivity drove the growth of personal computing and the modern internet [5]
- Low-latency inference solutions provide faster response times and more natural interactions, raising productivity in the AI-driven market [5]

Group 4: Competitive Landscape
- In December 2025, AI chip startup Groq announced a non-exclusive licensing agreement with NVIDIA valued at $20 billion, NVIDIA's largest transaction to date [5]
- NVIDIA plans to integrate Groq's low-latency processors into its AI factory architecture to support a broader range of AI inference and real-time workloads [6]

Low-latency inference solutions

CS-2 and CS-3 systems

Cerebras wafer-scale system

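Alibaba Cloud's Zhang Chi: AI Inference Computing Power Will Surpass Training Computing Power; Financial Applications Need a Coordinated "Large and Small Flywheel" System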

Xin Lang Cai Jing· 2026-01-04 07:53

Group 1
- The core theme of the China Wealth Management 50 Forum 2025 Annual Meeting is "Towards a Financial Powerhouse in the 14th Five-Year Plan" [1][4]
- Alibaba Cloud's strategy focuses on "full-stack AI cloud" and "globalization," emphasizing a complete system that runs from underlying chips and infrastructure to model applications [1][4]

Group 2
- China and the US each have strengths and weaknesses across model fields, with China showing a clear lead in niche areas such as autonomous driving and embodied intelligence [3][6]
- Future demand for inference computing power is expected to surpass demand for training computing power, a "reversal" of the current balance [3][6]
- Cloud and AI are described as a mutually reinforcing "flywheel": financial institutions need a dual-wheel system in which a large flywheel drives intent understanding and a small flywheel handles execution, so that the two collaborate closely and AI is integrated into professional workflows [3][6]

Financial agentic AI

Production-grade scenarios

Industry Commentary Report: Capitalization May Help Accelerate AI Application Commercialization; Continue to Watch New Games

KAIYUAN SECURITIES· 2025-12-29 01:46

Investment Rating
- The report maintains a "Positive" investment rating for the media industry [1]

Core Insights
- The report highlights the acceleration of AI applications and the commercialization of large models, particularly through the IPOs of companies such as Zhipu and MiniMax, which are expected to strengthen their business investment and technological progress [11][21]
- Game license issuance is rising significantly: 1,771 licenses were granted in 2025, more than 20% above 2024, indicating continued policy support for the gaming industry [11][44]

Summary by Sections

Section 1: Zhipu and MiniMax IPOs
- Zhipu and MiniMax are set to go public in Hong Kong, which is expected to boost their business investment and accelerate the development and application of large-model technologies [11]
- Zhipu focuses on B-end markets with strong capabilities in model reasoning and programming, while MiniMax targets C-end markets with a diverse product line [11][21]

Section 2: Industry Data Overview
- "NBA Champion Dynasty" topped the iOS free game chart in mainland China, while "Yanyun Sixteen Sounds" led the iOS sales chart [44]
- The film "Avatar 3" achieved the highest box office of the week [44]

Section 3: Industry News Summary
- MiniMax's daily active users surpassed 100 million, and Douyin's mini-games saw significant user and revenue growth [11]
- The issuance of 147 game licenses in December 2025 reflects a robust pipeline of new game releases [11][44]

Section 4: Announcement Summary
- Oriental Pearl is participating in the establishment of an AI fund, and Electric Sound Co. is adjusting its fundraising projects [11]

Section 5: Sector Performance Overview
- The media sector performed at the lower end of the market in the 52nd week of 2025, while the internet sector performed better [11]

Multimodal AI applications

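Industry Weekly: Major Tech Firms Accelerate Model Upgrades; Continue Positioning in Gaming and Other Multimodal AI Applications - 20251221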

KAIYUAN SECURITIES· 2025-12-21 15:28

Investment Rating
- The industry investment rating is "Positive" (maintained) [1]

Core Insights
- Major tech companies are accelerating upgrades to multimodal AI models, which is expected to improve the efficiency and diversity of content production while increasing demand for inference computing power [4][30]
- The gaming sector is expected to remain highly prosperous thanks to new game launches and the ongoing operation of evergreen titles, and the report recommends increasing investment in this area [4][29]

Industry Data Overview
- "Delta Operation" ranked first on the iOS free game chart in mainland China, while "Honor of Kings" topped the iOS game revenue chart [10][14]
- The film "Zootopia 2" achieved the highest box office of the week [10][25]

Industry News Summary
- Major companies continue to invest in large models, and the domestic gaming market has reached new highs in both scale and user numbers [28]
- Google's Gemini 3 Flash has broken the "performance-cost-speed" Pareto frontier, while domestic giants are allocating more resources to the continuous iteration of large models [28][29]
- Alibaba's newly launched model supports role-playing and is described as the most comprehensive video generation model globally [29]
- Tencent's Hunyuan world model 1.5 can create interactive worlds from text or images, enhancing the gaming experience [29]
- The Doubao large model has seen a significant increase in daily token processing volume, indicating robust growth in AI applications [31][32]

Wanxiang 2.6 series models
