AI Infra
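Unknown Institution: Soochow Computer: AI Catalysts Keep Coming at Home and Abroad over Spring Festival; Focus on the Most Certain Theme, AI Infra 20260-20260224
Unknown Institution · 2026-02-24 03:30
CEO Jensen Huang said the company plans to announce a chip at GTC 2026 that will "shock the world." [Soochow Computer] AI catalysts keep coming at home and abroad over Spring Festival; focus on the most certain theme, AI infra (20260223). Over Spring Festival, large-model vendors including Alibaba, ByteDance, Zhipu, Kimi, MiniMax, and Google released new models and products. OpenClaw's popularity surged: within just two weeks, its token usage climbed to about 13% of all tokens on OpenRouter. 1. Storage remains in short supply, and new storage architectures are about to emerge. AI SSDs are also expected to debut. Hynix said that demand from all customers cannot be met. M2.5 did not raise prices on existing models but introduced a new higher-priced plan. Overseas rental prices for H, A, and B card compute continue to rise. DRAM and NAND inventory currently stands at only about 4 weeks, and no customer can have its demand fully met. SanDisk has ...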
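Zheshang Securities: Domestic Large Models Released in Quick Succession; Scaled Applications Drive Inference Demand
Zhitong Finance (智通财经网) · 2026-02-12 06:16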
Core Insights
- The recent wave of domestic large-model releases signals the start of an AI arms race, with significant advances in capabilities and applications [1]
- Agents are becoming more usable, moving large models from chat-based interaction to collaborative task work, with notable improvements in multimodal applications [2]
- Demand for inference compute is expected to rise significantly as large models are applied at scale, particularly in video production and agent functionality [3]

Group 1: Recent Developments in Large Models
- Domestic large models have been released in quick succession, including DeepSeek's new model with a 1M-token context window, well above the previous maximum of 128K [1]
- GLM-5 has launched on the Zhipu website, focusing on programming and agent enhancement and outperforming the latest Claude Opus 4.6 model in global programming tests [1]
- ByteDance has introduced Seedance 2.0, which significantly lowers the barriers and costs of video creation and could transform the video production industry [1]

Group 2: Advancements in Agent and Multi-Modal Applications
- Agent usability is improving, with models such as Claude Opus 4.5 capable of autonomous programming for up to 5 hours [2]
- The length of tasks AI coding agents can handle is estimated to double every 4 months over 2024-2025, a marked acceleration from doubling every 7 months over 2019-2024 [2]
- Seedance 2.0 supports various combinations of video, audio, and text inputs, producing high-quality video output while reducing creation costs [2]

Group 3: Inference Demand and Cost Implications
- Token consumption for large models is shifting from dialogue and image generation toward more intensive applications such as agents and video production, driving a rapid increase in inference compute requirements [3]
- Generating a 5-second 720P video costs approximately 4 RMB, with Seedance costing around 2.3 RMB, a significant cost advantage over manual production [3]
- Rising AI penetration in video creation is expected to drive demand for computational power [3]

Group 4: Related Companies
- Relevant companies include MiniMax-WP (00100), Zhipu (02513), Yunsai Zhili (600602.SH), Youke De-W (688158.SH), Capital Online (300846.SZ), Qingyun Technology-U (688316.SH), Wangsu Technology (300017.SZ), and Nanxing Co. (002757.SZ) [4]
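To put the doubling periods quoted in Group 2 on a common footing, the short sketch below converts a doubling period into a growth multiple over a fixed horizon. It assumes clean exponential growth; the function name `growth_multiple` and the 12-month horizon are illustrative choices and do not come from the Zheshang Securities report.

```python
def growth_multiple(doubling_months: float, horizon_months: float = 12.0) -> float:
    """Return how many times a quantity grows over the horizon,
    assuming it doubles every `doubling_months` months (pure exponential growth)."""
    if doubling_months <= 0:
        raise ValueError("doubling_months must be positive")
    return 2.0 ** (horizon_months / doubling_months)


# Doubling periods quoted in the summary; the 12-month horizon is an assumption.
fast = growth_multiple(4)  # pace cited for 2024-2025
slow = growth_multiple(7)  # pace cited for 2019-2024

print(f"4-month doubling: {fast:.2f}x over 12 months")
print(f"7-month doubling: {slow:.2f}x over 12 months")
```

Under these assumptions, a 4-month doubling period compounds to 8x over a year, against roughly 3.3x for a 7-month period.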
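Unknown Institution: Vastdata (海量数据) Meeting Takeaways: Industry Background, Development Trends in Large-Model Memory Storage - 20260211
Unknown Institution · 2026-02-11 02:20
Vastdata meeting takeaways. Industry background: development trends in large-model memory storage. The memory side of large models (long context, retention of historical memory) is the industry's core narrative for 2026, with vector databases and RAG as the key technologies. Google Gemini overseas and ByteDance, Doubao, and Alibaba Qwen in China are all strengthening model memory, and the field is seen as a core direction in the industry's first year (the 0-to-1 stage). Business-structure transformation: the company is migrating from traditional Xinchuang (domestic-substitution) relational databases to AI databases, a market with large room for growth and explosive potential. Technology and partnership progress: the company and Li Guoliang's team at Tsinghua University have jointly established an AIDB lab, which focuses on vector databases and on advancing model integration in step with industry changes; the two sides have cooperated for a long time, and this marks the lab's formal launch. The company has formally entered the vector database and AIDB fields, has built up substantial technology, and is seeing positive results in orders and revenue. In domestic databases, the company is in the top ...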
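Unknown Institution: Recommend Watching the AI Infra Software Leader, a Core ByteDance-Chain Name; 1. The Large-Model Era - 20260211
Unknown Institution · 2026-02-11 02:10
Recommend watching the AI Infra software leader, a core ByteDance-chain name. 1. In the large-model era, program execution shifts from deterministic programming to the probabilistic, non-deterministic behavior of LLMs. Token-consumption swings, inference-latency jitter, and lost session context occur frequently, agent decision chains are complex, and tracing problems to their source becomes harder. 2. Bonree ONE is the first product in China to deliver an integrated intelligent observability platform. It can monitor token consumption to optimize cost, supports analysis and tracing of complex interactions in multi-agent systems, and automatically runs root-cause analysis when a failure occurs, helping customers locate problems quickly. 3. Partnering with Volcano Engine to explore new business models. #Volcano Engine has already launched a third-party probe provided by Bonree Data (博睿数据), which reports monitoring data to the APMPlus server for application performance monitoring, #火 ...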
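Unknown Institution: Repost: No. 1 Visited the National Xinchuang Park; Recent Grassroots Research on xc - 20260210
Unknown Institution · 2026-02-10 02:20
Repost: No. 1 visited the National Xinchuang Park. Recent grassroots research suggests core xc (Xinchuang) earnings were decent last year; profit at one "-Xin" company (某信) even topped 100 million. Watch: Software: Hardware: High-elasticity AI Infra: Time window: from before the holiday through the Two Sessions, with very high certainty and very attractive odds. Medium term: 2027 is critical; a new 10-year cycle ...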
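Unknown Institution: No. 1 Visited the National Xinchuang Park; Recent Grassroots Research Shows Core xc Earnings Were Decent Last Year - 20260210
Unknown Institution · 2026-02-10 02:20
No. 1 visited the National Xinchuang Park. Recent grassroots research suggests core xc (Xinchuang) earnings were decent last year; profit at one "-Xin" company (某信) even topped 100 million. Watch: Software: Hardware: High-elasticity AI Infra: Time window: from before the holiday through the Two Sessions, with very high certainty and very attractive odds. Medium term: 2027 is critical; a new 10-year cycle ...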
Seedance 2.0 and the ByteDance Chain
Fourier's Cat (傅里叶的猫) · 2026-02-08 15:58
Group 1
- The article centers on the significant advances and commercial potential of ByteDance's Seedance 2.0, which has drawn wide discussion for moving from "generating a scene" to "completing a work" [2][3]
- Seedance 2.0 shows strong determinism in content generation, letting creators control outcomes precisely through integrated visual and auditory signals and making audio-visual synchronization more natural [3]
- The model's design focuses on reducing uncertainty in generation paths, optimizing token consumption, and significantly lowering video production costs, which makes it attractive to the e-commerce, short-drama, and advertising industries [4]

Group 2
- Analysts are optimistic about three main areas that benefit from Seedance 2.0: AI content production and distribution, AI infrastructure, and ByteDance's computing power chain [6]
- The recent upgrade of the knowledge platform adds comprehensive daily reports that summarize news and analyst opinions across industries, to help readers follow market trends [6]
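A-Share Evening Highlights | Major News from the State Council Executive Meeting: Studying Policy Measures to Promote Effective Investment
Zhitong Finance (智通财经网) · 2026-02-06 16:15
The evening report follows. 1. Li Qiang chairs a State Council executive meeting to study policy measures for promoting effective investment. Importance: ★★★★★ Premier Li Qiang chaired a State Council executive meeting on February 6. The meeting heard a report on how State Council departments handled suggestions from National People's Congress deputies and proposals from the CPPCC National Committee in 2025, studied policy measures to promote effective investment, arranged the revision of the Ambient Air Quality Standards, and discussed the Tendering and Bidding Law of the People's Republic of China (Draft Revision). The meeting noted that promoting effective investment plays an important role in stabilizing economic growth and strengthening development momentum. Policy measures should be innovated and improved, and funds such as central budgetary investment, ultra-long-term special treasury bonds, and local government special-purpose bonds, together with new policy-based financial instruments, should be used more forcefully and efficiently. In conjunction with formulating and implementing the 15th Five-Year Plan, and with a view to long-term development needs and building future competitive advantages, a batch of major projects should be planned and advanced in key areas such as infrastructure, urban renewal, public services, emerging industries, and future industries. In addition, Li Qiang chaired the 10th plenary meeting of the State Council, which discussed the draft government work report and the draft outline of the 15th Five-Year Plan to be submitted to the fourth session of the 14th National People's Congress for deliberation. Li Qiang said that macro policy should be front-loaded, fiscal funds should be arranged as early as possible, and fund disbursement should be better coordinated with project construction so that policies take effect quickly. Key tasks should be advanced promptly, and those whose conditions are ripe should be organized and implemented early. Policy support should go hand in hand with reform and innovation to better stimulate market vitality and tap new growth points in domestic demand. Changes in the situation should be tracked closely ...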
Internet Giants Race for Talent, with Annual Pay of up to 1.28 Million Yuan
21st Century Business Herald (21世纪经济报道) · 2026-02-06 14:52
Core Viewpoint
- The article discusses the intense competition among major internet companies, particularly Tencent, to attract top AI talent through high salaries and new scholarship programs, highlighting the industry's talent scarcity and the strategic investments being made in AI research and development [1][4].

Group 1: Talent Acquisition Strategies
- Tencent is actively recruiting AI talent with high salaries across positions, such as over 750,000 yuan for user operation roles and nearly 1,000,000 yuan for AI application engineers [1].
- The "Qingyun Plan" is Tencent's initiative to attract top technical students globally, similar to ByteDance's Top Seed talent program [1].
- The "Qingyun Scholarship" offers significant financial incentives, including 500,000 yuan per recipient, to support students in AI and computer science [2].

Group 2: Investment in Research and Development
- Tencent's R&D expenditure reached a record 22.82 billion yuan in Q3 2025, bringing the total for the first three quarters of 2025 to 61.983 billion yuan [4].
- The company emphasizes how important computational resources are to top PhD students and provides cloud heterogeneous computing resources as part of the scholarship [4].

Group 3: Recruitment of Established Talent
- Tencent is also accelerating the recruitment of established AI experts, as shown by the hiring of prominent figures such as Pang Tianyu and Yao Shunyu, who have significant academic and industry experience [5].
- New departments within Tencent, such as AI Infra and AI Data, aim to strengthen its capabilities in large-model research and development [5].

Group 4: Academic Collaboration and Knowledge Sharing
- Tencent launched a technical blog to share research findings, a step toward greater academic influence and transparency in AI technology [6].
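First Large-Scale Memory Lake Released as AI Infra Races into the "Memory" Era
QbitAI (量子位) · 2026-02-05 04:10
By Tian Yanlin, from Aofeisi (凹非寺). QbitAI | WeChat official account QbitAI. "Your brain is for having ideas, not holding them." (Tiago Forte, Building a Second Brain) The LLM is AI's "first brain", and the memory platform is AI's "second brain". Bestselling author Tiago Forte shared a core idea in Building a Second Brain: "The biological brain is only for thinking and creating, while external systems are for storing information reliably." This is highly instructive for understanding the division of labor between AI's "two brains". In practice, the LLM is like AI's "first brain" (the biological brain): it is good at thinking, reasoning, and on-the-fly generation, but not at storing massive amounts of fact precisely over the long term. The memory platform is AI's "second brain": it supplies the LLM with accurate "memory" on demand, freeing the LLM from the burden of remembering so it can focus on higher-level reasoning and creation, and the two together produce more precise, personalized, and actionable value. Combined, the memory platform is responsible for "remembering everything" and the LLM for "thinking about everything". 3.0 Productivity era (2025 to present): extracting "tacit knowledge" and locking in core assets. The industry's focus shifts to raising productivity directly. The key leap is whether tacit knowledge, such as employees' decision logic and experience-based trade-offs, can be digitized and captured as trajectories. This is no longer simple Q&A, but rather, through ...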