DeepSeek
Search documents
DeepSeek灰度测试新模型!科创人工智能ETF华夏(589010)高位盘整,优刻得涨停领涨
Mei Ri Jing Ji Xin Wen· 2026-02-12 03:05
Group 1 - The core viewpoint of the news highlights the performance of the Huaxia Sci-Tech Artificial Intelligence ETF (589010), which experienced a slight pullback after an initial surge, currently priced at 1.594 yuan, reflecting a 1.271% increase from the opening price [1] - Among the 30 constituent stocks tracked by the ETF, 19 stocks saw gains, with Yuke Technology leading with a 20% limit-up, while Sikan Technology experienced a decline of over 5% [1] - The trading volume of the ETF reached 52.52 million yuan, with a turnover rate of 1.98%, indicating stable liquidity and moderate trading activity [1] Group 2 - DeepSeek has launched a new model and initiated gray testing, enhancing its contextual capability from 128K Tokens to 1M Tokens, with the knowledge base updated to May 2025, improving context capacity, knowledge, capability, and user interaction [1] - Guolian Minsheng Securities notes that recent releases of new features or ecosystem integrations by various large models domestically and internationally are expected to lead a new wave of innovation in large models [1] - The transition to the Agent era is beginning to reshape the internet value chain, shifting focus from traditional traffic scale to business models that emphasize behavioral execution, capability invocation, governance control, and outcome-based payments [1][2]
突发!百万Token上下文+Agent新模型深夜连发,创业板人工智能ETF(159243)放量涨超2%!
Sou Hu Cai Jing· 2026-02-12 02:46
Group 1 - The domestic large model field has seen significant updates, igniting market expectations for a new round of innovation in the AI industry chain [2] - DeepSeek has launched an update that supports a maximum context length of 1 million tokens, enabling the model to process long texts such as entire novels and complex documents, facilitating AI applications in high-value scenarios like financial analysis and legal review [3] - Zhiyu AI has released its flagship model GLM-5, achieving state-of-the-art capabilities in coding and agent functions, bringing its performance close to international leaders, indicating that domestic models are now competitive in execution capabilities [4] Group 2 - The release of new models is part of a concentrated launch window, with several companies including DeepSeek, Kimi, Alibaba, and Baidu unveiling new models, reflecting a shift in the internet value chain towards execution capabilities and new business models [5] - The increasing demand for computing power is supported by significant capital expenditures planned by major companies, with ByteDance planning 160 billion yuan for 2026 and Alibaba investing 380 billion yuan in AI infrastructure over three years [5] - The AI ETF tracking the entrepreneurial board AI index covers the entire AI industry chain, including hardware, software, and applications, with major holdings benefiting from the expansion of cloud vendor capital expenditures [6]
中国大模型“春节档”打响!等待消费级AI出“爆款”
Hua Er Jie Jian Wen· 2026-02-12 01:41
Core Insights - The Chinese AI industry is experiencing an unprecedented wave of flagship model releases, marking a competitive race among major players to convert technological advancements into consumer products [1][10] - The 2026 Spring Festival is anticipated to be a critical period for AI model launches, with multiple companies preparing to unveil significant updates simultaneously [2][10] Group 1: Market Dynamics - The 2025 strategy of DeepSeek's Spring Festival launch has set a precedent, leading other companies to adopt similar tactics for product releases [2] - ByteDance has initiated the competition by launching a trio of models: Seedance 2.0, Seedream 5.0, and Doubao 2.0, with Seedance 2.0 already signaling potential success [2][3] - Alibaba is set to release Qwen 3.5 in mid-February, supported by a substantial customer acquisition incentive of 3 billion yuan [3] - Zhiyu has introduced GLM-5, expanding its parameters from 355 billion to 744 billion [4] - DeepSeek is expected to launch its V4 version in mid-February, focusing on improvements in coding and long prompt handling [6] - MiniMax has recently launched its M2.5 model on the Agent platform [8] Group 2: Competitive Landscape - The simultaneous release of multiple models is likely to create a "winner-takes-all" scenario, where underperforming models may face significant disadvantages [10] - The scarcity of attention during the Spring Festival means that labs failing to present credible flagship updates risk being excluded from developers' consideration [12] - DeepSeek's potential release is seen as pivotal, not just for its chatbot capabilities but for the platform economic benefits it may unlock [12] Group 3: Technological Innovations - DeepSeek's new approach, as outlined in its paper on scalable conditional memory, could enhance model efficiency by shifting expensive computations to cheaper retrieval operations [12][14] - If successful, this could transform AI from an expensive "toy" into an affordable "tool," facilitating broader integration into high-frequency consumer products [14] Group 4: Beneficiaries and Implications - Tencent is projected to be the biggest beneficiary of the model competition, leveraging its high-frequency communication platforms, WeChat and QQ, to enhance user experience through improved model performance [15][16] - For Alibaba and Baidu, while stronger models could enhance user experience, they may also face pricing pressures if DeepSeek instigates a price war in the API service market [17] - Vertical giants like Trip.com, Beike, and Kuaishou stand to benefit from powerful open-source models that lower technical barriers and accelerate product iteration [17] Group 5: Market Sentiment and Future Outlook - Despite the excitement in the capital markets, there is a cautious sentiment regarding the actual performance of consumer-facing AI models, with large-scale user testing during the Spring Festival seen as a critical evaluation point [18][19] - The true signal of adoption will be whether major players integrate AI as a default feature in high-frequency interfaces, which would drive sustained demand for reasoning capabilities [19] Group 6: Valuation and Long-term Perspective - Morgan Stanley maintains an "overweight" rating for model developers Zhiyu and MiniMax, with target prices set at 400 HKD and 700 HKD respectively, based on a 30x P/E ratio for 2030 [21]
全球大公司要闻 | 苹果推迟新版Siri上线,Meta百亿押注AI基建
Wind万得· 2026-02-12 00:54
Group 1 - Meta plans to invest over $10 billion in building a data center park in Indiana, providing 1 GW of power capacity to support AI projects and core social media operations, while hedge fund Pershing Square disclosed a stake in Meta representing 10% of its capital, believing the market underestimates AI's long-term potential [2][3] - Apple faces delays in upgrading its Siri virtual assistant, with multiple new features potentially postponed until iOS 26.5 or iOS 27 due to issues with query handling, response times, and accuracy [2] - ByteDance is reportedly developing an AI chip and negotiating with Samsung for production, aiming to produce at least 100,000 chips this year and gradually increase output to 350,000, although a spokesperson claimed the information is inaccurate [3] Group 2 - NetEase's Q4 2025 revenue reached 27.5 billion yuan, a 3% year-on-year increase, but net profit attributable to shareholders fell nearly 30% to 6.2 billion yuan, missing expectations due to increased sales expenses and investment losses [5] - Zhiyuan Technology launched its new flagship model GLM-5, integrating DeepSeek sparse attention mechanism, targeting programming and intelligent agent capabilities, with internal evaluations indicating performance close to Claude Opus 4.5 [5] - Huazhu Group is under scrutiny from the Beijing Consumer Association for potentially unfair terms in its membership service agreement, prompting the company to initiate a self-examination and commit to improving the consumer environment [6] Group 3 - Amazon received approval from the US FCC to deploy an additional 4,500 low-Earth orbit satellites, expanding its constellation to 7,700 to enhance space internet competition [8] - Cisco reported Q2 revenue of $15.3 billion, exceeding analyst expectations, with product revenue of $11.64 billion, driven by a surge in orders from AI hyperscalers [8] - Ford anticipates achieving a record revenue of $187.3 billion in 2025, but expects a net loss of $8.182 billion, a 239.17% year-on-year decline, primarily due to rising supply chain costs and increased R&D investments [9] Group 4 - Samsung Electronics announced the Galaxy S26 series launch on February 26, featuring the 2nm Exynos 2600 chip and a 200-megapixel camera in the Ultra model, with continued strong demand for memory chips expected until 2027 [12] - Toyota is set to launch a pure electric version of the Highlander for the North American market, targeting a range of 320 miles, with plans to guide users of fuel/mixed models to the Grand Highlander series [12] - LG Energy Solution announced the acquisition of a 49% stake in a Canadian energy storage battery factory from Stellantis to strengthen its energy storage business [13]
AI大模型,密集“上新”
Zhong Guo Zheng Quan Bao· 2026-02-12 00:51
Group 1 - The core viewpoint of the news is the rapid growth and advancements in AI models, particularly highlighting the launch of new flagship models by various companies, which are driving significant market interest and stock price increases [1][2][3] Group 2 - Zhiyu released its next-generation flagship model GLM-5, which has shown a tenfold increase in user flow in a short period, prompting the company to expand its capacity to handle the load [1] - Zhiyu's stock price surged by 53.74% over three trading days, reaching a peak of 354 HKD per share, with a market capitalization of 139.3 billion HKD [1] - Alibaba launched its Qwen3-Max-Thinking model with over one trillion parameters and 36 trillion tokens of pre-training data, marking it as the largest reasoning model from Alibaba [1] - The Kimi K2.5 model from Moonlight Dark Side supports both visual and text input, and introduces "Agent cluster" capabilities for team operations [2] - DeepSeek's new model supports a context length of 1 million tokens, allowing for the processing of extensive content, such as entire book series or large documents [2] - ByteDance's AI video generation model Seedance 2.0 can create movie-quality videos from text or images, indicating advancements in multi-modal video models [3] - The Chinese government is promoting AI applications in the bidding and tendering sector, which is expected to catalyze further growth in the AI market [3] - Securities firms are optimistic about the rapid deployment of intelligent products and the transformative impact of AI on user interactions with software [3]
DeepSeek不发V4,六小龙不敢过年
3 6 Ke· 2026-02-12 00:26
Core Insights - DeepSeek is evolving beyond being just a "chatbot" base and is optimizing its large model's energy efficiency through architectural innovations, as evidenced by the recent release of new models and frameworks [1][3] - The competitive landscape is intensifying, with DeepSeek's new models being crucial for maintaining its industry position against major players like Google and OpenAI [1][2] Group 1: Technological Developments - In January 2024, DeepSeek released the Engram architecture, which separates "conditional memory" from "computation," aiming to reduce errors and save computational power [3] - The new model, referred to as MODEL1, is speculated to either be a lightweight model suitable for edge devices or a "long-sequence expert" designed for processing lengthy documents or code [3] - DeepSeek's commitment to cost-effective AI solutions is evident, as it aims to lower token costs, making AI development more accessible to a broader range of developers [4] Group 2: Market Position and Competition - The release of new models is seen as essential for DeepSeek to avoid falling behind competitors like Gemini 3 and GPT-5, which have demonstrated superior performance in various benchmarks [7][8] - Despite DeepSeek's strong position in the open-source community, the company faces pressure from the rapid advancements of closed-source models, which could lead to a loss of developer loyalty [10][11] - The competitive dynamics are shifting, with major internet companies increasing their investments in AI, potentially impacting DeepSeek's market share and the overall landscape for domestic AI companies [13][14] Group 3: Ecosystem and Community Impact - DeepSeek's open-source models, such as DeepSeek-V3 and R1, have gained significant traction, accounting for over half of the open-source token throughput in a short period [8][9] - The company has established a decentralized and pragmatic technical ecosystem, attracting developers interested in self-controlled and private deployments [4][6] - The ongoing developments in the open-source AI community are reshaping the narrative around Chinese AI capabilities, with DeepSeek playing a pivotal role in this transformation [5][6]
早报 | 强劲非农数据重挫降息预期;DeepSeek、智谱等集体上新;永辉超市CEO致歉;比尔·盖茨时隔两年半再度到访中国
虎嗅APP· 2026-02-12 00:08
大家早上好!这里是今天的早报,每天早上,我都会在这里跟你聊聊昨夜今晨发生了哪些大事儿。 昨夜今晨 【美国1月非农录得13万远超预期,市场削减美联储降息押注】 美东时间周三,美国劳工统计局公布的数据显示,美国1月非农就业人数录得13万人,大幅好于市场预期,为 此前就业增长疲弱的一年画上阶段性句号,也为新一年开局注入更强动能,一定程度缓解了外界对劳动力市场 放缓的担忧,支持美联储维持利率不变的政策路径。 具体数据显示,经季节性调整后,美国1月非农新增就业岗位13万个,远超市场预期的5.5万人,前值(12月份) 被小幅下修至4.8万人。 美国1月失业率录得4.3%,略低于市场预期的4.4%,创2025年8月以来新低。 数据公布后,现货黄金短线跳水近40美元,美元指数短线急升50点,非美货币普遍跳水,美国国债收益率也显 著走高。 【苹果据悉开发新版Siri再次遇挫,多项AI功能或推迟发布】 据媒体援引消息人士报道,苹果筹备已久的升级版Siri计划再次遇到挫折,该项目在最近几周的测试过程中遭 遇问题,可能导致多项备受期待的新功能推迟发布。 知情人士透露,苹果原计划在3月推出的iOS 26.4系统更新中加入这些新功能,但 ...
陆家嘴财经早餐2026年2月12日星期四
Wind万得· 2026-02-11 23:33
Group 1 - The State Council emphasizes the need to comprehensively promote AI technology innovation, industrial development, and application empowerment to foster new productive forces and drive high-quality development [3] - The State Council aims for a unified national electricity market system to be fully established by 2035, transitioning to unified pricing and joint trading [13] - The National Bureau of Statistics reports that China's CPI rose by 0.2% year-on-year in January, while PPI fell by 1.4%, with the data reflecting a base period adjustment [4][13] Group 2 - The automotive industry in China saw production and sales of 2.45 million and 2.346 million vehicles in January, respectively, with a slight year-on-year increase in production and a decrease in sales [13] - The banking wealth management scale decreased by 100 billion yuan in January, indicating a rebalancing of funds among deposits, wealth management, insurance, and equity assets [13] - The Hong Kong Monetary Authority is actively processing license applications for stablecoin issuers, aiming to position Hong Kong as a global innovation center for digital assets [15] Group 3 - The capital market continues a "zero tolerance" regulatory approach, with numerous penalties issued to listed companies and intermediaries for various violations, reflecting an increase in accountability and comprehensive regulation [9] - The Hong Kong IPO market has seen a rare "zero break" phenomenon, with 22 new stocks listed this year not experiencing any price drops on their first day [9] - The MSCI announced its quarterly index adjustments, including the addition of 37 stocks to the MSCI China Index, which will take effect after the market closes on February 27 [9]
智谱发布新一代旗舰模型GLM-5,重点提升编程与智能体能力
Hua Er Jie Jian Wen· 2026-02-11 17:06
2月11日,智谱正式推出新一代旗舰模型GLM-5,主攻编程与智能体能力,官方称已实现开源领域最优 表现。这是继DeepSeek后,国产AI大模型春节档的又一重要发布。 GLM-5参数规模由上一代的355B扩展至744B,激活参数从32B提升至40B。智谱方面证实,此前在全球 模型服务平台OpenRouter登顶热度榜首的神秘模型"Pony Alpha"即为GLM-5。 架构配置方面,GLM-5构建78层隐藏层,集成256个专家模块,每次激活8个,激活参数约44B,稀疏度 5.9%,上下文窗口最高支持202K token。 编程能力显著提升 新一代旗舰模型GLM-5在内部Claude Code评估集中表现突出。前端、后端及长程任务等编程开发场景 下,该模型较上一代GLM-4.7实现全面超越,平均性能提升逾20%。 GLM-5能够以极少人工干预,自主完成Agentic长程规划与执行、后端重构、深度调试等复杂系统工程 任务。官方称,真实编程环境中的使用体感已逼近Claude Opus 4.5水平。 智谱将GLM-5定位为最新一代旗舰级对话、编程与智能体模型,重点强化其在复杂系统工程与长程 Agent任务中的处理能力 ...
腾讯研究院AI速递 20260212
腾讯研究院· 2026-02-11 16:08
Group 1: Google Chrome and WebMCP Protocol - Google Chrome team has released the WebMCP (Web Model Context Protocol), allowing AI agents to interact directly with website kernels via the navigator.modelContext API, bypassing human user interfaces [1] - WebMCP addresses the high costs and low stability issues of traditional agent screenshot recognition, marking a transition from "visual simulation" to "logical direct connection," referred to as "API in UI" [1] - This standard is being jointly promoted by Google and Microsoft, indicating a potential future division of the internet into UI layers for humans and tool layers for agents, heralding the arrival of the "Agentic UI" era [1] Group 2: Runway's Financing and Model Development - Video generation unicorn Runway has secured $315 million in Series E funding, achieving a valuation of $5.3 billion, with participation from Nvidia, AMD, and Adobe, bringing total funding to $815 million [2] - Runway's Gen-4.5 ranks third in the AI-generated video leaderboard, surpassing models like Google Veo 3 and OpenAI Sora 2 Pro [2] - The new funding will be used to train the next generation of world models, having already launched the general world model GWM-1, which includes variants for explorative environments, dialogue characters, and robotic operations [2] Group 3: xAI Leadership Changes - xAI co-founders Jimmy Ba and Wu Yuhua announced their departures within 48 hours, with 6 out of 12 founding team members having left, including 5 in the past year [3] - Responsibilities of the departing co-founders have been redistributed among other co-founders, and SpaceX's acquisition of xAI has been completed, with an IPO plan set to advance in the coming months [3] - xAI's flagship product Grok has recently exhibited strange behaviors, and the talent loss poses challenges for the upcoming IPO [3] Group 4: DeepSeek's New Model - DeepSeek has quietly launched a new model supporting a 1 million token context window, with knowledge cutoff in May 2025, capable of processing content equivalent to the entire "Three-Body Problem" trilogy [4] - This model remains a pure text model, unable to view images directly but capable of reading text from images and documents, with enhanced Agentic Coding capabilities [4] - The industry trend is shifting from LLM reasoning to Agentic reasoning, as indicated by the latest models from Anthropic and OpenAI, suggesting humans will act as architects directing AI teams in software development [4] Group 5: Zhiyu's GLM-5 Model - Zhiyu has confirmed that the mysterious model "Pony Alpha," which topped the OpenRouter popularity chart, is its new model GLM-5, achieving state-of-the-art performance in coding and agent capabilities [5] - GLM-5's performance in real programming scenarios closely approaches that of Claude Opus 4.5, excelling in complex systems engineering and long-range agent tasks with high tool invocation accuracy [5] Group 6: Ant Group's Omni Model - Ant Group has open-sourced the full-modal model Ming-flash-omni 2.0, the first in the industry to generate voice, environmental sound effects, and music simultaneously on the same audio track [7] - This model excels in visual language understanding, controllable speech generation, and image editing, surpassing capabilities of Gemini 2.5 Pro and Qwen3-Omini-30B-A3B-Instruct [7] - The model employs a unified architecture for deep multi-modal integration, supporting zero-shot voice cloning and fine attribute control, and has been open-sourced on platforms like HuggingFace [7] Group 7: iFlytek's Starfire X2 Model - iFlytek has released the Starfire X2 model, trained on entirely domestic computing power, with overall capabilities matching international top levels, particularly in mathematics, reasoning, and agent tasks [8] - Starfire X2 utilizes a 293 billion MoE sparse architecture, improving inference performance by 50% compared to X1.5, and continues to enhance capabilities in over 130 languages, maintaining industry leadership in key languages for Latin America and ASEAN [8] - Industry applications have been significantly upgraded, with medical capabilities passing authoritative evaluations and educational applications achieving personalized learning through error analysis [8] Group 8: Meituan's LongCat Research Agent - Meituan's LongCat has launched a "deep research" feature, scoring 73.1 in the BrowseComp evaluation, approaching top closed-source models, supporting up to 400 interactions and 256K context [9] - Leveraging Meituan's native capabilities in local life, it creates a real training environment and employs a Rubrics-as-Reward mechanism to address AI hallucination issues, ensuring all recommendations are verifiable [9] - The model utilizes a multi-agent specialized division of labor, automating the entire process from information gathering to research analysis and visualization, capable of generating professional reports for restaurant recommendations and travel planning [9] Group 9: ByteDance's Protenix-v1 Model - ByteDance's Seed team has released Protenix-v1, an open-source model that matches the performance of AlphaFold 3 under strict training data and model size constraints [10] - This model successfully unlocks scaling capabilities during inference, with the prediction success rate for antibody-antigen complexes increasing from 36% with a single seed to 47.68% with 80 seeds [10] - The team has adopted a dual-version strategy, with the standard version aligning with academic benchmarks and the extended version utilizing data from June 2025 for practical drug discovery applications, along with the launch of the PXMeter evaluation toolkit [10]