Seek .(SKLTY)
Search documents
DeepSeek新模型曝光;AI产业链业绩兑现丨新鲜早科技
2 1 Shi Ji Jing Ji Bao Dao· 2026-01-22 02:30
Group 1: Technology Developments - DeepSeek has updated its GitHub repository, revealing a new model architecture "MODEL1," which is expected to be more efficient and suitable for edge devices compared to its predecessor DeepSeek-V3.2 [2] - Longji Technology announced significant progress in Co-packaged Optics (CPO) technology, with successful customer sample deliveries and testing, addressing the growing demand for high-bandwidth, low-latency optical interconnects [11] - Shanghai Yiyou Intelligent Control Technology has launched its first automated production line for robot joints in Zhangjiang, aiming to meet the increasing demand and reduce costs for humanoid robots [10] Group 2: Financial Performance and Projections - Moole Technology expects a net loss of 950 million to 1.06 billion yuan for 2025, despite launching a leading GPU product and experiencing revenue growth due to the AI industry's expansion [17] - Demingli anticipates a net profit of 650 million to 800 million yuan for 2025, representing a year-on-year increase of 85.42% to 128.21%, driven by advancements in storage solutions and AI demand [18] - Tianfu Communication projects a net profit of 1.881 billion to 2.150 billion yuan for 2025, reflecting a growth of 40% to 60% due to the accelerating AI industry and global data center construction [19] Group 3: Regulatory and Market Responses - The European Union plans to phase out "high-risk suppliers" in critical sectors, interpreted as targeting Chinese tech firms like Huawei, which has expressed concerns over the fairness of such regulations [2] - Pinduoduo was fined 100,000 yuan for failing to report tax information as required, highlighting regulatory scrutiny on internet platform companies [4] - Zhiyu Technology announced a temporary limit on the sale of its GLM Coding Plan due to high demand and resource constraints, reducing daily sales to 20% of current levels [3]
西贝获新一轮融资,新荣记张勇等入股;马斯克与奥特曼互喷;DeepSeek新模型曝光;黄仁勋:AI时代蓝领更吃香;俞敏洪开办“退休俱乐部”
Sou Hu Cai Jing· 2026-01-22 02:27
Group 1 - The Ministry of Industry and Information Technology (MIIT) has announced the establishment of a safety monitoring platform for the operation status of new energy vehicles, effective from January 1, 2027 [4] - Xibei Catering Group has completed a new round of financing, with investors including Taizhou Xinrongtai Investment and former Ant Group CEO Hu Xiaoming, although the specific amount remains undisclosed [4][5] - The financing has increased Xibei's registered capital from 89.90 million yuan to 101.68 million yuan, marking a 13.1% increase [5] Group 2 - The price of gold jewelry in China is approaching 1500 yuan per gram, with brands like Chow Tai Fook and Lao Feng Xiang reporting significant price increases [7] - OpenAI has announced plans to expand its AI infrastructure in the U.S. to 10 gigawatts by 2029, committing to cover energy costs to prevent price hikes [12] - Nvidia's CEO Jensen Huang emphasized the rising demand for skilled tradespeople in the AI era, predicting that plumbers and electricians could earn six-figure salaries due to the infrastructure needs of AI [10] Group 3 - Apple plans to upgrade Siri into a chatbot by the second half of 2026, utilizing Google's Gemini model [10] - DeepSeek has revealed a new model, MODEL1, which is designed for efficient inference and optimized for edge devices [9] - The VCSEL chip provider Raysees Technology has completed a multi-hundred million yuan Series C financing round [20]
【钛晨报】住建部:有序搭建房地产开发、融资、销售等基础制度;DeepSeek AI新模型:搭载 MODEL1 全新架构,最快2月上线;财政部:在武汉天河国际机场等41个口岸各新设1家口岸进境免税店
Sou Hu Cai Jing· 2026-01-21 23:58
Real Estate Development - The Ministry of Housing and Urban-Rural Development emphasizes the importance of accelerating transformation and upgrading for high-quality real estate development, focusing on two main areas: orderly promotion of "good housing" construction and the establishment of a new model for real estate development [2] - The construction of "good housing" involves collaboration among government, enterprises, and society, with a comprehensive deployment to enhance housing quality through standards, design, materials, construction, and operation [2] - The new model for real estate development aims to ensure a smooth transition from old to new models, focusing on a mechanism that links people, housing, land, and finance [2] Real Estate Financing and Sales - The project company system will be implemented to ensure independent legal rights and responsibilities, prohibiting headquarters from misappropriating project funds before delivery [3] - A lead bank system will be introduced for real estate financing, where one bank or syndicate will be responsible for managing project funds [3] - The promotion of a "current housing sales" system aims to mitigate delivery risks, while pre-sale funds will be regulated to protect buyers' rights [3] Market Trends - The AI technology market is expected to grow significantly, with new personal AI devices emerging and the overall market scale likely to expand further between 2026 and 2027 [3] - The integration of energy and computing networks is crucial for enhancing global competitiveness, as highlighted by industry leaders [4] Mergers and Acquisitions - Energy Fuels has agreed to acquire Australian Strategic Materials for AUD 447 million (approximately USD 300.9 million), marking a significant move to secure the supply chain for rare earth elements [7] Policy Developments - The Ministry of Finance announced the establishment of duty-free shops at 41 ports, allowing residents from Macau to purchase duty-free goods [8] - New tax policies for innovative enterprises' CDRs will be implemented from January 1, 2026, to December 31, 2027, including exemptions on capital gains tax for individual investors [9] Financial Sector Updates - The People's Bank of China is focusing on modernizing the payment system and enhancing cross-border payment capabilities [10] - The National Financial Regulatory Administration has released new regulations to improve the administrative licensing process, enhancing the efficiency of market access [10] Industrial and Technological Development - The Ministry of Industry and Information Technology is promoting humanoid robot technology and aims to strengthen the ecosystem for humanoid robots [11] - A notification has been issued to automate the monitoring of computing power resources across 31 provinces by the end of 2026 [12] Economic Performance - Beijing's GDP reached CNY 5.20734 trillion in 2025, growing by 5.4% year-on-year, with the tertiary sector showing the highest growth at 5.8% [18]
DeepSeek新模型曝光?“MODEL1”现身开源社区
Shang Hai Zheng Quan Bao· 2026-01-21 21:31
Core Insights - DeepSeek has updated its FlashMLA code on GitHub, revealing the previously undisclosed "MODEL1" identifier, which may indicate a new model distinct from the existing "V32" [3][4] - The company plans to launch an "open source week" in February 2025, gradually releasing five codebases, with Flash MLA being the first project [4] - Flash MLA optimizes memory access and computation processes on Hopper GPUs, significantly enhancing the efficiency of variable-length sequence processing, particularly for large language model inference tasks [4] Company Developments - DeepSeek's upcoming AI model, DeepSeek V4, is expected to be released around the Lunar New Year in February 2025, although the timeline may vary [4] - The V4 model is an iteration of the V3 model released in December 2024, boasting advanced programming capabilities that surpass current leading models like Anthropic's Claude and OpenAI's GPT series [5] - Since January 2026, DeepSeek has published two technical papers introducing a new training method called "optimized residual connections (mHC)" and a biologically inspired "AI memory module (Engram)" [5] Industry Context - The introduction of the Engram module aims to improve knowledge retrieval and general reasoning, addressing inefficiencies in the Transformer architecture [5] - The support from Liang Wenfeng's private equity firm, which has achieved a 56.55% average return in 2025, has bolstered DeepSeek's research and development efforts [5]
DeepSeek新模型“MODEL1”曝光
Di Yi Cai Jing Zi Xun· 2026-01-21 09:05
Core Insights - The article discusses the emergence of a new model named "MODEL1" from DeepSeek, coinciding with the one-year anniversary of the DeepSeek-R1 release, indicating potential advancements in AI model architecture [2][6]. Group 1: Model Development - "MODEL1" has been referenced in the updated FlashMLA code on GitHub, suggesting it may represent a new model distinct from the existing "V32" architecture [2][3]. - There are differing opinions in the industry regarding whether "MODEL1" is a version 4 model or an advanced inference model, with some developers speculating it could be the ultimate version of the V3 series [2][5]. - Key technical differences between "MODEL1" and "V32" include variations in key-value (KV) cache layout, sparsity handling, and support for FP8 data format decoding, indicating targeted design for memory optimization and computational efficiency [5]. Group 2: Anticipated Release and Features - The structure of the model files suggests that "MODEL1" is nearing completion or inference deployment, awaiting final weight freezing and testing validation, which implies a forthcoming launch [5]. - There are expectations for DeepSeek to release its next flagship model, DeepSeek V4, in February, with preliminary tests indicating it may surpass other top models in programming capabilities [6]. - Recent technical papers from DeepSeek introduce new training methods and an AI memory module, hinting that these innovations may be integrated into the upcoming model [6]. Group 3: Industry Impact - The DeepSeek-R1 model has been recognized as the most praised model on Hugging Face, significantly lowering barriers in inference technology and production deployment, thus influencing the open-source strategy of major Chinese companies [9]. - Over the past year, Chinese AI models have seen increased downloads on Hugging Face, surpassing those from the U.S., indicating a shift in reliance on Chinese-developed open-source models within the global supply chain [9].
传DeepSeek曝新模型,梁文锋再放“王炸”?
Xin Lang Cai Jing· 2026-01-21 07:55
Core Insights - DeepSeek has generated significant buzz in the AI community with the unexpected exposure of a new model named Model1 during a code update, suggesting a potential new technological path distinct from the existing V3 series [1][6][8] - Speculation is rife that DeepSeek is preparing to launch its next-generation AI model, V4, around mid-February, following a year of iterative improvements to the V3 model [3][8] Model Development Timeline - On March 25, 2025, DeepSeek released V3-0324, enhancing code generation usability and surpassing GPT-4.5 in mathematical and coding capabilities [4] - On May 29, 2025, the R1 model underwent a minor upgrade, improving performance in mathematics, programming, and general logic, with hallucination rates reduced by 45-50% [4] - On August 21, 2025, DeepSeek V3.1 was launched, offering faster response times and stronger agent capabilities, along with support for Anthropic's API [4] - On September 22, 2025, the V3.1-Terminus version was released, addressing issues with mixed-language inputs and enhancing the performance of Code and Search Agents [4] - On September 29, 2025, the V3.2-Exp version introduced a new attention mechanism, with updated API pricing structures [4] - On December 1, 2025, the official V3.2 version was released, achieving inference capabilities comparable to GPT-5 and integrating thinking modes for tool usage [4][9] Research Contributions - Two papers authored by Liang Wenfeng were published between late December 2025 and early January 2026, addressing training stability and knowledge retrieval efficiency in large model architectures [5][10] - The first paper proposed a manifold-constrained hyper-connections framework to enhance training stability by constraining residual connections within a specific manifold [10][11] - The second paper introduced a conditional memory module that improves inference and knowledge task performance by decoupling knowledge storage from neural computation [10][11] Market Expectations - The AI community is eagerly anticipating whether DeepSeek will unveil the new Model1 or V4 during the upcoming Spring Festival, with expectations of a significant impact on the global AI landscape [6][8]
DeepSeek新模型真的要来了?“MODEL1”曝光
Di Yi Cai Jing Zi Xun· 2026-01-21 07:00
Core Insights - The article discusses the emergence of a new model named "MODEL1" from DeepSeek, coinciding with the one-year anniversary of the release of DeepSeek-R1, indicating potential advancements in AI technology [1][4]. Group 1: Model Development - "MODEL1" has been referenced in the updated FlashMLA code on GitHub, suggesting it is a new model distinct from the existing "V32" architecture [1][2]. - There are differing opinions in the industry regarding whether "MODEL1" represents a V4 model or an advanced version of the V3 series [2][3]. - The new model is expected to be close to completion, awaiting final weight freezing and testing validation, indicating a near launch [3]. Group 2: Technical Innovations - FlashMLA is a proprietary software tool optimized for NVIDIA Hopper architecture GPUs, crucial for achieving low-cost and high-performance model implementations [3]. - Key technical differences between "MODEL1" and "V32" include variations in key-value (KV) cache layout, sparse processing methods, and support for FP8 data format decoding, suggesting targeted design for memory optimization and computational efficiency [3]. Group 3: Market Impact and Expectations - The anticipation for DeepSeek's next flagship model is high, with expectations that it will integrate recent research findings, including a new training method and an AI memory module [4]. - The release of DeepSeek-R1 has significantly influenced the open-source community, with increased contributions from major Chinese companies and a shift in global reliance towards Chinese-developed open-source models [5][7].
DeepSeek新模型“Model 1”曝光,疑似“高效推理模型”
Xin Lang Cai Jing· 2026-01-21 06:58
Core Insights - DeepSeek has updated its official GitHub repository with a series of FlashMLA code, drawing attention to a model named "Model 1" [1][2] - Model 1 is speculated to be the new model code that DeepSeek is expected to release around the Chinese New Year [2] Model Specifications - Model 1 is one of the two main model architectures supported in DeepSeek FlashMLA, alongside DeepSeek-V3.2 [2] - It is likely to be an efficient inference model with lower memory usage compared to V3.2, making it suitable for edge devices or cost-sensitive scenarios [2] - Model 1 may also function as a long-sequence expert optimized for sequences longer than 16K, making it ideal for tasks such as document understanding and code analysis [2]
AI视频迎来了它的DeepSeek时刻
Jing Ji Guan Cha Wang· 2026-01-21 06:39
Core Insights - PixVerse R1, launched by Aishi Technology, represents a significant advancement in AI video generation, allowing users to create videos in real-time without needing prompts, marking a transformative moment in the AI video industry [1][2][4] Group 1: Product Features - PixVerse R1 can generate videos instantly, adapting to user commands with remarkable speed, creating an immersive digital world where user input directly influences the narrative [1][3] - The model utilizes an Omni native multimodal architecture, integrating text, images, audio, and video into a unified processing framework, enhancing its generative capabilities [3][4] - It employs a self-regressive flow generation method, allowing it to remember previous inputs and generate content with a "long-term memory," which differentiates it from traditional video generation methods [4][7] Group 2: Market Impact - Aishi Technology secured a strategic investment of $14.2 million from Chinese company Ruyi, which will facilitate collaboration in film, streaming, and gaming sectors, indicating strong market interest in PixVerse R1 [5][6] - The partnership aims to explore innovative applications of AI technology in the film industry, highlighting the potential for significant transformation in content creation [6][7] - The product has already attracted attention from various game companies, indicating its potential to revolutionize interactive media and gaming experiences [8][9] Group 3: Competitive Landscape - Aishi Technology is positioned as a leader in the real-time video generation space, with no other companies having launched similar products, showcasing its competitive edge [7][9] - The company has rapidly gained traction, with over 100 million global users and a monthly active user count exceeding 16 million, reflecting its strong market presence [9][10] - The PixVerse R1 is recognized as the first universal real-time world model supporting up to 1080P resolution, setting a new standard in the industry [9][10] Group 4: Future Prospects - The introduction of PixVerse R1 is expected to blur the lines between video production and consumption, allowing users to generate and edit content in real-time, thus redefining user engagement in media [7][11] - The technology is anticipated to enable new forms of interactive storytelling and AI-native games, where narratives evolve based on user interactions, creating a dynamic digital ecosystem [7][8] - Aishi Technology's founder emphasizes that PixVerse R1 represents a new media form, where AI can create a continuously evolving world based on user intent, marking the beginning of a new era in real-time content generation [11]
DeepSeek AI新模型曝光:搭载 MODEL1 全新架构,最快2月上线
Huan Qiu Wang Zi Xun· 2026-01-21 06:37
Core Insights - DeepSeek plans to launch its next-generation flagship AI model, DeepSeek V4, around mid-February during the Lunar New Year, which is expected to significantly enhance coding capabilities and attract industry attention [1][2] Group 1: Model Development - The release of DeepSeek V4 follows the one-year anniversary of the DeepSeek-R1 model, with developers discovering updates related to FlashMLA in 114 files, including 28 references to an unknown "MODEL1" identifier, likely indicating a new AI model with a different architecture [1][2] - The new architecture optimizes key technical aspects such as key-value (KV) cache layout, sparsity handling, and FP8 data format decoding support, addressing memory usage and computational efficiency issues, thereby laying the groundwork for performance improvements [3] Group 2: Research Innovations - DeepSeek's research team has previously published two technical papers introducing innovative training methods like "optimized residual connections (mHC)" and a biologically inspired "AI memory module (Engram)," suggesting that DeepSeek V4 may integrate these latest research findings to enhance its capabilities in handling complex tasks [3]