DeepSeek Has Reached a Deal with Italy, But...
Guan Cha Zhe Wang· 2026-01-08 06:57
Core Insights
- DeepSeek, a Chinese AI startup, has reached an agreement with Italy's antitrust authority (AGCM) to launch a country-specific version of its chatbot for Italian users and address the "hallucination" issues in its AI model [1][2]
- The AGCM concluded its investigation after DeepSeek committed to improving transparency regarding hallucination risks and implementing technical fixes [2][5]
- DeepSeek's measures include providing hallucination risk warnings in Italian and organizing workshops for employees to better understand local consumer laws [2][5]

Company Developments
- DeepSeek has submitted multiple remediation plans to AGCM, gradually meeting regulatory requirements, which led to the termination of the investigation [1][2]
- The company reported over 80 million weekly active users, ranking second among domestic AI applications, and achieved a cumulative token usage of 14.37 trillion, leading the global open-source model rankings [6]

Industry Context
- The "hallucination" issue is a common challenge across the generative AI industry, with AGCM acknowledging that it is a global problem that cannot be completely eliminated [5]
- Despite the challenges, DeepSeek's proactive approach may facilitate its expansion into the European market [5]
- The potential classification of DeepSeek under the EU's Digital Services Act (DSA) remains uncertain, which could subject the company to stricter scrutiny [6]
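Optical Module and CPO Leaders Rebound, ChiNext AI Hits Another Record High! DeepSeek's Flagship System R2 to Debut Around Spring Festival: Is a Big Year for AI Applications Starting?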
Xin Lang Cai Jing· 2026-01-07 11:42
Group 1
- The core viewpoint of the news is that the AI sector, particularly the ChiNext AI index, is experiencing significant growth, driven by advancements in computing hardware and AI applications [1][5][7]
- The ChiNext AI index reached a new high, with a cumulative increase of over 114% from January 1, 2025, to January 7, 2026, outperforming other AI-themed indices [3][7]
- Key stocks in the AI sector, such as Zhishang Technology and Changxin Bochuang, saw substantial gains, with Zhishang Technology leading with an increase of over 7% [1][5]

Group 2
- The upcoming launch of DeepSeek's next-generation flagship system R2 is expected to catalyze further growth in AI applications [7]
- Meta's acquisition of Manus for billions is seen as a strategic move to enhance its AI capabilities and accelerate the commercialization of AI technologies [3][7]
- The demand for computing power is projected to remain strong, with both domestic and international markets investing heavily in computing infrastructure, benefiting companies involved in optical interconnection solutions [3][7]

Group 3
- The ChiNext AI ETF (159363) has shown strong liquidity, with a daily trading volume exceeding 600 million yuan and a recent price increase of 0.79% [1][5]
- The ETF is designed to track the ChiNext AI index, whose annual performance varied from 2018 to 2025 and included a notable increase of 106.35% in 2025 [4][8]
- The ETF's portfolio is heavily weighted towards computing hardware, with over 70% allocated to this sector and more than 20% to AI applications, positioning it well to capture AI market trends [8]
First Bombshell of the New Year! DeepSeek Proposes the mHC Architecture to Solve Large-Model Training Challenges
Sou Hu Cai Jing· 2026-01-07 09:13
Core Insights
- DeepSeek has introduced a new architecture called mHC aimed at addressing stability issues in large-scale model training while maintaining performance improvements [1][11].

Group 1: Problem Identification
- Large models face a dilemma in training stability, where traditional single-channel connections lead to information congestion as model size increases [3][5].
- Previous solutions, like the hyper-connection approach, improved efficiency but introduced new issues such as uncontrolled information amplification or suppression, leading to gradient explosion and training failures [5][7][9].

Group 2: mHC Architecture
- The mHC architecture incorporates an intelligent scheduling system for multi-channel connections, utilizing the Sinkhorn-Knopp algorithm to maintain energy conservation during information transmission [11][13].
- Additional design features include non-negative constraints on input-output mappings to prevent useful signal loss due to coefficient cancellation [15].

Group 3: Infrastructure Optimization
- DeepSeek has optimized its infrastructure by merging multiple computation steps into a single operator, reducing memory read/write cycles and employing recomputation strategies to lower memory usage [16][18].
- These optimizations have resulted in significant stability improvements with minimal increases in training time, even at an expansion factor of 4 [18].

Group 4: Performance Validation
- Testing on various model sizes, particularly a 27 billion parameter model, demonstrated that mHC effectively resolved training instability issues, achieving lower loss values compared to traditional baseline models [21][22].
- The performance advantages of mHC were consistent across different model sizes, indicating its practical value for both small and large models [24].

Group 5: Industry Implications
- The introduction of mHC suggests a shift in the industry towards refined architectural designs rather than merely increasing parameters and computational power, potentially lowering entry barriers for smaller companies in the large-scale model domain [26][29].
- This pragmatic technological innovation is expected to facilitate the deployment of AI technologies, making it easier for more enterprises to engage in large-scale model development [29].
Jensen Huang's Year-Opening Keynote Is Packed with Chinese Content, Using DeepSeek and Kimi to Put the Next-Generation Chips to the Test
36 Ke· 2026-01-07 01:35
Core Insights
- The presentation at CES 2026 highlighted the significant advancements of Chinese AI models, particularly Kimi K2 and DeepSeek, which are now competing closely with closed-source models in performance [1][8]
- The introduction of the MoE (Mixture of Experts) architecture has become a mainstream choice, with over 60% of open-source AI models adopting this structure since 2025, leading to a substantial increase in intelligence levels [16][31]

Group 1: Model Performance and Advancements
- Kimi K2 Thinking's inference throughput increased tenfold, with token costs dropping to one-tenth of previous levels, indicating a shift towards a "price parity era" for AI inference [4][6]
- DeepSeek-R1 and Kimi K2 represent top-tier attempts under the MoE architecture, significantly reducing computational load and memory bandwidth requirements [2][12]
- The performance of Kimi K2 Thinking was validated in tests, showing a tenfold increase in performance on the GB200 NVL72 platform [9][19]

Group 2: Global Recognition and Impact
- DeepSeek and Kimi K2 were recognized in a rigorous benchmark test, with Kimi K2 Thinking achieving the title of "best-performing non-U.S. model" due to its low misguidance rate [21][24]
- The rapid development of Chinese open-source models is closing the gap with the strongest closed-source models, providing a significant first-mover advantage [31]
- The increasing international acceptance of Chinese AI models is evidenced by endorsements from prominent figures in the tech industry, indicating a growing influence in the global market [24][33]

Group 3: Trends and Future Directions
- The transition from high benchmark scores to practical usability is evident, with models like Qwen evolving from being known for high scores to being recognized for their quality [32]
- The emergence of features such as "interleaved thinking" in Kimi K2 Thinking reflects a trend towards more sophisticated model capabilities, enhancing their applicability in real-world scenarios [34]
- The rise of open-source models is pressuring U.S. closed-source giants, as the value proposition of paid models becomes harder to justify against the performance of open-source alternatives [35]
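
To make the claim about reduced computational load concrete, here is a minimal, generic sketch of top-k expert routing, the basic mechanism behind Mixture of Experts layers. It is not the router used by DeepSeek-R1 or Kimi K2: the function names, the toy scalar "experts", the gate scores, and the choice of k=2 out of 8 experts are all assumptions for illustration only.

```python
import math


def top_k_route(gate_logits, k=2):
    """Pick the k highest-scoring experts and give them softmax weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    (Hypothetical helper; real routers add load balancing and batching.)
    """
    ranked = sorted(range(len(gate_logits)), key=lambda i: gate_logits[i], reverse=True)
    chosen = ranked[:k]
    # Softmax over the chosen logits only; subtract the max for stability.
    peak = max(gate_logits[i] for i in chosen)
    exps = [math.exp(gate_logits[i] - peak) for i in chosen]
    total = sum(exps)
    return [(i, e / total) for i, e in zip(chosen, exps)]


def moe_forward(x, experts, gate_logits, k=2):
    """Run only the routed experts and blend their outputs.

    Returns the blended output and the number of experts actually evaluated.
    """
    routes = top_k_route(gate_logits, k)
    output = sum(weight * experts[i](x) for i, weight in routes)
    return output, len(routes)


# Eight toy experts; each one just scales its input by a different factor.
experts = [lambda x, scale=scale: scale * x for scale in (1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0)]
# Illustrative gate scores for a single token.
gate_logits = [0.1, 2.0, -1.0, 0.3, 1.5, 0.0, -0.5, 1.0]

y, active = moe_forward(1.0, experts, gate_logits, k=2)
print(f"experts evaluated: {active} of {len(experts)}")
print(f"blended output: {y:.4f}")
```

Only two of the eight experts run for this token, so the per-token compute scales with k rather than with the total number of experts, even though all eight experts contribute parameters to the model.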
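Lei Jun Responds to Small-Font Marketing: An Industry Bad Habit, but We Will Change / DeepSeek Opens the Year with a Trump Card as a Paper Co-Signed by Liang Wenfeng Is Released / Musk Sets a New Year's Goal: Mass Production of Brain-Computer Interfaces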
Sou Hu Cai Jing· 2026-01-06 13:46
Group 1
- Lei Jun, the founder of Xiaomi, addressed the controversy surrounding "small font marketing," stating it is an industry habit that needs to be changed, emphasizing the importance of legal compliance while acknowledging the need for clearer communication with consumers [3][4]
- Xiaomi plans to standardize product annotations using larger fonts in the future, aiming to improve clarity and consumer understanding [4]
- In a recent live stream, Lei Jun revealed that Xiaomi's automotive division aims to deliver over 410,000 vehicles by 2025, with the Xiaomi YU7 model becoming the best-selling mid-to-large SUV for four consecutive months [5][7]

Group 2
- BMW China announced a systematic price adjustment for 31 key models starting January 1, 2026, with the highest price drop reaching 300,000 yuan, reflecting a long-term strategy rather than a short-term price war [11][12]
- The flagship electric model i7 M70L saw a price reduction from 1.899 million yuan to 1.598 million yuan, a decrease of approximately 16%, while the iX1 eDrive25L's price dropped by 24% [12]
- The automotive industry is experiencing significant shifts, with multiple companies reporting their sales figures for 2025, indicating a competitive landscape [7]

Group 3
- OpenAI is reportedly working on multiple AI hardware projects, including a pen-shaped device and portable audio equipment, aiming to create an ecosystem of products rather than a single offering [9][10]
- The new audio model being developed by OpenAI is expected to provide more natural and expressive responses, enhancing user interaction with AI devices [10]

Group 4
- Elon Musk announced that Neuralink plans to begin large-scale production of brain-machine interface devices in 2026, with a focus on simplifying the surgical process for implantation [16][18]
- The company aims to enable users to control computers directly through neural signals, with previous successful trials involving a limited number of patients [18]

Group 5
- Microsoft CEO Satya Nadella emphasized that 2026 will be a pivotal year for AI, marking a transition from initial exploration to widespread application, with a focus on reshaping human-AI relationships and engineering paradigms [27][29][30]
- Nadella highlighted the need for AI to demonstrate tangible positive impacts in the real world to gain societal acceptance [30]
Italy Ends Its Investigation into DeepSeek, Which Concerned Disclosure of Hallucination Risks
21 Shi Ji Jing Ji Bao Dao· 2026-01-06 12:15
Group 1
- Italy's antitrust authority AGCM has concluded its investigation into DeepSeek, accepting binding commitments from the company to improve disclosures regarding the risks of AI "hallucinations" [2][3]
- The investigation was initiated in June 2025 due to DeepSeek's failure to warn users about the potential for generating false information [2]
- DeepSeek's commitments include measures to enhance the clarity, transparency, and timeliness of information related to hallucination risks [3]

Group 2
- DeepSeek operates under two companies based in China and has no branches in other countries, providing AI services to non-professional users in Italy [3]
- The company launched DeepSeek Chat in Italy on November 2, 2023, and released the DeepSeek App globally on January 15, 2025, although the app was removed from Italian app stores due to the investigation [3]
- Since its launch, DeepSeek has gained significant popularity, with 145 million monthly active users in China by Q3 2025, making it the second-largest AI application domestically [3]
- DeepSeek leads globally in open-source large model usage, with a cumulative call volume of 14.37 trillion tokens from November 2024 to November 2025 [3]
Jensen Huang Praises DeepSeek Again; the Next-Generation "Compute Behemoth" Is in Mass Production, with Performance Surging 5x!
Feng Huang Wang· 2026-01-06 02:19
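At 5 a.m. Beijing time on January 6, NVIDIA CEO Jensen Huang used a 90-minute keynote ahead of the opening of CES 2026 to sketch an aggressive and complete picture of the future for the global technology industry. In this information-dense speech, he not only declared that the focus of AI development is moving from purely "digital intelligence" toward a new era of "physical AI" that interacts with the physical world, but also presented a series of open-source heavyweights, from the world model Cosmos and the autonomous driving system AlphaMio to the next-generation AI chip architecture Vera Rubin, to show NVIDIA's ambition as a full-stack giant: to build every cornerstone of this new era, from the underlying chips and infrastructure to the top-level models and applications. As the AI era's demand for computing power keeps expanding without limit, NVIDIA is still trying, through extreme upgrades to its computing platform, to capture the vast compute foundation of the digital world.

Summary: "Rubin arrives at just the right time, because AI computing demand for training and inference is surging."

The lesson from DeepSeek: open source is the main engine of innovation

At the start of the speech, Huang set the tone from a historical perspective: "Every 10 to 15 years, the computing industry undergoes a platform shift." He stressed that we are currently living through two shifts at once: first, applications are being rebuilt around AI; second, the entire paradigm of software development and execution is being reshaped, from "programming" to "training," from CPUs to GPUs, and from executing precompiled code to generating content in real time. "This means the past ten years ...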
Jensen Huang Mentions DeepSeek in His First Speech of the New Year
Di Yi Cai Jing· 2026-01-05 23:45
Core Insights
- The rise of open-source models has become a catalyst for global innovation in the AI industry, as highlighted by NVIDIA CEO Jensen Huang during a presentation in Las Vegas [1]
- The introduction of DeepSeek R1 has unexpectedly driven transformation across the industry, showcasing the rapid advancements in open-source model performance [1]
- Several open-source models are emerging globally, with their capabilities increasingly approaching those of leading frontier models [1]
- The presentation featured images of multiple open-source models, including three from China: Kimi K2, Qwen, and DeepSeek V3.2 [1]
Jensen Huang Mentions DeepSeek in His First Speech of the New Year
Di Yi Cai Jing· 2026-01-05 23:17
Group 1
- The core viewpoint of the article highlights the significant progress in the AI industry over the past year, emphasizing the rise of open-source models as a catalyst for global innovation [1]
- NVIDIA's CEO Jensen Huang noted that the emergence of the DeepSeek R1 model has unexpectedly driven transformation across the industry [1]
- Multiple open-source models are now emerging globally, with their performance increasingly approaching that of leading large models [1]

Group 2
- The presentation showcased several open-source models, including three from China: Kimi K2, Qwen, and DeepSeek V3.2 [1]
Software ETF (159852) Rises More Than 3%! DeepSeek Recently Published a Paper, Opening a New Chapter in Architecture!
Jin Rong Jie· 2026-01-05 06:39
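The Software ETF (159852) tracks the CSI Software Services Index. Its top ten holdings by weight are iFLYTEK, Kingsoft Office, Tonghuashun, Zhinanzhen, Hundsun Technologies, Talkweb Information, HopeRun Software, 360 Security Technology, iSoftStone, and Sangfor, with a combined weight of more than 60.89%.

The Software ETF (159852) currently charges a management fee of 0.50% per year and a custody fee of 0.10% per year. Investors without a stock account can also gain exposure to the sector through its feeder funds (012619.OF, 012620.OF, 021861.OF).

Disclaimer: Markets carry risk, and investment requires caution. This article was generated by AI from third-party data, is for reference only, and does not constitute personal investment advice.

Market news, January 5: According to Shanghai Stock Exchange data, the Shanghai Composite Index climbed back above 4,000 points today. As of 14:05, the Shanghai Composite Index was up 1.32% and the CSI Software Index was up 3.41%. Among individual stocks, Intsig Information rose more than 7%, iFLYTEK rose more than 6%, and Zhinanzhen, Tonghuashun, and others rose more than 3%. Among popular ETFs, the Software ETF (159852) rose 3.36%.

On the news front, DeepSeek recently published a paper describing a more efficient approach to AI development. The paper, co-authored by founder Liang Wenfeng, proposes a framework called "Manifold-Constrained Hyper-Connections" (mHC). According to the authors, the framework aims to improve scalability while reducing what is needed to train advanced artificial ...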