开源模型
Search documents
DeepSeek与国产芯片的“双向奔赴”
2 1 Shi Ji Jing Ji Bao Dao· 2025-09-30 23:14
Core Viewpoint - The release of DeepSeek-V3.2-Exp model by DeepSeek Company marks a significant advancement in the domestic AI chip ecosystem, introducing a sparse attention mechanism that reduces computational resource consumption and enhances inference efficiency [1][7]. Group 1: Model Release and Features - DeepSeek-V3.2-Exp model incorporates DeepSeek Sparse Attention, leading to a reduction in API prices by 50% to 75% across its official app, web, and mini-programs [1]. - The new model has received immediate recognition and adaptation from several domestic chip manufacturers, including Cambricon, Huawei, and Haiguang, indicating a collaborative ecosystem [2][6]. Group 2: Industry Impact and Ecosystem Development - The rapid adaptation of DeepSeek-V3.2-Exp by various companies suggests a growing consensus within the domestic AI industry regarding the model's significance, positioning DeepSeek as a benchmark for domestic open-source models [2][5]. - The domestic chip industry, primarily operating under a "Fabless" model, is expected to progress quickly as it aligns with standards defined by DeepSeek, which is seen as a key player in shaping the future of the industry [4][5]. Group 3: Comparison with Global Standards - DeepSeek's swift establishment of an ecosystem contrasts with NVIDIA's two-decade-long development of its CUDA platform, highlighting the rapid evolution of the domestic AI landscape [3][8]. - The collaboration among major internet companies like Tencent and Alibaba in adapting to domestic chips further emphasizes the expanding synergy within the AI hardware and software ecosystem [8].
DeepSeek V3.2要来了?
Guan Cha Zhe Wang· 2025-09-29 09:58
Core Insights - The appearance of DeepSeek-V3.2 on the Hugging Face platform has sparked speculation among users [1] - DeepSeek has a history of releasing new versions and updates around significant holidays [2] - The most recent update prior to the speculation was DeepSeek-V3.1-Terminus, released on September 22, with an open-source announcement [3] Version Release History - DeepSeek V3 was released on December 27, 2024, just before New Year's [3] - DeepSeek-R1-0528 was launched on May 28, 2025, as a special gift for the Dragon Boat Festival [3] - The latest version, DeepSeek-V3.1-Terminus, was made available on September 22, 2023, along with an open-source model [3] Current Status - The Hugging Face interface related to DeepSeek is currently showing errors, and there has been no official response from DeepSeek regarding the situation [4]
乌克兰多地遭空袭,已致4死80余伤;连锁餐饮企业监管新规出台;万达知情人士回应王健林被限高;受贿2.68亿!唐仁健一审被判死缓丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-09-28 22:03
Group 1: Industry Developments - The Ministry of Industry and Information Technology and seven other departments issued a plan for the non-ferrous metals industry, targeting an average annual growth of around 5% in value-added output from 2025 to 2026, with a 1.5% annual growth in the production of ten non-ferrous metals, including copper, aluminum, and lithium [5] - The National Development and Reform Commission held a meeting to discuss expanding effective investment during the 14th Five-Year Plan period, emphasizing the need for practical measures to stimulate private investment and promote healthy and high-quality development of the private economy [6] - The State Administration for Market Regulation released new regulations for food safety responsibilities of chain catering enterprises, which will take effect on December 1, 2025, categorizing enterprises based on the number of stores and assigning regulatory responsibilities accordingly [7] Group 2: Corporate News - Dongfeng Motor is collaborating with Huawei to explore store development for the Warrior brand, aiming to enhance market competitiveness and influence marketing strategies in the automotive industry [14] - Huawei's CEO of the Intelligent Automotive Solutions Business Unit announced that Level 3 autonomous driving is expected to scale up by 2027, marking a significant transformation in the automotive industry [15] - Leap Motor's chairman responded to a recent "height restriction" issue, stating that it has been resolved and emphasizing the need for team improvement and confidence in the company's future [17] - Wanda Group's chairman Wang Jianlin was restricted from high consumption due to economic disputes involving a subsidiary, highlighting the importance of timely resolution of such issues to avoid business disruptions [19] - Tencent released and open-sourced the "Hunyuan Image 3.0," a large-scale multimodal image model, which is expected to have a significant impact on the image modeling field [20] - China's first domestically developed quadrivalent HPV vaccine has been approved for market release, which is anticipated to enhance public health and disease prevention efforts [21] - Starry Sky Dynamics completed a D-round financing of 2.4 billion yuan, indicating strong investor confidence in its development in the aerospace sector [22]
宇树科技王兴兴谈机器人现状:最大挑战在哪里?为什么坚持开源?
机器人圈· 2025-09-26 09:29
Core Viewpoint - The development of humanoid robots is heavily reliant on innovations in communication connectivity, chip computing power, and energy consumption control, necessitating open collaboration and innovation within the industry to accelerate progress [1][2]. Group 1: Development Roadmap - The CEO of Yushu Technology, Wang Xingxing, outlined a roadmap for humanoid robots, emphasizing the need for real-time action generation based on arbitrary commands, aiming for significant advancements by the end of next year [1]. - The company has made progress in teaching robots to perform various human movements, with expectations to achieve real-time action capabilities soon [1]. Group 2: Industry Challenges - A significant challenge in the robotics industry is related to cabling, with 60% to 70% of industrial robot failures attributed to cable issues, highlighting the importance of reducing cable weight and quantity for improved performance and reliability [2]. Group 3: Model Development - The development of large models is crucial for enhancing the general capabilities of robots, with a call for open-source collaboration similar to early OpenAI practices to foster industry growth [3][4]. - Yushu Technology has announced the open-sourcing of UnifoLM-WMA-0, a world model designed for general robot learning, which includes datasets and training source codes [4].
宇树科技王兴兴谈人形机器人最大挑战
2 1 Shi Ji Jing Ji Bao Dao· 2025-09-24 15:13
Core Insights - The development of humanoid robots is heavily reliant on innovations in communication connectivity, which necessitates stringent requirements for chip computing power and energy consumption [1] - The company aims to enable humanoid robots to perform real-time actions based on arbitrary commands by mid-2024, with a longer-term goal of allowing robots to operate autonomously in unfamiliar environments by the end of 2025 [1][2] - The company emphasizes the importance of reducing cable usage in robots, as 60%-70% of industrial robot failures are related to cable issues, and aims to connect the main control unit and limbs with a single cable in the future [2] - The development of large models is crucial for enhancing the general capabilities of robots, and the company advocates for an open-source approach to accelerate industry advancement, having recently open-sourced its UnifoLM-WMA-0 model [3]
宇树科技王兴兴谈人形机器人最大挑战
21世纪经济报道· 2025-09-24 15:12
Core Viewpoint - The development of humanoid robots is heavily reliant on innovations in communication connectivity, which necessitates stringent requirements for chip computing power and energy consumption [1][2]. Group 1: Development Roadmap - The company aims to enable humanoid robots to learn and perform various human movements, such as dance and martial arts, with improved fluidity and effectiveness compared to previous attempts [1]. - The next phase involves allowing robots to execute any command in real-time, moving closer to a state where robots can autonomously perform tasks [1]. - The company anticipates achieving the capability for real-time action generation by the end of this year or early next year, with further advancements expected by the end of next year to allow robots to operate in unfamiliar environments [1][2]. Group 2: Challenges in the Industry - A significant challenge in the robotics industry is related to cable management, with 60%-70% of industrial robot failures attributed to cable issues [2]. - The company emphasizes the importance of reducing the number of cables to enhance robot performance and reliability, aiming for a future where only one cable connects the main control unit to the limbs [2]. Group 3: Open Source Initiatives - The company has announced the open-sourcing of UnifoLM-WMA-0, a world model-action framework designed for general robot learning, which includes datasets and training source codes [3]. - The call for open-source model development is seen as a way to accelerate industry progress, similar to early strategies employed by OpenAI [3].
宇树科技王兴兴谈机器人现状:最大挑战在哪里?为什么坚持开源?
2 1 Shi Ji Jing Ji Bao Dao· 2025-09-24 14:13
Core Viewpoint - The development of humanoid robots is heavily reliant on innovations in communication connectivity, which necessitates unique requirements for chip computing power and energy consumption [1][3] Group 1: Development Roadmap - The company aims to enable humanoid robots to perform real-time actions based on arbitrary commands, with expectations to achieve this by mid-next year [1] - By the end of next year, the goal is for humanoid robots to autonomously operate in unfamiliar environments, such as retrieving a bottle of water for a guest [1] Group 2: Challenges in the Industry - A significant challenge in the robotics industry is related to cabling, with 60% to 70% of industrial robot failures attributed to cable issues [3] - The company is focused on reducing the number of cables connecting the main control unit and limbs to enhance robot performance and reliability [3] Group 3: Open Source Initiatives - The company advocates for an open attitude in the industry, similar to OpenAI's early approach, to accelerate the development of large models and the robotics sector [4] - The company has announced the open-sourcing of UnifoLM-WMA-0, a world model designed for general robot learning, including datasets and training source codes [4] Group 4: Importance of Large Models - Developing corresponding large model capabilities is crucial for enhancing the general capabilities of robots [5]
吴泳铭的两个新判断,和加倍激进投入的阿里云
3 6 Ke· 2025-09-24 13:11
Core Insights - The core message of the article revolves around Alibaba Cloud's aggressive push into the AI sector, particularly through the launch of new models and the establishment of a strategic vision for the future of artificial intelligence, termed ASI (Artificial Superintelligence) [1][4][12]. Group 1: AI Models and Innovations - Alibaba Cloud introduced several new AI models at the Yunqi Conference, including the flagship model Qwen3-Max, which outperforms competitors like GPT-5 and Claude Opus 4, ranking among the top three globally on LMArena [1][6]. - The new models include Qwen3-Next, Qwen3-Coder, Qwen3-VL, Qwen3-Omni, Wan2.5-preview, and Tongyi Bailing, each with significant advancements in capabilities such as visual understanding, coding, and multi-modal interactions [1][6][12]. - The Qwen3-Max model has a pre-training data volume of 36 trillion tokens and over one trillion parameters, showcasing a substantial increase in performance and efficiency [6][12]. Group 2: Strategic Vision and Goals - Alibaba Cloud's CEO, Wu Yongming, articulated a vision where large models will serve as the next generation of operating systems, fundamentally transforming software development and interaction [3][4]. - The company aims to build a "Super AI Cloud" to provide a global intelligent computing network, with a three-year plan involving an investment of 380 billion yuan in AI infrastructure [3][4]. - The transition from AGI (Artificial General Intelligence) to ASI is outlined in three stages: "intelligent emergence," "autonomous action," and "self-iteration," with the ultimate goal of surpassing human intelligence [4][12]. Group 3: Market Response and Financial Performance - Following the announcement of its new AI strategy, Alibaba's stock surged over 9%, reaching its highest level since October 2021, indicating strong market confidence in the company's direction [5][12]. - Alibaba Cloud reported a 26% year-on-year increase in quarterly revenue, with AI-related income growing for eight consecutive quarters at triple-digit rates [12][18]. - The Chinese AI cloud market is projected to reach 22.3 billion yuan by mid-2025, with Alibaba Cloud holding a 35.8% market share, surpassing the combined share of its next three competitors [12].
阿里一口气发了N款新模型,让我们向源神致敬。
数字生命卡兹克· 2025-09-24 05:28
Core Viewpoint - Alibaba's recent cloud conference showcased a comprehensive range of new AI models, indicating a significant investment in AI technology and a commitment to building a robust AI ecosystem [1][64]. Group 1: New Model Releases - The Qwen3-Max model was introduced as a direct competitor to top models like GPT-5 and Claude Opus 4, featuring over 1 trillion parameters and trained on 36 trillion tokens [3][6]. - Qwen3-Max has two versions: the Instruct version for general use and a more advanced Thinking version, which is not yet publicly available [8][15]. - The Wan2.5 model was launched, enhancing capabilities for audio-visual synchronization, allowing users to generate videos from images and audio [20][32]. - Qwen3-VL, a powerful visual language model, supports a context of 256K tokens and can be extended to 1 million tokens, outperforming some competitors in specific tasks [33][37]. - Qwen3-Omni, an end-to-end multimodal model, supports various input types and languages, showcasing Alibaba's extensive capabilities in AI [45][48]. Group 2: Performance and Capabilities - Qwen3-Max achieved top scores in various AI benchmarks, including a perfect score in challenging math reasoning competitions [11][15]. - The models demonstrate advanced reasoning and agent capabilities, allowing them to perform complex tasks and interact with tools effectively [40][41]. - The new models are designed to enhance user experience in applications such as digital content creation and real-time translation, with low latency and high accuracy [49][59]. Group 3: Additional Innovations - Alibaba introduced several other models, including Qwen3-Coder-Plus for improved coding efficiency and Fun-ASR for advanced speech recognition [54][57]. - The company is also focusing on safety with models like Qwen3Guard, aimed at ensuring AI security in real-time applications [60]. - The overall strategy reflects Alibaba's ambition to create a comprehensive AI ecosystem that spans various modalities and applications [68][70].
谈超级人工智能之路,吴泳铭称阿里目标是打造AI时代的操作系统
Di Yi Cai Jing· 2025-09-24 03:29
其次,他判断,AI Cloud是下一代计算机。算力正在从以CPU为核心的计算加速转变为GPU为核心、以 大模型驱动的AI计算,新的计算范式需要更稠密的算力、更高效的网络和更大的集群规模,需要超大 规模的基础设施和全栈基础积累才能承载这样的需求。他认为,未来全世界也许只会有5到6个超级云计 算平台。 AGI并不是终点,吴泳铭认为,AI会经历三个阶段最终成长为超级人工智能。第一阶段是智能涌现,AI 学习人;第二阶段是AI自主行动,辅助人,我们刚刚处在这个阶段的开端,未来也许会有超过世界人 口的智能体和机器人和人类一起工作;第三个阶段是自我迭代,超越人,跨越到这个阶段需要两个要 素,AI将逐步连接几乎物理世界的所有场景和数据,模型能够自我学习、通过与真实世界的持续交互 获得新的数据实现自我迭代与智能升级。 在通往这个变革的路上,吴泳铭作出了一些预测。首先,他认为大模型将是下一代操作系统,在未来物 理世界与数字世界的交互中,大模型扮演今天操作系统的地位。各行各业、所有用户都会通过大模型相 关的工具执行任务,自然语言可能就是未来AI时代的编程语言。 吴泳铭相信,未来大模型将运行在所有计算设备中,基于此,阿里巴巴坚持开源 ...