雷峰网
Exclusive Playbook: How MindSpore (昇思) Makes SOTA Models Fast to Migrate and Accurate to Align
雷峰网· 2025-06-12 08:15
Core Viewpoint - The article emphasizes the rapid evolution of large models and the need for efficient migration and deployment solutions in the AI development ecosystem, highlighting the capabilities of MindSpore in facilitating these processes. Group 1: Migration and Deployment Solutions - MindSpore supports Day0 migration for training, enabling seamless cross-framework model transfer with zero-code migration and maintaining model accuracy, achieving a 5% improvement in training performance under distributed parallel strategies [2][5]. - The deployment process is automated, allowing for quick model service initiation, with HuggingFace models being deployable in under 30 minutes using the vLLM-MindSpore plugin [6][7]. Group 2: Ecosystem and Community Engagement - Since its open-source launch on March 28, 2020, MindSpore has fostered a vibrant developer community, with over 1.2 million downloads and contributions from more than 46,000 developers across 130 countries [8][9]. - The community-driven approach includes a governance model with a council and special interest groups (SIGs) to collaboratively define technical directions [9]. Group 3: Technical Innovations - MindSpore employs advanced techniques such as multi-level pipelining and just-in-time (JIT) compilation, resulting in a 40% increase in single-card training efficiency [10]. - The platform also features automated load balancing tools to address the "bottleneck effect" in large-scale training, achieving over 96% linearity in performance [10].
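Jack Ma replies in person! A 10,000-character farewell post sets Alibaba's intranet ablaze; Neta founder surrounded, with wage-seeking employees saying his attitude was hostile and that he tried to kick an employee; XPeng's self-developed Turing chip stolen while on display | Leiphone Morning Report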
雷峰网· 2025-06-12 00:32
Key Points - Jack Ma personally responded to a lengthy farewell post on Alibaba's internal network, which resonated with many employees and reflected on the company's past and values [4][5] - Insta360 (Yingshi Innovation) saw its stock surge 285% on its debut, with a market capitalization exceeding 70 billion yuan, after raising 1.938 billion yuan for future projects [19] - XPeng Motors reported that its self-developed Turing chip was stolen during a showcase event, leading to discussions about potential marketing strategies [7] - Neta's founder faced a protest from former employees demanding unpaid wages, highlighting ongoing financial struggles within the company [10] - JD Logistics has begun operations in Saudi Arabia, reportedly building a team of over a thousand to support its logistics services [12][13] - Tencent's online video business underwent a significant organizational restructuring, establishing an executive committee to enhance decision-making [14] - Alibaba's cross-border e-commerce platform AliExpress launched a vehicle sales business, marking a significant expansion into the automotive sector [15] - Volkswagen announced a leadership change in its China operations, appointing a new CEO for its passenger car brand [26] - MiniMax is set to release a text reasoning model and plans to introduce an independent audio application, expanding its product offerings [24] - AITO (Wenjie) surpassed traditional luxury car brands in China, becoming the top-selling luxury vehicle brand in the market [27]
Inside the Migration of GoTo's Businesses to Tencent Cloud, From Start to Finish
雷峰网· 2025-06-12 00:32
Core Viewpoint - GoTo's cloud migration reflects the escalating competition among Chinese cloud giants in the Southeast Asian market, marking a significant shift in their international strategies [1]. Group 1: GoTo's Cloud Migration - GoTo, Indonesia's largest internet technology company, has successfully migrated its Gojek ride-hailing and delivery services to Tencent Cloud, while its financial services infrastructure has been deployed on Alibaba Cloud [2][3]. - The migration is unprecedented in Southeast Asia: no other company in the region has undertaken a cloud transition on this scale, and it has drawn significant attention from regional enterprises [3]. - GoTo's decision to migrate was driven by the need to enhance its service offerings and operational efficiency, particularly in the ride-hailing and delivery sectors [5][7]. Group 2: Technical Challenges and Solutions - The migration of GoTo's ODS (On-Demand Services) was particularly challenging due to rapid business growth and the lack of a systematic technical foundation [9]. - GoTo's leadership recognized the necessity of upgrading their technology stack to support the transition to a more efficient delivery system [8][12]. - The migration process involved meticulous planning, including a detailed execution guide with thousands of operational steps and multiple rehearsals to anticipate potential issues [14][15]. Group 3: Strategic Partnerships and Outcomes - GoTo's collaboration with Tencent Cloud was based on a comprehensive evaluation of service compatibility, cost-effectiveness, and technical capabilities [11]. - The migration was completed in 4 hours and 54 minutes and reduced GoTo's costs by over 50%, marking a significant operational leap [16]. - This partnership signifies Tencent Cloud's strategic move into the Southeast Asian market, highlighting the increasing pace of Chinese cloud providers' international expansion [17].
Huawei's "Digital Wind Tunnel" Rehearses 10,000-Card Cluster Plans Within Hours; Ascend Helps Large Models Run "Fast and Stable"
雷峰网· 2025-06-11 11:00
Core Viewpoint - The article discusses the launch of the Ascend modeling and simulation platform, which aims to optimize the interaction between load, optimization strategies, and system architecture to enhance infrastructure performance [1]. Group 1: Challenges in AI Model Training - Over 60% of computing power is wasted due to hardware resource mismatches and system coupling, highlighting the inefficiencies in traditional optimization methods [2]. - The training process for large models is likened to "slamming the gas pedal," where the MoE model requires precise balancing of computation and memory to avoid efficiency drops [4]. - Dynamic real-time inference systems face challenges in meeting both high throughput and low latency requirements across varying task types [4]. Group 2: Solutions and Innovations - The "digital wind tunnel" allows for pre-simulation of complex AI models in a virtual environment, enabling the identification of bottlenecks and optimization strategies before real-world implementation [6]. - The Sim2Train framework enhances the efficiency of large-scale training clusters through automatic optimization of deployment space and dynamic performance awareness, achieving a 41% improvement in resource utilization [7]. - The Sim2Infer framework focuses on real-time optimization of inference systems, resulting in over 30% performance improvement through adaptive mixed-precision inference and global load balancing [8]. Group 3: High Availability and Reliability - The Sim2Availability framework ensures high availability of the Ascend computing system, achieving a 98% uptime and rapid recovery from failures through advanced optimization techniques [11]. - The system employs a comprehensive monitoring approach to track hardware states and optimize software fault management, enhancing overall system reliability [13]. 
Group 4: Future Outlook - As new applications evolve, the demand for innovative system architectures will increase, necessitating continuous advancements in modeling and simulation methods to support the development of computing infrastructure [16].
Valuation Gaps in Embodied Intelligence Are Widening Faster: What Will Carry Robotics Newcomers Through the Storm?
雷峰网· 2025-06-11 11:00
Core Viewpoint - The disparity in valuations within the embodied intelligence sector reflects either a significant gap in capabilities or the presence of a valuation bubble [2][3][24] Group 1: Market Dynamics and Valuations - The first tier of Chinese embodied intelligence startups is estimated to be valued at between 2.5 billion and 3 billion RMB [2] - Companies like Unitree (Yushu) and Zhiyuan have valuations exceeding 15 billion RMB and 10 billion RMB respectively, while many others are valued at between 2 billion and 3.5 billion RMB [2] - The valuation gap between different tiers of companies can exceed 100% [2] - The market is witnessing a trend where hardware and software are increasingly viewed as separate investment tracks, complicating valuation standards [4][17] Group 2: Investment Trends - Investment is increasingly concentrating on the leading companies, with a preference for established names like Unitree, which has become a benchmark for hardware projects [16][17] - Many investors are cautious, preferring to invest in companies with proven business models rather than speculative startups [18][19] - The influx of capital into the sector has led to inflated valuations, with early-stage companies often starting at valuations in the millions [18][19] Group 3: Challenges and Future Outlook - The embodied intelligence sector is still heavily reliant on financing, with many companies focusing on securing funding rather than achieving profitability [12][20] - There is a consensus that the market is at a critical juncture, with many companies expected to deliver products and generate revenue in the near future, potentially reshaping the competitive landscape [24] - The industry is still in its early stages, with significant technological advancements yet to be realized, indicating a potential for future growth despite current valuation concerns [23][24]
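BYD, Changan and other automakers pledge payment terms of no more than 60 days, while NIO, XPeng and Li Auto have yet to follow; YU7 styling accused of copying, with an expert saying it does not infringe; Ximalaya sells itself to Tencent Music for US$1.26 billion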
雷峰网· 2025-06-11 00:53
Group 1 - BYD and Changan have capped their supplier payment terms at 60 days, while newer players such as NIO, XPeng, and Li Auto have not yet followed [4][5] - Xiaomi's YU7 model faces plagiarism accusations, but the company says its design is original, and experts state that it does not infringe on patents [7][8] - BYD's salary levels have surpassed Huawei's, with significant investments in AI and a commitment to improving brand perception amid shareholder criticism [10][11] Group 2 - Ren Zhengfei of Huawei stated that the U.S. has exaggerated Huawei's achievements, emphasizing the need for continuous improvement in chip technology [13] - TSMC is accelerating its U.S. factory construction while slowing down projects in Japan and Europe due to market demand fluctuations [14] - BYD and other Chinese manufacturers are gaining ground in the autonomous driving sector, posing a threat to Tesla's market position [15] Group 3 - The Zhiyuan Research Institute showcased a four-legged robot designed to assist visually impaired individuals, successfully guiding them in complex environments [17] - Tencent Music announced a $1.26 billion acquisition of Ximalaya, marking a significant move into the online audio sector [19] - XPeng Motors is set to unveil its G7 model featuring the Turing AI chip, which boasts advanced processing capabilities [26] Group 4 - Huawei is preparing to launch its Pura 80 series smartphones, featuring advanced imaging technology and expected to start at around 5000 yuan [32] - Li Auto has established two new robotics divisions, focusing on space and wearable robots, indicating a strategic shift towards AI integration [34] - Gree Electric's president mentioned that several business segments are ready for potential spin-offs, reflecting a strategy to enhance market competitiveness [35]
How Stable Is the Ascend AI Computing Cluster? 98% Availability at 10,000 Cards, With Second-Level Fault Recovery
雷峰网· 2025-06-10 10:30
Core Viewpoint - The article discusses how Huawei enhances the efficiency and stability of AI computing clusters, emphasizing the importance of high availability to support continuous operation and minimize downtime in AI applications [2][16]. Group 1: High Availability Core Infrastructure - AI computing clusters face complex fault diagnosis challenges due to large system scale and intricate technology stacks, with fault localization taking from hours to days [4]. - Huawei has developed a full-stack observability capability to improve fault detection and management, which includes a fault mode library and cross-domain fault diagnosis [4]. - The CloudMatrix super node achieves a mean time between failures (MTBF) of over 24 hours, significantly enhancing hardware reliability [4]. Group 2: Fault Tolerance and Reliability - Huawei's super node architecture leverages optical link software fault tolerance solutions, achieving a fault tolerance rate of over 99% for optical module failures [5][6]. - The recovery time for high-bandwidth memory (HBM) multi-bit ECC faults has been reduced to 1 minute, resulting in a 5% decrease in computing power loss due to faults [6]. Group 3: Training and Inference Efficiency - The linearity metric measures the improvement in training task speed relative to the number of computing cards, with Huawei achieving a linearity of 96% for the Pangu Ultra 135B model using a 4K card setup [10]. - Huawei's training recovery system can restore training tasks in under 10 minutes, with process-level recovery reducing this to as low as 30 seconds [12]. - For large EP inference architectures, Huawei has proposed a three-tier fault tolerance solution to minimize user impact during hardware failures [12][14]. Group 4: Future Directions - Huawei aims to explore new applications driven by diverse and complex scenarios, breakthroughs in heterogeneous integration, and innovative engineering paradigms focused on observability and intelligent autonomy [16].
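The linearity and availability figures in the summary above can be made concrete with a little arithmetic. The following is a minimal sketch, not Huawei's methodology: it assumes linearity is the measured speedup divided by the ideal (proportional) speedup, and it uses the textbook steady-state availability formula MTBF / (MTBF + MTTR). The function names and all sample numbers are hypothetical and chosen only for illustration.

```python
def scaling_linearity(base_cards, base_throughput, scaled_cards, scaled_throughput):
    """Measured speedup divided by ideal speedup (assumed definition of linearity)."""
    if base_cards <= 0 or scaled_cards <= 0 or base_throughput <= 0:
        raise ValueError("card counts and base throughput must be positive")
    measured_speedup = scaled_throughput / base_throughput
    ideal_speedup = scaled_cards / base_cards
    return measured_speedup / ideal_speedup


def steady_state_availability(mtbf_hours, mttr_hours):
    """Textbook availability: fraction of time the system is up."""
    if mtbf_hours <= 0 or mttr_hours < 0:
        raise ValueError("MTBF must be positive and MTTR non-negative")
    return mtbf_hours / (mtbf_hours + mttr_hours)


if __name__ == "__main__":
    # Hypothetical run: quadrupling the cards yields 3.84x the throughput.
    linearity = scaling_linearity(1024, 100.0, 4096, 384.0)
    print(f"linearity: {linearity:.2%}")

    # Hypothetical pairing of a 24-hour MTBF with a 10-minute recovery time.
    availability = steady_state_availability(24.0, 10.0 / 60.0)
    print(f"availability: {availability:.2%}")
```

Under these assumptions, a shorter recovery time raises availability even when MTBF is unchanged, which is why the summary treats recovery speed and hardware reliability as separate levers. The sketch is not how the article derives its 98% figure.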
A 10,000-Character Summary: How to Build a Reliable "Dexterous Hand" for Humanoid Robots?
雷峰网· 2025-06-10 10:30
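On May 25, 2025, 雷峰网 (Leiphone), AI 科技评论 (AI Technology Review), and the GAIR Live brand held an online roundtable salon on the theme "Exploration and Application of Dexterous Hands in Embodied Intelligence." The roundtable was hosted by Le Jinxin, partner at 元禾原点, and brought together Shao Lin, assistant professor at the National University of Singapore and founder of RoboScience; Ma Daolin, associate professor at Shanghai Jiao Tong University and founder of 千觉机器人; and Ye Qi, Hundred Talents Program researcher and doctoral supervisor at the College of Control Science and Engineering, Zhejiang University, for an in-depth exchange. VLA is expected to evolve into VTLA, which incorporates touch, to break through the technical bottleneck of information fusion. Author | Wu Huaxiu Editor | Chen Caixian As embodied intelligence rises rapidly, the dexterous hand, a key carrier linking digital intelligence to the physical world, is moving from a traditional execution endpoint to a core breakthrough point for putting AI into practice. At the event, the guests each shared their own history with dexterous hands and offered distinct views on hardware and software challenges, data and models, and deployment and applications. The three guests each gave their opinions on how to address the dexterous-hand data problem. Ma Daolin pointed out that the data currently collected for dexterous hands and grippers, and the models trained on it, are still at an early stage relative to the embodied intelligence field as a whole, and that the data modalities are mostly vision and action, without yet covering touch. The next steps are, on the one hand, to collect more multimodal data and, on the other, to solve the processing and fusion of the different modalities once collected. Shao Lin ...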
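Losses in the billions? Midea responds to the North American air conditioner incident: no defect, a voluntary recall; DeepSeek core executive leaves to start a company; Huawei Pura X rumored to have a new screen-unfolding design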
雷峰网· 2025-06-10 00:28
Group 1 - Xiaomi's China region has undergone personnel adjustments, with Vice President Wang Xiaoyan also taking on the role of General Manager of Xiaomi Home, while the former GM Wang Hui will transition to the Sales Management Department [4] - As of March 31, Xiaomi's offline retail store count in China reached 16,000, with a target of 20,000 by the end of the year [4] - Xiaomi is expanding its new retail model globally, planning to open 10,000 stores overseas in the next five years [5] Group 2 - DeepSeek's core executive has left to start a new venture focused on the Agent sector, with plans to launch a product by Christmas 2025 [7] - DJI's imaging system founder and team leader has reportedly left the company, marking a significant personnel change [9] Group 3 - Midea Group responded to a recall of its North American air conditioning units, stating it was a voluntary recall and not due to defects, despite potential losses amounting to billions [10] - The recalled U-shaped air conditioner has sold 1.7 million units in the U.S. and 45,900 in Canada since its launch in 2020 [10] Group 4 - BYD has entered the top ten of imported car brands in Japan for the first time, with 416 units registered in May and plans to open 100 stores by the end of 2025 [21] - BYD's sales in Japan for 2024 are projected at 2,221 units, a 10% year-on-year increase, despite a 6% decline in overall imported car sales [21] Group 5 - JD.com has released a clean cooperation guideline prohibiting suppliers from engaging with dismissed employees, and established a 10 million yuan anti-corruption reward fund [19] - GAC Aion has seen a leadership change, with He Xianqing taking over as chairman from Feng Xingya [19] Group 6 - Xiaohongshu has established its first overseas office in Hong Kong, marking a significant step in its global strategy [20] - The platform aims to enhance creative collaboration between local content creators and brands, promoting cultural exchange [20] Group 7 - The "Guzi economy" is rapidly growing, with Pinduoduo testing a new group buying service specifically for this market, projected to reach a market size of 168.9 billion yuan in 2024 [13] - SiliconCloud, a generative AI development platform, has surpassed 6 million users and thousands of enterprise clients, with significant daily token generation [14] Group 8 - Neuralink and Grok are collaborating to enable ALS patients to communicate again through a brain-machine interface, showcasing advancements in assistive technology [32] - Toyota is partnering with a Finnish company to launch the world's first hydrogen sauna, aligning with its environmental goals [33] Group 9 - Qualcomm has announced the acquisition of UK semiconductor company Alphawave Semi for approximately $2.4 billion, enhancing its semiconductor IP portfolio [34] - SHEIN has denied reports of plans to increase its Indian supplier base from 150 to 1,000, clarifying its partnership with Reliance is limited to brand licensing [34]
Exclusive | Wang Dingxiao, former head of marketing for Douyin's local life services, joins JD Health, reporting to CEO Jin Enlin
雷峰网· 2025-06-09 13:37
Core Viewpoint - Wang Dingxiao has recently joined JD Health as the head of the marketing department, indicating a strategic shift in the company's marketing leadership and a response to the evolving landscape of the digital marketing industry [2][5]. Group 1: Leadership Changes - Wang Dingxiao, previously the marketing head for Douyin's life services, has taken on the role of general manager of the marketing department at JD Health, reporting directly to CEO Jin Enlin [2][4]. - Prior to Wang's appointment, the marketing department was managed by CEO Jin Enlin himself, highlighting the frequent changes in leadership within JD Health [5]. Group 2: Career Background of Wang Dingxiao - Wang Dingxiao graduated from Tianjin Normal University in 2010 and has held various strategic roles in advertising firms such as Dentsu Digital and Ogilvy before transitioning to marketing at ByteDance in 2017 [2][3]. - During his seven years at ByteDance, Wang played a significant role in the development of the short video industry, managing marketing strategies for key clients across multiple regions [3]. Group 3: Industry Context - The marketing landscape is increasingly characterized by the need to build personal brands for entrepreneurs, leading to frequent adjustments in the roles of brand and marketing departments [5]. - JD's management has seen continuous changes, with recent reports indicating ongoing adjustments in the retail and marketing teams, reflecting the dynamic nature of the industry [5].