Global Tech News Summary
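Research Report, 28 Jul 2025. China Electronics, China (Overseas) Technology. Global Tech News Summary, July 28, 2025. Flash Analysis.

AI and ASIC servers are power-hungry monsters; BBU demand is heading "straight up" (Digitimes)

The rapidly rising power consumption of AI servers is driving explosive growth in demand for backup battery units (BBU). The market expects key Taiwanese suppliers such as Dynapack (顺达科), Delta Electronics (台达电), AES and Lite-On (光宝科) to keep benefiting. BBU module makers note that the share of BBU modules is rising not only in NVIDIA GPU racks but also in the ASIC racks of cloud service providers (CSPs). As rack power consumption keeps climbing, BBU capacity keeps increasing as well. Suppliers add that power delivery in next-generation AI data centers is trending toward HVDC (high-voltage direct current), so BBU demand will only grow. Battery module maker Dynapack is one of the main Taiwanese BBU module suppliers; its general manager, 张崇兴, noted that at present, customer B ...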
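Humanoid robot companies are building robot dogs: a technological step down?
机器人大讲堂 · 2025-07-26 15:56
Humanoid robot companies that have raised funding are all starting to work on quadruped robot dogs. A few months ago this was still a running joke among people in the supply chain; now it has come true.

On July 22, Zhiyuan Robotics (智元机器人) quietly listed an industry-grade small quadruped robot, the Zhiyuan D1 Ultra, on its official website. According to the website, this is the company's first quadruped robot, built for special-purpose and industry applications and part of the 智元灵犀 series. It has a top running speed of 3.7 m/s, handles uphill and downhill slopes of at least 30°, can jump forward or upward to a height of up to 35 cm off the ground, and can continuously climb stairs with steps up to 16 cm high. The company is also recruiting Zhiyuan Robotics partners. However, Zhiyuan has not announced a price for the D1 Ultra or publicly announced the product; the person in charge said it will debut at the WAIC exhibition.

Meanwhile, another humanoid robot company, MagicLab (魔法原子), recently released a new wheeled quadruped robot, MagicDog-W. The robot has 17 degrees of freedom across its body. Tests show that MagicDog-W can clear vertical obstacles of at least 60 cm, climb slopes of at least 40 degrees, and hold a stable posture on unstructured terrain such as gravel, grass and stairs, so it can move and work efficiently in complex environments. The company calls MagicDog-W "the strongest wheeled quadruped robot in its class". It is priced from 75,000 RMB, global pre-orders have opened, and it likewise will ...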
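Zhou Hongyi on DeepSeek's traffic decline: no effort put into it, Liang Wenfeng is focused solely on AGI; Insta360 announces entry into the drone market; Alibaba reportedly to launch its first self-developed AI glasses this week
雷峰网 · 2025-07-24 00:36
NEWS REMIND
1. Zhou Hongyi on DeepSeek's traffic decline: no effort put into it, Liang Wenfeng is focused solely on AGI
2. Alibaba reportedly to launch its first self-developed AI glasses this week, joining the "hundred-glasses war"
3. Amazon's Shanghai AI research institute abruptly disbanded; official response: fully supporting employees through a smooth transition
4. Insta360 announces entry into the drone market and will launch its own drone brand
5. Li Auto delivers on its 60-day payment-term promise: two unified payment dates per month, paid directly in cash
6. A full meal for 10-20 RMB! JD.com: 七鲜小厨 is not out to take business from restaurants
7. The AI opportunity Samsung missed! In 2018 Jensen Huang proactively sought cooperation and was turned down on HBM, CUDA and wafer foundry alike
8. Amazon acquires wearable device maker Bee

Alibaba reportedly to launch its first self-developed AI glasses this week, joining the "hundred-glasses war"

July 23: according to media reports, Alibaba will release its first self-developed AI glasses this week, joining the "hundred-glasses war". The company has not yet responded. The glasses are said to offer the basic functions found in most products on the market, such as a voice assistant, music playback, phone calls, real-time translation and meeting minutes.

The product will also integrate with the Alibaba ecosystem, including map, payment and shopping functions. "Technical teams from Amap (高德), Alipay, Taobao and others have all been involved," a person familiar with the matter said. As for the product's AI capabilities, the base model will call 通义千问 (Qwen), while Quark (夸克) will train and learn ...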
Tencent Research Institute AI Express 20250724
Tencent Research Institute (腾讯研究院) · 2025-07-23 11:14
Group 1: AI Compute Competition
- OpenAI plans to bring 1 million GPUs online by the end of the year, while Musk's xAI aims to deploy 50 million GPUs over five years, a sign of an intensifying compute arms race [1]
- OpenAI is pursuing compute autonomy through self-developed chips, the Stargate project, and collaboration with Microsoft, aiming to shift 75% of its compute sources to the Stargate project by 2030 [1]
- AI capital expenditure in Silicon Valley is expected to reach $360 billion in 2025, equivalent to about 2.5 trillion RMB, with leading cloud companies controlling core industry resources [1]

Group 2: Talent Acquisition in AI
- Meta has recruited three Chinese scientists from DeepMind who were involved in the IMO gold medal project: Tianhe Yu, Cosmo Du, and Weiyue Wang, all of whom previously worked on Google's Gemini [2]
- Microsoft has also hired over 20 employees from Google DeepMind in the past six months, including Amar Subramanya, the former VP of engineering for the Gemini chatbot [2]
- Zuckerberg tried to recruit OpenAI's Chief Researcher Mark Chen for $1 billion but was unsuccessful, which illustrates Meta's aggressive talent acquisition strategy as it establishes Meta Superintelligence Labs [2]

Group 3: Open Source AI Models
- Alibaba has open-sourced the Qwen3-Coder-480B-A35B-Instruct model, which has 480 billion parameters, supports a 256K context, and can output up to 65,000 tokens [3]
- The model is designed for agentic coding, browser use, and tool invocation, competing with open-source models such as Kimi K2 and closed-source models such as GPT-4.1 [3]
- Pre-training used 75 trillion tokens of data (70% of which was code), and reinforcement learning was run in 20,000 independent environments [3]

Group 4: AI Audio Generation
- Tsinghua University and Shengshu Technology developed FreeAudio, which allows precise, controllable generation of AI audio up to 90 seconds long; the research was selected for ACM MM 2025 [4][5]
- FreeAudio employs a "no training" method to overcome industry bottlenecks, using an LLM for time planning and generating audio over non-overlapping time windows [5]
- The system includes Decoupling & Aggregating Attention Control modules and excels at generating audio for tasks of 10 seconds, 26 seconds, and 90 seconds [5]

Group 5: Voice Recognition Technology
- ima has integrated Tencent's self-developed ASR (Automatic Speech Recognition) model, enabling direct voice input, which is now available in its mobile apps [6]
- The mixed ASR model is the first in the industry based on dual encoders and can recognize 300 characters per minute, four times faster than manual input [6]
- Voice input can be used in scenarios such as knowledge base Q&A, note-taking, and writing continuation, and iOS users can add desktop widgets for quicker voice queries [6]

Group 6: Music Generation Models
- Kunlun Wanwei launched the Mureka V7 music model, improving the yield rate from 43.4% in V6 to 57.7%, with a 44% improvement in vocal realism and nearly double the overall sound quality [7]
- Mureka V7 uses MusiCoT technology to generate a global music structure first and then produce the audio, mimicking the human creative process [7]
- The company also introduced Mureka TTS V1, a text-to-speech model that lets users customize voice tones from text descriptions; it achieves a voice quality score of 4.6, surpassing ElevenLabs' score of 4.36 [7]

Group 7: Quadruped Robots Market
- Zhiyuan Robotics has launched its first industry-grade small quadruped robot, the Zhiyuan D1 Ultra, with a maximum running speed of 3.7 m/s and the ability to jump 35 cm high [8]
- Magic Atom (MagicLab) has released a wheeled quadruped robot, MagicDog-W, starting at 75,000 RMB and billed as the strongest in its class; both products are set to be showcased at the 2025 World Artificial Intelligence Conference [8]
- The quadruped robot market is growing rapidly: China's market was estimated at 470 million RMB in 2023 and is projected to reach 850 million RMB by 2025, while Unitree (Yushu Technology) currently holds a 60-70% global market share [8]

Group 8: Robotics Safety Concerns
- DeREK, the American robot fighting champion built on the Unitree (Yushu) G1, malfunctioned and entered a walking mode, causing it to "go crazy" and kick surrounding objects [9]
- The emergency braking system failed to respond in time, and the wireless emergency stop device took five seconds to activate; the robot stopped only when the Ethernet cable was disconnected [9]
- Analysis highlighted multiple safety hazards, including difficult access to the battery, powerful motor torque (120-160 Nm), wireless communication that is unsuitable for safety-critical systems, and a lack of redundant safety mechanisms [9]

Group 9: AI Platform Competition
- According to a16z, competition among platforms is shifting from cost and speed to control over contextual permissions [10]
- Models are becoming the fourth layer of infrastructure in software development, alongside computing, networking, and storage, evolving from "callable components" into central control systems [10]
- The reasoning layer is emerging as a new battleground for system sovereignty, with platforms redefining development paradigms and business models through interface definitions, context management, and task scheduling capabilities [10]

Group 10: ChatGPT Agent Development
- The ChatGPT Agent consists of Deep Research (intelligent agents), Operator (computer operation agents), and other tools, integrated through shared state [11]
- OpenAI trains the Agent with reinforcement learning, placing all tools inside a virtual machine so that the model can autonomously explore optimal tool combinations without pre-defined usage rules [11]
- The team comprises 20-35 members from research and application teams and has implemented multiple safety measures (real-time monitoring, user confirmation, etc.), with plans to evolve the Agent into a general superintelligent agent [11]