Baltra chip
Apple's first AI server chip revealed, developed with Broadcom
36Kr· 2025-12-17 02:51
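Xindongxi, December 16: Early this morning, the US financial data analytics platform Unusual Whales posted on X that Apple is developing its first AI server chip, codenamed "Baltra," and is working with Broadcom on key networking technology so that it can avoid buying chips from NVIDIA. The account added that Apple "would rather reinvent the chip than pay for NVIDIA's hefty margins." Tech outlet WccfTech speculates that Baltra is built for AI inference workloads and will debut in 2027.
On that basis, WccfTech expects Apple to use Baltra mainly to meet its enormous AI inference needs. Inference chips are architected differently from chips used to train models: they prioritize latency and throughput, and they rely on low-precision arithmetic such as INT8. These are therefore likely to be the areas Apple and Broadcom focus on throughout Baltra's design.
Tech blogger Max Weinbach wrote on X that he thinks Apple most likely will not build large-scale compute clusters and may instead ship something resembling (NVIDIA's) GB300 architecture, interconnecting 64 chips and pairing them with larger-capacity, high-bandwidth LPDDR memory. Such a design should cost considerably less than most chips on the market today while closely matching actual needs.
According to WccfTech, Apple is a fan of vertical integration and tends to retain as much as possible of ...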
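The low-precision point above can be made concrete with a small sketch. The Python below shows symmetric per-tensor INT8 quantization, one common scheme for inference; it is an illustration only, the function names `quantize_int8` and `int8_dot` are hypothetical, and nothing here reflects Baltra's undisclosed design.

```python
# Illustrative only: symmetric per-tensor INT8 quantization in pure Python.
# Function names are hypothetical; this does not describe any Apple hardware.

def quantize_int8(values):
    """Map floats to integers in [-127, 127] using a single shared scale."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(v / scale))) for v in values]
    return quantized, scale


def int8_dot(weights, activations):
    """Approximate a float dot product with integer multiply-accumulate."""
    q_w, scale_w = quantize_int8(weights)
    q_a, scale_a = quantize_int8(activations)
    accumulator = sum(w * a for w, a in zip(q_w, q_a))  # integer arithmetic
    return accumulator * scale_w * scale_a  # rescale to float once at the end


weights = [0.12, -0.50, 0.33, 0.08]
activations = [1.5, 0.25, -2.0, 0.75]
exact = sum(w * a for w, a in zip(weights, activations))
approx = int8_dot(weights, activations)
print(f"float result: {exact:.4f}, INT8 result: {approx:.4f}")
```

The integer path trades a small rounding error for multiply-accumulate operations on 8-bit values, which is the kind of latency and throughput trade-off the article attributes to inference-oriented chips.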
Apple's in-house AI chip to drive further evolution of phone and notebook features; Hon Hai the big winner
Jing Ji Ri Bao· 2025-12-16 23:30
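Apple has joined the race to build in-house AI chips, teaming with Broadcom on its first such chip, "Baltra." It will reportedly begin deploying AI servers built on the in-house chip in 2027 to accelerate its AI applications and further advance Apple Intelligence services, which in turn should lift sales of end devices such as the iPhone, iPad, and MacBook. Key supplier Hon Hai (2317) Group stands to benefit.
Apple's in-house AI chip development is progressing steadily. In late October this year it announced that its plant in Houston, Texas, had begun production ahead of schedule and had started building AI servers, in support of Trump's made-in-America policy and Apple Intelligence services.
The industry is optimistic: as Apple rolls Apple Intelligence features into new iPhones, those features are expected to diversify with each iOS update, the iPhone's AI capabilities will only grow, and more AI servers will be needed to support them.
Institutional investors note that although Apple buys far fewer AI servers than NVIDIA or the data centers of the four major cloud service providers (CSPs), Apple remains Hon Hai's largest customer. As the largest iPhone assembler, Hon Hai leads in personal end-device shipments, and it has now also become a key supplier of Apple's cloud servers. With advantages in both cloud and end devices, its partnership with Apple should grow even more solid.
Foreign media report that Apple's in-house AI server chip "Bal ...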
Tencent Research Institute AI Express 20251217
Tencent Research Institute· 2025-12-16 16:32
Group 1: Apple AI Server Chip
- Apple is developing its first AI server chip, codenamed "Baltra," in collaboration with Broadcom, utilizing TSMC's 3nm process, expected to be deployed in 2027 [1]
- Apple has shifted from building its own large models to paying approximately $1 billion annually for Google's customized 1.2 trillion parameter Gemini model, with Baltra primarily aimed at meeting significant AI inference demands [1]
- The chip architecture will focus on optimizing latency and throughput, employing low-precision operations like INT8, and may utilize a configuration of 64 interconnected chips with large-capacity LPDDR memory [1]
Group 2: NVIDIA Nemotron 3 Series
- NVIDIA has launched the Nemotron 3 series of open models, which includes Nano, Super, and Ultra scales, featuring a breakthrough heterogeneous mixture-of-experts architecture [2]
- The Nemotron 3 Nano has a throughput that is four times higher than its predecessor, achieving leading token generation rates per second in large-scale multi-agent systems, significantly enhancing inference efficiency [2]
- The model achieves exceptional accuracy through advanced reinforcement learning techniques and large-scale parallel multi-environment post-training, providing a complete training dataset and reinforcement learning library [2]
Group 3: ChatGPT Memory System
- Developer Manthan Gupta has reverse-engineered ChatGPT's memory system, revealing a four-layer architecture: session metadata, user memory, recent conversation summaries, and a sliding window [3]
- The system does not utilize vector databases or RAG retrieval but instead relies on pre-generated lightweight summaries and explicitly stored structured information to achieve the effect of "remembering users" [3]
- GPT-4 has a maximum context window of 128k tokens, beyond which the earliest content is forgotten, and users can request the model to delete or modify memory content at any time [3]
Group 4: Tencent Yuanbao Writing Mode
- Tencent Yuanbao has launched a writing mode that supports automatic completion of plot character outlines and one-click generation of manuscripts, capable of producing tens of thousands of words in a single session [4]
- The feature is adaptable to various genres, including historical, science fiction, and fan fiction, allowing users to set a single sentence to let AI complete the outline and chapter structure, with customizable story direction and endings [4]
- Yuanbao can generate approximately 30,000 words in about 14 minutes and 50,000 words in half an hour, with support for one-click export to local documents or Tencent documents [4]
Group 5: Tongyi Wanxiang 2.6 Release
- Tongyi Wanxiang 2.6 has become the first video model in China to support role-playing functions, featuring audio-visual synchronization, multi-camera generation, and voice-driven capabilities, making it the most comprehensive video generation model globally [5]
- The video generation supports 15-second long videos, multi-camera narratives, and natural audio-visual synchronization, allowing for single and multi-person collaborations based on input video character appearance and voice [5]
Group 6: ByteDance Seedance 1.5 Pro Model
- ByteDance has released the Seedance 1.5 Pro audio-video generation model, which supports precise audio-visual synchronization, multilingual dialects, cinematic-level camera movements, and 15-second long video generation [6]
- The model employs the MMDiT architecture to achieve precise audiovisual collaboration, natively supporting multiple languages, including Chinese, English, Japanese, Korean, and dialects like Sichuanese and Cantonese, with audio instructions at industry-leading levels [6]
- In comprehensive evaluations, SeedVideoBench 1.5 demonstrated rich dynamic performance, vivid character expressions, and significantly reduced audio-visual misalignment, applicable in film, advertising, and short drama scenarios [6]
Group 7: L3 Autonomous Driving Models
- The Ministry of Industry and Information Technology has conditionally approved Chang'an's Deep Blue SL03 and Arcfox Alpha S as the first L3 autonomous driving models in China [8]
- The Deep Blue SL03 can achieve single-lane autonomous driving at a maximum speed of 50 km/h in congested environments, limited to designated routes like the Chongqing Inner Ring; the Arcfox Alpha S can reach 80 km/h, restricted to routes like the Beijing-Jingtai Expressway [8]
- Both companies have completed product testing and safety evaluations, with plans to conduct on-road trials in designated areas through Chang'an Vehicle Networking Technology and Beijing Travel Automotive Services [8]
Group 8: Eric Schmidt's Views on AI
- Former Google CEO Eric Schmidt proposed the "San Francisco Consensus," suggesting that the combination of language agents and reasoning capabilities will approach human core abilities, leading to recursive self-improvement in AI as technology converges [9]
- He predicts that AI mathematicians will emerge within the next year, driving the birth of new mathematical theories, with industry consensus on this transformation occurring within 2-4 years, while emphasizing the need to maintain human agency and decision-making authority [9]
- The paths of US-China AI competition are diverging: the US focuses on superintelligence development but faces power shortages, while China is fully promoting AI commercial applications with ample power supply, both relying on the private sector for development [9]
Group 9: AI "Finger Problem"
- Multiple AI models failed to accurately count the number of fingers in images depicting six-fingered hands, even when prompts explicitly stated there were six fingers, with models insisting on five [10]
- The root of the problem lies in the strong association in training data of "human hands = five fingers" and the lack of explicit structural constraints in the Transformer architecture, which cannot track state information in a single forward pass [10]
- Diffusion models excel at capturing overall distributions and textures but struggle with precise control of local discrete structures, revealing current AI's Achilles' heel in visual reasoning and causal relationship understanding [10]
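The four-layer memory layout described under Group 3 (session metadata, user memory, recent conversation summaries, and a sliding window, with no vector database or retrieval step) might be sketched roughly as follows. This is a hypothetical illustration and not ChatGPT's actual implementation: the class `LayeredContext`, its fields, the `[meta]`/`[memory]`/`[summary]` labels, and the word-count tokenizer are all assumptions made for the example.

```python
from collections import deque
from dataclasses import dataclass, field


def count_tokens(text):
    """Crude stand-in for a real tokenizer: count whitespace-separated words."""
    return len(text.split())


@dataclass
class LayeredContext:
    """Hypothetical four-layer prompt builder; not any vendor's real API."""
    session_metadata: dict
    user_memory: list        # explicitly stored facts about the user
    recent_summaries: list   # pre-generated summaries of earlier chats
    window_budget: int = 50  # assumed token budget for the sliding window
    window: deque = field(default_factory=deque)

    def add_message(self, role, text):
        """Append a message, then evict the oldest until the budget fits."""
        self.window.append((role, text))
        while len(self.window) > 1 and self._window_tokens() > self.window_budget:
            self.window.popleft()

    def _window_tokens(self):
        return sum(count_tokens(text) for _, text in self.window)

    def build_prompt(self):
        """Concatenate the four layers in a fixed order; no retrieval step."""
        lines = [f"[meta] {k}={v}" for k, v in self.session_metadata.items()]
        lines += [f"[memory] {fact}" for fact in self.user_memory]
        lines += [f"[summary] {s}" for s in self.recent_summaries]
        lines += [f"[{role}] {text}" for role, text in self.window]
        return "\n".join(lines)


ctx = LayeredContext(
    session_metadata={"device": "desktop"},
    user_memory=["Prefers concise answers"],
    recent_summaries=["Asked about INT8 quantization yesterday"],
    window_budget=8,
)
ctx.add_message("user", "one two three four five")
ctx.add_message("assistant", "six seven eight nine")  # evicts the first message
ctx.add_message("user", "ten eleven")
print(ctx.build_prompt())
```

In this sketch the stored memory and summaries are always included verbatim, while only the sliding window forgets: the oldest messages drop out once the assumed budget is exceeded.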
Apple's 2026 MacBook Pro upgrade plans revealed: OLED display and M6 chip to debut together
Huan Qiu Wang· 2025-06-18 06:54
Group 1
- Apple plans a comprehensive upgrade for the MacBook Pro in 2026 to celebrate the product line's fifth anniversary [1]
- The 2026 MacBook Pro is expected to feature OLED display technology for the first time, enhancing brightness, color vibrancy, and contrast [3]
- The new MacBook Pro will adopt a thinner design and introduce the new M6 chip series, which is based on TSMC's advanced 2nm process, promising significant performance and energy efficiency improvements [3][4]
Group 2
- The 2026 MacBook Pro series is likely to include M6, M6 Pro, and M6 Max versions, available in 14-inch and 16-inch sizes [4]
- Apple is also developing a custom chip for its upcoming smart glasses, expected to begin production by the end of 2026 or in 2027, which will also be manufactured by TSMC [4]
- The company is collaborating with Broadcom to develop its first chip specifically designed for AI servers, codenamed Baltra, expected to be completed by 2027 [4]