Workflow
Apple Foundation模型
icon
Search documents
马斯克称Grok最迟下周登陆特斯拉汽车;庞若鸣晒出在苹果的最新论文丨全球科技早参
Mei Ri Jing Ji Xin Wen· 2025-07-11 00:00
Group 1: AI and Energy Sector - US electricity suppliers are seeking significant price increases for consumers due to the surge in demand from AI data centers, with a reported 142% increase in rate hike applications totaling $29 billion in the first half of 2025 compared to the previous year [2] - This trend highlights the impact of AI technology on energy infrastructure and the challenges in balancing energy policy with technological innovation and consumer protection [2] Group 2: Tesla and AI Innovations - Elon Musk announced that Grok will soon be integrated into Tesla vehicles, with a timeline set for next week, indicating Tesla's ongoing innovation in AI and autonomous driving technology [3] - Tesla is also expanding its Robotaxi service in Austin and is awaiting regulatory approval to launch the service in the Bay Area within one to two months [3] Group 3: Amazon and AI Investment - Amazon is considering a substantial additional investment in AI startup Anthropic to strengthen their strategic alliance, building on an existing investment of $8 billion [4][5] - This move reflects Amazon's strategic positioning in the AI sector and its focus on acquiring top talent [5] Group 4: AI in the Food Industry - A restaurant named WOOHOO, set to open in Dubai in September, will feature an "AI chef" named Aiman, which will design menus and service, although human chefs will still prepare the food [6][7] - This opening signifies the innovative application of AI technology in the restaurant industry [7] Group 5: Research and Development in AI - A research paper titled "AXLearn: Modular Large Model Training on Heterogeneous Infrastructure" was shared by a key figure from Apple's foundational model team, showcasing advancements in scalable deep learning model training [8] - This research underlines Apple's technical capabilities in AI foundational model training [8]
Meta为他豪掷2亿美元,上交校友庞若鸣,晒出在苹果的最新论文
机器之心· 2025-07-10 10:49
Core Viewpoint - The article discusses Ruoming Pang's transition from Apple to Meta, highlighting his contributions to Apple's foundational model and the development of AXLearn, a modular large model training system designed for heterogeneous infrastructure. Group 1: Ruoming Pang's Transition - Ruoming Pang, head of Apple's foundational model team, is moving to Meta's newly established superintelligence team, with a reported offer of $200 million [2][3]. - Despite the transition, Pang continues to contribute to Apple by promoting his research on AXLearn [3][4]. Group 2: AXLearn Overview - AXLearn is a production-grade system designed for large-scale deep learning model training, emphasizing scalability and high performance [6]. - The system features a modular design and comprehensive support for heterogeneous hardware infrastructure, allowing for efficient integration of functionalities like Rotary Position Embeddings (RoPE) with minimal code [6][8]. - A new method for measuring modularity, based on lines of code (LoC-complexity), is introduced, showing that AXLearn maintains constant complexity during system expansion, unlike other systems that exhibit linear or quadratic growth [7][23]. Group 3: Performance Evaluation - AXLearn's training performance is compared with systems like PyTorch FSDP, Megatron-LM, and MaxText across various hardware platforms, demonstrating competitive iteration times and throughput [26][29]. - The system shows near-linear scalability in weak-scaling experiments, indicating its robustness in handling increased workloads [30]. Group 4: Production Use and Impact - AXLearn has evolved from a tool for a few developers to a large platform supporting hundreds of developers in training models with billions to trillions of parameters [35]. - It can concurrently support over 10,000 experiments and is deployed across various heterogeneous hardware clusters, contributing to features used by billions of users [36][37].