MLX框架

Search documents
腾讯研究院AI速递 20250718
腾讯研究院· 2025-07-17 14:12
Group 1 - Google DeepMind's MoR architecture achieves two times inference speed by combining parameter sharing and adaptive computation, resulting in fewer parameters while maintaining large model performance [1] - The dynamic routing mechanism allocates different recursive depths based on token complexity, reducing redundant computations and optimizing KV cache [1] - Experimental results show that MoR improves inference throughput by 2.06 times, reduces training time by 19%, and decreases peak memory usage by 25% [1] Group 2 - Amazon launches Bedrock AgentCore preview, offering seven core AI agent services including runtime, memory, and authentication [2] - The introduction of Nova customization options and Strands Agents V1.0 simplifies agent development and enables multi-agent collaboration [2] - Amazon S3 Vectors cloud object storage is released, reducing vector storage costs by 90%, along with Kiro AI IDE to enhance developer experience [2] Group 3 - Elon Musk is seeking names for the male AI companion Grok, with suggestions like "Draven" that align with characters from "Twilight" and "Fifty Shades of Grey" [3] - A user named Jackywine has created an open-source 3D digital companion "Bella," which retains only the visual aspect without large language model capabilities [3] - The "Bella" project follows an "AI native" development path in three phases: perception core, generative self, and proactive companionship, with plans to incorporate voice recognition and affinity systems [3] Group 4 - Google Search introduces an AI feature that can make phone calls to book local services for users, such as pet grooming [4] - The search integrates the Gemini 2.5 Pro model and Deep Search functionality, capable of handling complex queries and generating in-depth reports [4] - This new feature has launched in the U.S. and will be gradually rolled out globally, sparking discussions about the effectiveness of AI automated calls and merchant experiences [4] Group 5 - The AI programming platform Windsurf reintroduces the Claude Sonnet 4 model, allowing Pro users 250 free calls per month [6] - Claude Sonnet 4 offers advantages such as cross-file intelligent refactoring, a 200,000 token context window, and precise code completion [6] - This renewed partnership follows OpenAI's acquisition failure and executive team changes, representing Windsurf's strategic move to regain user trust [6] Group 6 - Anthropic successfully rehires core programming leaders Boris Cherny and Cat Wu from Cursor within two weeks [7] - Anthropic reveals that direct sales of models and Claude yield a gross margin of 60%, while sales through AWS and Google Cloud result in a negative 30% margin [7] - Claude Code has become a new asset for Anthropic, with weekly downloads increasing sixfold to 3 million since June, contributing over $200 million in annualized revenue [7] Group 7 - CrePal launches the first AI video creation agent, allowing users to produce videos through a single command that orchestrates multiple models [8] - The system can automatically plan scripts, select appropriate models, generate visuals, and add sound effects, addressing high barriers in traditional AI video creation [8] - The innovation lies in transforming the creative process, enabling users to focus on creative expression rather than technical operations by integrating dispersed tools into a unified intelligent task [8] Group 8 - Apple's MLX framework adds CUDA support, enabling developers to train models using NVIDIA GPUs and deploy them back to Apple devices [9] - This move is seen as Apple's concession to the NVIDIA ecosystem, which dominates AI development with 5 million developers [9] - Despite past tensions over NVIDIA support, Apple opts to leverage NVIDIA's ecosystem for compliance and to expand its influence [9] Group 9 - HeShan Technology, founded by alumni from Tsinghua and Beihang University, focuses on AI tactile sensing technology and has developed the world's first AI tactile perception chip [10] - Utilizing capacitive tomography technology, HeShan achieves "sensing and control integration," addressing the tactile feedback needs in robotic precision operations [10] - The company has completed four rounds of financing and serves over 70% of domestic robot manufacturers, transitioning from a hardware provider to a comprehensive tactile solution provider [10] Group 10 - Nobel laureate John Jumper discusses the journey of AlphaFold, highlighting that the value of algorithm research is 100 times that of data [11] - AlphaFold predicts protein structures with atomic-level precision and has been cited 35,000 times, accelerating scientific discoveries [11] - Jumper predicts that AI4Science will become more generalized in the future, with AlphaFold enhancing the pace of structural biology development by 5-10%, leading to widespread advancements across scientific fields [11]
苹果向英伟达生态妥协了!MLX框架主动适配CUDA
量子位· 2025-07-17 05:52
一水 发自 凹非寺 量子位 | 公众号 QbitAI 苹果向英伟达生态妥协了! 最新消息,苹果之前特意为端侧AI模型训练推出的 MLX框架 , 主动增加了CUDA支持 。 消息一出即在Hacker News引发热烈讨论: 要知道苹果一直以来都以"封闭"著称,但随着英伟达CUDA生态在AI开发领域占据绝对主导地位,苹果这下也不得不转变姿态了。 再加上英伟达市值创下前无古人的4万亿美元新纪录,以及最近释出的一系列利好消息,苹果选择避其锋芒也就不难理解。 可以说,苹果这就是明晃晃地借了英伟达东风,以进一步抢夺AI市场。 CUDA太强,不得不拥抱 为啥要拥抱CUDA?没啥,太强了,苹果自己也这么说。 官方理由如下: (1) 统一内存支持 :CUDA提供统一内存机制,便于不同设备间的数据共享与迁移,提升开发效率和性能表现。 (2) 跨平台部署需求 :英伟达硬件在学术研究和大规模计算中应用广泛,支持CUDA能让开发者在Mac上本地开发测试,随后无缝部署到 配备英伟达GPU的服务器或超级计算机上。 而通过让MLX框架主动适配CUDA, 今后苹果开发者也能利用英伟达GPU训练模型 。 其本质是增加了对CUDA的后端支持,方便 ...
苹果MLX框架新增对CUDA的支持
news flash· 2025-07-15 08:01
Core Viewpoint - The addition of CUDA support to Apple's MLX framework enhances its capabilities for machine learning and artificial intelligence applications, potentially increasing its competitiveness in the tech industry [1] Group 1 - The MLX framework is now compatible with CUDA, which is widely used for parallel computing and deep learning [1] - This update may attract developers who rely on CUDA for their machine learning projects, expanding Apple's developer ecosystem [1] - The integration of CUDA could lead to improved performance and efficiency in machine learning tasks on Apple devices [1]
特朗普呼吁伊朗无条件投降;油价大涨逾4%,特斯拉跌近4%;外交部:正迅速组织撤离中国公民;阿里+苹果,大消息丨每经早参
Mei Ri Jing Ji Xin Wen· 2025-06-17 22:01
Market Overview - US stock markets opened lower and closed down, with the Nasdaq falling by 0.91%, S&P 500 down 0.84%, and Dow Jones down 0.7% [5] - Major tech stocks declined, with Tesla dropping nearly 4% and Apple down over 1% [5] - WTI crude oil futures rose by $3.07, a 4.28% increase, closing at $74.84 per barrel [7] - European major stock indices also closed lower, with Germany's DAX30 down 0.86% and France's CAC40 down 0.63% [8] Foreign Investment and Market Trends - In May, foreign investment in domestic stocks increased compared to the previous month, with a net inflow of $33 billion in cross-border funds [13][14] - The foreign exchange market remained stable, with a shift to surplus in bank foreign exchange sales and purchases [14] Corporate Developments - Nvidia will debut at the upcoming Chain Expo, which is expected to enhance collaboration and innovation in the AI industry [21] - Alibaba's Tongyi team has launched a new quantization model compatible with Apple's MLX framework, which is anticipated to boost AI applications on Apple hardware [22] - JD Group's chairman Liu Qiangdong announced plans to develop a business model distinct from Meituan, focusing on supply chain profitability [23] - Li Auto responded to the stock reduction by Meituan's CEO, clarifying it was a personal decision and did not affect Meituan's holdings [24] - Baoneng Auto reassured stakeholders of its normal operations amid rumors of liquidation, stating new vehicle launches are forthcoming [28] Technological Innovations - Rokid and Alipay have collaborated to introduce a smart glasses payment solution, enhancing user experience in payment methods [29] - WeChat is optimizing its chat record backup feature to support multiple storage devices, improving user convenience [31] Legal and Regulatory Issues - Tesla faces a lawsuit from a Chinese owner regarding the non-functionality of the FSD feature, highlighting potential consumer rights concerns [32] - A former president of Zheshang Securities has filed a lawsuit against the company, raising questions about management stability [27]
为国行苹果智能做准备!阿里巴巴发布升级版Qwen3:全系适配苹果MLX架构
硬AI· 2025-06-17 14:30
这意味着,从Mac Pro、Mac Studio到Mac mini、MacBook,再到iPad,甚至内存更小的iPhone,都能轻松部署 Qwen3。 硬·AI 作者 | 李笑寅 编辑 | 硬 AI 周一,阿里巴巴通义千问宣布,正式发布基于苹果MLX框架深度优化的全部Qwen3系列模型。 此举被看作是为国行苹果智能做准备。此前有消息称,阿里巴巴将成为苹果在中国大陆的大模型合作商。 公告显示,团队将一次性全部开源32款官方Qwen3 MLX模型,每款模型都有4bit、6bit、8bit和BF16等4 种不同精度的量化版本,从而实现这些模型在iPhone、iPad,以及Mac电脑上的轻松部署,做到全场景覆 盖。 目前,Qwen3的MLX模型已在魔搭社区和Hugging Face全面开源。 硬·AI * 感谢阅读! * 转载、合作、交流请留言,线索、数据、商业合作请加微信:IngAI2023 * 欢迎大家在留言区分享您的看法, 如果您能点个并分享的话,那就太感谢啦! * 让我们一起,好奇地看世界 据官方介绍,MLX是一个开源的机器学习框架,专为苹果芯片深度适配。MLX框架可高效地训练和部署AI 大模型,被越来越多 ...