全模态模型

Search documents
「CV 铁三角」落定Meta,视觉 AI 如何向多模态演进?
机器之心· 2025-07-19 05:49
Group 1 - The core viewpoint of the article discusses the strategic hiring by Meta, focusing on the "CV Triangle" and its implications for the evolution of visual AI towards multimodal capabilities [4][5][6] - The "CV Triangle" consists of three key researchers from OpenAI Zurich, previously from GoogleBrain, whose work has significantly influenced the development of modern multimodal AI frameworks [5][6] - The article outlines five representative works led by the "CV Triangle," including S4L, BiT, ViT, MLP-Mixer, and PALI, which collectively contribute to the advancement of visual AI and its integration with other modalities [5][6][7] Group 2 - The article highlights the milestones necessary for the transition from visual AI to multimodal AI, emphasizing the importance of continuous research and development in this field [8]
整理:每日科技要闻速递(5月27日)
news flash· 2025-05-26 23:36
New Energy Vehicles - Lithium carbonate futures have fallen below 60,000 [1] - Concerns arise over a new price war initiated by BYD, with industry insiders suggesting that "hidden price cuts" may persist long-term [1] Technology Developments - Tencent is set to release the world's first multimodal model "Hunyuan-O" [2] - Microsoft has open-sourced a browser agent that can track and control intelligent agents in real-time [2] - Apple is expected to undergo a design revolution for its all-platform operating system [2] - A new myasthenia gravis drug, Udis, has been launched in China by UCB [2] - Apple is rumored to adjust its release strategy to launch two new iPhone models each year [2] - OpenAI plans to establish an office in Seoul within the next few months [2] - Xiaomi has denied rumors that its Xuanjie O1 is a custom chip for Arm [2] - Samsung's HBM3E has nearly passed Nvidia's single-chip certification, although final product certification may be delayed until the second half of the year [2] E-commerce and Delivery Services - Meituan reported that the average monthly income for high-frequency delivery riders in first-tier cities is 10,010 yuan [2] - Meituan's CEO Wang Xing responded to JD.com's 10 billion yuan subsidy for food delivery, stating that the company will spare no effort to win the competition [2] - Approximately 52% of Meituan's new code is generated by AI [2]
王健林再卖48座万达广场,腾讯等“熟人团”接盘;两辆车在充电站起火燃烧,蔚来回应;董明珠孟羽童合体带货500万元丨邦早报
创业邦· 2025-05-26 00:03
Group 1 - Wang Jianlin sells 48 Wanda Plaza properties to a consortium including Tencent and other familiar investors, with the transaction approved unconditionally by the State Administration for Market Regulation [3] - NIO responds to a fire incident at a charging station, stating that its vehicles were ignited by another brand's vehicle, with no injuries reported [3] Group 2 - Dong Mingzhu and Meng Yutong's joint live-streaming event achieved sales of 5 million yuan, with viewership reaching 2.92 million, a significant increase compared to the usual 40 viewers [5] Group 3 - BYD launches a promotional campaign with price reductions on 22 models, with discounts up to 53,000 yuan, indicating a competitive shift in the automotive market [12] - BYD's electric vehicle sales in Europe reached 7,231 units in April, a 169% year-on-year increase, surpassing Tesla for the first time [19] Group 4 - Nvidia plans to launch a new AI chip for the Chinese market, priced between $6,500 and $8,000, significantly lower than the previous H20 chip [9][10] - Apple is expected to release a smart home hub by the end of the year, which has been delayed due to challenges in AI development [10] Group 5 - Guangzhou is set to introduce measures to support the gaming and esports industry, including funding and tax incentives [19] - The Middle East smartphone market saw a 4% decline in Q1 2025, with Samsung, Transsion, and Xiaomi leading the market [20]