量子位

Search documents
 人工智能年度榜单火热报名中!五大奖项,寻找AI+时代的先锋力量
 量子位· 2025-10-26 04:01
组委会 发自 凹非寺 量子位|公众号 QbitAI 为了让更多从业者感受智能浪潮的跃迁,也为了给予更多同行同路人掌声与鼓舞,我们将正式启动 「2025人工智能年度榜单」评选报名 。 本次评选将从 企业 、 产品 、 人物 三大维度,设立五类奖项。欢迎企业踊跃报名! 让我们共同见证年度之星,点亮未来的方向。 企业榜 产品榜 人物榜 2025 人工智能年度 焦点人物 详细评选标准及报名方式如下。 2025 人工智能年度领航企业 将面向中国人工智能领域,评选出最具综合实力的企业, 参选条件 : 2025 人工智能年度 领航企业 2025 人工智能年度 潜力创业公司 2025 人工智能年度 杰出产品 2025 人工智能年度 杰出解决方案 1、注册地在中国,或主营业务主要面向中国市场; 2、主营业务属于人工智能及相关产业,或已将人工智能广泛应用于主营业务,并在细分领域居于行业领先地位; 3、具备成熟的产品或服务,已获得实际客户应用及市场认可; 4、近一年在技术创新、产品落地、市场拓展或商业模式上取得显著突破。 评选标准 : 2025 人工智能年度潜力创业公司 聚焦于中国人工智能领域创新创业力量,将评选出最具投资价值和发 ...
 OpenAI被曝瞄准AI音乐赛道商业化,Suno首当其冲
 量子位· 2025-10-26 04:01
 Core Viewpoint - OpenAI is preparing to enter the AI music generation market, which poses a significant threat to existing startups like Suno, valued at $2 billion, as they may be overshadowed by OpenAI's capabilities [1][2][11].   Group 1: OpenAI's Entry into AI Music - OpenAI has been collaborating with the Juilliard School to develop a music generation model, aiming to automate and personalize music creation for content creators [7][8]. - The new music model is expected to integrate with existing OpenAI products, potentially allowing users to generate background music for videos easily [7][10]. - The competition in the AI music space is currently limited, with the top ten platforms holding only about 24% of the market share, indicating room for growth and disruption [12].   Group 2: Market Dynamics and Competitors - Suno and Udio are the two most notable players in the AI music generation market, with Suno focusing on accessibility for all users and Udio targeting professional users [12][13][14]. - Suno has reported an annual recurring revenue (ARR) of $150 million, with a nearly fourfold year-on-year growth, and a gross margin exceeding 60%, highlighting the profitability of the AI music sector [29][30][31]. - Other companies, including ByteDance, Alibaba, and Tencent, are also exploring AI music generation, indicating a growing interest in this market [16][18].   Group 3: Historical Context and Future Implications - OpenAI previously attempted to enter the music space with models like MuseNet and Jukebox but faced funding challenges that limited their progress [22][25]. - The renewed focus on music generation aligns with OpenAI's strategy to diversify its product offerings and generate revenue to offset operational costs [26][34]. - The entry of a tech giant like OpenAI into the AI music market is expected to accelerate innovation and provide consumers with more choices [20][34].
 破解AI对不同上下⽂位置的敏感度不⼀致,新框架使出“解铃还须系铃人”
 量子位· 2025-10-26 04:01
Pos2Distill团队 投稿 量子位 | 公众号 QbitAI 语言模型遭遇严重的位置偏见,即模型对不同上下⽂位置的敏感度不⼀致。模型倾向于过度关注输⼊序列中的特定位置,严重制约了它们在复 杂推理、⻓⽂本理解以及模型评估等关键任务上的表现。 例如,在对⽐两个候选答案时,模型常因偏好⾸个选项⽽损害其作为评估器的公正性与可靠性。 针对这⼀挑战,论⽂提出了 Pos2Distill,⼀个创新的"位置到位置"蒸馏框架。该框架旨在将模型在优势位置的强⼤能⼒迁移⾄劣势位置,从 ⽽有效缓解位置偏⻅。 其核⼼思想恰如古语所云:"解铃还须系铃⼈",利⽤模型⾃⾝已习得的知识,来纠正其⾃⾝的系统性偏差。 一类工作 试图通过修改与上下⽂敏感度不均相关的关键架构组件或内部表示来进⾏减轻位置偏见。然⽽,尽管近期在缩⼩性能差距⽅⾯取得 了⼀些进展,模型在"优势位置"和"劣势位置"之间的信息利⽤率依然存在巨⼤差异。 其基本原理可以概括为:利⽤位置本⾝造成的性能不均衡,来对抗位置偏差这⼀问题。 团队发现,位置偏差在"检索"和"推理"这两类任务中诱发的表现不同,因此基于上述核⼼原理,团队分别设计了两种专⻔的实现⽅案: Pos2Distill- ...
 P图老本事搭上了对话框,美图这AI Agent到底香不香?
 量子位· 2025-10-26 04:01
梦瑶 发自 凹非寺 量子位 | 公众号 QbitAI 这些刷屏的AI图片,你刷到了没? 就是那种——和自己专属emoji合影的黏土风照片,画风长这样: 本来以为是NanoBanana整的新玩法,结果点开评论区一看:嗐,原来是美图自家的AI Agent——叫 RoboNeo ~ 一向本着"啥都得测测"的原则,我也来试了试,立刻跟风搓了个黏土风Q版形象! 还有这个,全网最近爆火的马维斯风宋朝打工人图,仕女眼神已经写明白了——不想卷、不敢躺: 图上整活还不够,这Agent还支持生成视频,来看看网友搞的这一段转绘动画,有热血校园内味儿了: 还是熟悉的配方,熟悉的味道——美图这一波,又把自家祖传手艺拿出来开整了。 话不多说,直接开测,看看这RoboNeo到底香否! 一个能边唠嗑边P图的快乐老家 实测前先唠唠RoboNeo的web端 生图页面 。 说实话,第一眼看上去,其实有点像Lovart的表亲…… 只不过页面布局反着来了:左边聊天,右边编辑~ 我合理怀疑产品经理是考虑了Chinese宝宝的写字习惯: 从左到右更顺手~(毕竟从小老师都教这么写的) 操作也简单,就像日常跟AI唠嗑一样跟它聊就行,把甲方需求丢进对话框,它就 ...
 盲人复明!马斯克Neuralink联创实现人工视觉里程碑
 量子位· 2025-10-26 04:01
鹭羽 发自 凹非寺 量子位 | 公众号 QbitAI 盲人复明 ,太了不起了。 这可能是2025年最低调但又最闪亮的科技进展了。 Nature最新刊登了新研究进展,人工视觉技术刚刚帮助一位70岁奶奶重获光明。 在失明之前,我是个狂热的书虫,我想把它找回来。 70岁的Sheila Irvine (希拉·欧文) 最大的愿望是能够再次阅读,而就在最近她的愿望成真了。 原因来自于一项世界首创的人工视觉研究 PRIMA 。 其背后带队的还是当年和马斯克一起创办Neuralink的联合创始人,现在自己创业,做的还是 视网膜植入物 。 厚度只有一根头发丝大小,却能够让 80% 的患者视力得到显著改善,并且能够顺利阅读字母、数字和单词。 对此,论文主要作者Frank Holz表示: 该研究首次证明人工视觉可以恢复患者的功能性中央视力,为失明者带来了希望。 而对于患者本身及其家人,或许这将是人至暮年,一次宝贵的再次见面的机会: 失明15年,终于重获光明的她 在参与实验之前,希拉是这样描述自己的: 我的眼睛就像有两个黑色的椎间盘,外面扭曲了。 故事的开始还要回到15年前,那是她第一次发现自己开车不受控制,一直在撞击人行道,当她走 ...
 这个时代最缺的是「个人上下文」丨对话flomo浮墨笔记
 量子位· 2025-10-26 01:21
 Core Insights - The article discusses the evolution and differentiation of AI note-taking products, emphasizing the need for unique features and user trust in a competitive market [4][6]. - Flomo, a lightweight note-taking tool, focuses on fragmented knowledge management, encouraging users to record thoughts first and organize them later [7][10].   Group 1: AI Note-Taking Market - The AI note-taking market is becoming crowded with various products, including traditional note apps with AI features and native AI note apps [4]. - Differentiation in this market is crucial, especially as AI summarization and question-answering features become standard [4][10].   Group 2: Flomo's Unique Positioning - Flomo aims to help users reconnect with their personal context and thoughts, emphasizing the importance of personal records over AI-generated content [13][14]. - The primary user demographics for Flomo include those who record emotions and knowledge, with a significant portion using it for daily notes and creative content collection [15][16].   Group 3: Product Features and User Engagement - Key features of Flomo include AI voice input, daily reviews, related notes through semantic analysis, and AI insights that help users identify patterns in their thoughts [10][11]. - Flomo's design encourages users to record thoughts quickly, with features like a small input box and visual feedback to motivate recording [22][24].   Group 4: Product-Market Fit (PMF) and User Research - Flomo's approach to confirming PMF involves extensive user research and understanding the diverse needs of note-taking users [31][32]. - The company has identified that many users prefer simple, fragmented recording tools over complex note-taking systems [32][33].   Group 5: AI Integration and Future Directions - Flomo is cautious about integrating AI, focusing on features that meet genuine user needs, such as AI insights and related note matching [57][59]. - Future developments include enhancing AI capabilities, expanding context handling, and providing users with various perspectives on their notes [66][67].   Group 6: User Trust and Emotional Connection - Building user trust is essential, with Flomo prioritizing user privacy and ensuring that personal data remains secure [74][78]. - The emotional connection with users is fostered through features that allow for personal reflection and recognition of progress in their recorded thoughts [75].
 智元办机器人挑战赛:清华&上海AILab夺冠,华南理工“单人成团”拿亚军
 量子位· 2025-10-25 10:30
 Core Insights - The AGIBOT World Challenge, organized by Zhiyuan Robotics and OpenDriveLab, concluded successfully in Hangzhou during IROS, showcasing intense competition among top global teams in various physical tasks [2][4][48] - The AIR-DREAM team from Tsinghua University and Shanghai AI Lab won the championship, while South China University of Technology and the University of Hong Kong secured the second and third places respectively [4][10][50]   Competition Overview - The competition featured 11 elite teams from around the world, competing in six real-world physical tasks such as object manipulation, dynamic sorting, and kitchen operations [4][6][19] - The event aimed to test the operational precision and generalization capabilities of embodied intelligent systems [6][19]   Task Details - Each team performed 10 attempts per task, with scores averaged for the final results, using the UniVLA baseline model [20] - The six tasks included:   - **Pack groceries**: Teams had 90 seconds to grab three snacks and place them in a bag, with a maximum score of 6 points [22][24]   - **Pack items from conveyor**: In 90 seconds, teams needed to identify and grab items from a moving conveyor, also scoring up to 6 points [26][29]   - **Fold short sleeves**: Teams had 150 seconds to fold clothing, with a maximum score of 4 points [30][32]   - **Microwave the food**: This task involved a series of steps to operate a microwave within 150 seconds, scoring up to 6 points [35][37]   - **Restock the hanging area**: Teams had 60 seconds to place items on shelves, scoring up to 2 points [39][41]   - **Pour water**: In 60 seconds, teams had to pour a specified amount of water, with a maximum score of 4 points [43][45]   Technical Insights - The AIR-DREAM team introduced the X-VLA model, a scalable and simplified visual-language-action model that addresses challenges in heterogeneous robot data [11][13] - The second-place team shared strategies for achieving high success rates with limited computational resources, focusing on quick fine-tuning of pre-trained models [15] - The third-place team utilized a pre-trained model and a simulation platform for data generation and parallel reinforcement learning, achieving efficient technical solutions in a short timeframe [17]   Event Highlights - The AGIBOT World Challenge featured a total prize pool of $560,000, with the manipulation track offering $60,000, and the champion team receiving $10,000 [48][51] - The event also highlighted the launch of the new "archery" robot, the Spirit G2, which was showcased for the first time at IROS [53]
 AI产品先发优势在于用户迁移成本高,持续为用户提供价值是保持竞争优势的关键  | 对话AI智能电子衣橱工具搭搭
 量子位· 2025-10-25 10:30
 Core Insights - The article discusses the emerging field of AI smart wardrobes, which aims to transform consumers' existing clothing resources into personalized styling services using AI technology [3][4]. - The current market for AI smart wardrobe products is relatively sparse, with existing functionalities primarily focused on clothing uploading, categorization, and outfit suggestions [4]. - The competitive landscape is less intense compared to other AI sectors, but challenges remain in user acquisition, feature optimization, and value creation [5][6].   Group 1: Product Features and User Engagement - The AI smart wardrobe product "Dada" has reached 2 million users, offering features such as AI storage, wardrobe management, and smart outfit recommendations [8]. - Users can upload clothing through various methods, including photo uploads and smart recognition, and the system categorizes items based on multiple tags [8]. - The platform emphasizes user engagement through DIY outfit creation and community sharing, which enhances the overall user experience [15][19].   Group 2: Market Potential and Strategic Insights - The founder of Dada, Guo Liangbing, highlights the significant market potential in the clothing sector, driven by a growing demand for fashion and aesthetics [21][22]. - The initial focus on electronic wardrobe tools is seen as a starting point, with plans to integrate AI capabilities for broader wardrobe management services [23]. - The company aims to differentiate itself by focusing on maximizing the utility of existing clothing rather than promoting new purchases, positioning itself as a "wardrobe manager" [44][45].   Group 3: User Acquisition and Growth Strategy - Dada's user growth strategy capitalized on the traffic benefits from platforms like Douyin, achieving a low customer acquisition cost of around 0.1 to 0.3 yuan [66]. - The company utilized a "probability" strategy by engaging "ordinary" users to create content, which proved effective in driving user engagement and conversion [66][67]. - The app's features, such as outfit diaries and community sharing, encourage users to actively participate and promote the product organically [68][70].   Group 4: AI Integration and Future Development - AI technology plays a crucial role in automating clothing recognition and outfit generation, significantly reducing the manual workload for users [41][42]. - The company plans to enhance its AI capabilities further, focusing on personalized recommendations and visualizing how clothes will look on users through AR technology [88][90]. - Continuous iteration and user feedback are central to the product development process, ensuring that new features align with user needs and preferences [52][56].
 量子计算摆脱GPU!IBM一句话AMD市值暴涨2000亿元:用FPGA芯片即可
 量子位· 2025-10-25 08:30
 Core Insights - IBM has made significant progress in the commercialization of quantum computing by successfully running a key quantum error correction algorithm on existing AMD chips, achieving a speed that is ten times faster than required [2][4] - This breakthrough allows quantum error correction to be implemented without the need for expensive GPU clusters, utilizing FPGA chips instead, which enhances scalability and cost-effectiveness [2][4]   Company Impact - Following the announcement, AMD's stock price rose by 7.63%, increasing its market capitalization by $29 billion to $410 billion, which is approximately 1/11th of Nvidia's market cap [5] - IBM also experienced a market cap increase of $20.9 billion, bringing its total to $286.4 billion [7]   Quantum Computing Challenges - The algorithm addresses one of the core challenges in quantum computing: the fragility and high error rates of quantum bits (qubits) [10] - Quantum bits are highly unstable and can lose their quantum properties due to environmental factors, a process known as "decoherence" [11][12]   Quantum Error Correction Mechanism - To overcome the challenges of qubit instability, quantum error correction codes (QECC) are employed, which use multiple unstable physical qubits to encode a stable logical qubit [14] - The process involves auxiliary qubits performing "ancillary measurements" to detect errors without destroying the quantum information encoded in the logical qubit [15] - The measurement results are sent to a classical processor that runs a decoding algorithm to identify and correct errors, which must be completed within tens of microseconds to prevent loss of quantum information [16][17]   FPGA Advantage - The use of FPGA chips is crucial as they can respond in nanoseconds, making them thousands of times faster than traditional software decoding methods [18] - IBM's original plan to develop the Starling quantum computer by 2029 has been accelerated to 2028 due to this breakthrough [19]
 马斯克盛赞朱雀三号:能够击败SpaceX猎鹰9号
 量子位· 2025-10-25 08:30
 Core Viewpoint - The article discusses the potential of China's reusable rocket, Zhuque-3, to surpass SpaceX's Falcon 9 in the near future, highlighting advancements in China's aerospace technology [1][2][3].   Group 1: Zhuque-3 Overview - Zhuque-3 is expected to be China's first truly reusable launch vehicle, with its maiden flight scheduled for November [7][9]. - The rocket features a stainless steel structure, a diameter of 4.5 meters, a length of 66.1 meters, and a launch mass of approximately 570 tons [11]. - It is equipped with nine Tianque-12A liquid oxygen-methane engines, providing a thrust of over 750 tons [11].   Group 2: Technological Advancements - Zhuque-3 utilizes a liquid oxygen-methane fuel combination, which offers advantages such as cleanliness, reusability, and cost-effectiveness compared to traditional fuels [12]. - The rocket is designed for high-precision autonomous return and soft landing for reuse after missions, embodying the concept of "fly, recover, and fly again" [12].   Group 3: Cost Competitiveness - Zhuque-3 aims to reduce launch costs to below 20,000 yuan per kilogram, making it competitive with Falcon 9, which costs approximately 3,000 USD per kilogram [13].










