Workflow
量子位
icon
Search documents
北大团队让AI学会考古!全球首个古希腊陶罐3D视觉问答数据集发布,还配了专用模型
量子位· 2025-11-06 13:22
Core Insights - The article discusses a groundbreaking research initiative from Peking University that has developed the world's first 3D visual question-answering dataset focused on ancient Greek pottery, named VaseVQA-3D, along with a specialized visual language model called VaseVLM [1][5]. Group 1: AI Development in Cultural Heritage - AI is evolving from being merely an image recognition tool to becoming a "cultural archaeology agent" capable of understanding complex cultural artifacts [2]. - Traditional visual language models (VLMs) like GPT-4V and Gemini struggle with cultural heritage objects due to limitations in training data and semantic modeling capabilities [3][6]. Group 2: VaseVQA-3D Dataset and Model - The VaseVQA-3D dataset includes over 30,000 2D images of ancient Greek pottery, which were transformed into 664 high-fidelity 3D models using TripoSG technology [11]. - The dataset also features 4,460 pairs of questions and answers related to the pottery, enhancing the AI's ability to provide detailed descriptions and answers [11][17]. Group 3: Model Training and Performance - The VaseVLM model was trained using a two-phase reinforcement learning approach, focusing on six semantic dimensions related to pottery [18]. - VaseVLM significantly outperformed existing models in various visual question-answering tasks, achieving a 12.8% increase in R@1 accuracy and a 6.6% improvement in vocabulary similarity [20]. Group 4: Future Prospects - The project aims to expand into more cultural heritage areas and establish improved digital heritage display methods, providing a new technological pathway for digital archaeology [22].
告别盲目卷参数!科大讯飞1024亮出底牌:all in“更懂你”
量子位· 2025-11-06 13:22
Core Viewpoint - The article emphasizes that the true competitive barrier in AI is not just about model size or intelligence, but about creating AI that truly understands and resonates with human needs, as demonstrated by iFLYTEK's latest advancements in AI technology [10][12][114]. Group 1: AI Understanding and Interaction - iFLYTEK's new AI model, Spark X1.5, aims to enhance emotional understanding and task comprehension, moving beyond traditional capabilities to truly "understand you" [6][14]. - The AI's ability to dynamically engage with users, recognizing emotions and intentions, marks a shift from basic interaction to empathetic communication [38][44]. - The integration of multi-modal interaction capabilities allows the AI to process and respond to complex human cues, enhancing user experience [42][46]. Group 2: Technological Advancements - The Spark X1.5 model is fully domestically developed, utilizing a completely independent computing platform without reliance on foreign technology [8][19]. - Significant improvements in reasoning and task decomposition capabilities have been achieved, with the model's reasoning efficiency rising from 25% to over 84% [22]. - The model's architecture has been upgraded to MoE, allowing for a reduction in total parameters while enhancing performance, achieving a 100% increase in reasoning speed compared to its predecessor [30][34]. Group 3: Industry Applications - iFLYTEK's AI technology is being applied across various sectors, including education and healthcare, with specific tools designed to enhance learning and medical diagnostics [75][83]. - The AI's capabilities in medical settings have reached a level comparable to senior physicians, showcasing its potential in assisting with diagnosis and patient management [76][84]. - In education, the AI has advanced from simple grading to detailed error analysis, significantly improving the efficiency and accuracy of assessments [83][86]. Group 4: Ecosystem and Developer Engagement - The growth of the developer ecosystem around iFLYTEK's AI has been rapid, with a notable increase in new developers contributing to the platform [106]. - iFLYTEK has launched an open-source platform to support the development of intelligent agents, aiming to foster innovation within the AI community [108]. - The company believes that a thriving ecosystem is essential for the future of artificial intelligence, emphasizing collaboration and shared growth [104].
量子位2025年度榜单申报倒计时!企业/产品/人物三大维度5类奖项即将截止
量子位· 2025-11-06 13:22
组委会 发自 凹非寺 量子位|公众号 QbitAI 人物榜 2025 人工智能年度 焦点人物 为了让更多从业者感受智能浪潮的跃迁,也为了给予更多同行同路人掌声与鼓舞,我们将正式启动 「2025人工智能年度榜单」评选报名 。 本次评选将从 企业 、 产品 、 人物 三大维度,设立五类奖项。欢迎企业踊跃报名! 让我们共同见证年度之星,点亮未来的方向。 企业榜 产品榜 详细评选标准及报名方式如下。 2025 人工智能年度领航企业 将面向中国人工智能领域,评选出最具综合实力的企业, 参选条件 : 2025 人工智能年度 领航企业 2025 人工智能年度 潜力创业公司 评选标准 : 2025 人工智能年度潜力创业公司 聚焦于中国人工智能领域创新创业力量,将评选出最具投资价值和发展潜力的AI创业公司, 参选条件 : 评选标准 : 2025 人工智能年度 杰出产品 2025 人工智能年度 杰出解决方案 1、注册地在中国,或主营业务主要面向中国市场; 2、主营业务属于人工智能及相关产业,或已将人工智能广泛应用于主营业务,并在细分领域居于行业领先地位; 3、具备成熟的产品或服务,已获得实际客户应用及市场认可; 4、近一年在技术 ...
报名启动!快来和张亚勤孙茂松一起参与MEET2026智能未来大会
量子位· 2025-11-06 09:16
Core Viewpoint - The article emphasizes the transformative impact of artificial intelligence (AI) on various industries and society as a whole, highlighting the upcoming MEET2026 Intelligent Future Conference as a platform to explore these advancements and trends [1][3]. Event Overview - The MEET2026 Intelligent Future Conference will focus on cutting-edge technologies and industry developments, particularly in AI [2]. - The theme of the conference is "Coexistence without Boundaries, Intelligence to Ignite the Future," aiming to discuss how AI can penetrate various industries, disciplines, and scenarios [3]. Key Topics - The conference will cover hot topics in the tech circle, including reinforcement learning, multimodal AI, chip computing power, AI applications in industries, and AI's global expansion [4]. - It will showcase the latest collisions between academic frontiers and commercial applications, featuring leading technological achievements from infrastructure, models, and product industries [5]. Reports and Awards - The conference will also feature the authoritative release of the annual AI rankings and the Annual AI Trend Report, which are highly anticipated [6]. Notable Speakers - The first wave of speakers includes prominent figures such as Zhang Yaqin, a world-class scientist and entrepreneur in AI and digital video [12][13]. - Other notable speakers include Sun Maosong, Wang Zhongyuan, Zhao Junbo, and Liu Fanping, all of whom have significant contributions to AI research and development [17][22][27][43]. Participation and Recognition - The AI Annual Rankings, initiated by Quantum Bit, have become one of the most influential rankings in the AI industry, evaluating companies, products, and individuals across three dimensions [60]. - The call for submissions for the Annual AI Trend Report is open, focusing on identifying and analyzing ten major AI trends for 2025 [65][66]. Conference Logistics - The MEET2026 Intelligent Future Conference will take place at the Beijing Jinmao Renaissance Hotel, with registration now open for attendees [70]. - The event is recognized as an annual influential technology business summit, attracting thousands of tech professionals and millions of online viewers [72].
量子位2025年度榜单申报倒计时!企业/产品/人物三大维度5类奖项即将截止
量子位· 2025-11-06 09:16
组委会 发自 凹非寺 量子位|公众号 QbitAI 为了让更多从业者感受智能浪潮的跃迁,也为了给予更多同行同路人掌声与鼓舞,我们将正式启动 「2025人工智能年度榜单」评选报名 。 本次评选将从 企业 、 产品 、 人物 三大维度,设立五类奖项。欢迎企业踊跃报名! 企业榜 产品榜 人物榜 2025 人工智能年度 焦点人物 详细评选标准及报名方式如下。 让我们共同见证年度之星,点亮未来的方向。 2025 人工智能年度领航企业 将面向中国人工智能领域,评选出最具综合实力的企业, 参选条件 : 2025 人工智能年度 领航企业 2025 人工智能年度 潜力创业公司 2025 人工智能年度 杰出产品 2025 人工智能年度 杰出解决方案 1、注册地在中国,或主营业务主要面向中国市场; 2、主营业务属于人工智能及相关产业,或已将人工智能广泛应用于主营业务,并在细分领域居于行业领先地位; 评选标准 : 2025 人工智能年度潜力创业公司 聚焦于中国人工智能领域创新创业力量,将评选出最具投资价值和发展潜力的AI创业公司, 参选条件 : 评选标准 : 3、具备成熟的产品或服务,已获得实际客户应用及市场认可; 4、近一年在技术 ...
双11电脑遭遇内存涨价潮,华强北“一天一个价”,雷军都感叹涨太多
量子位· 2025-11-06 06:38
Core Viewpoint - The article discusses the significant increase in memory prices driven by the surge in AI demand, affecting both consumer and enterprise markets, with prices expected to remain high for the foreseeable future [1][9][28]. Group 1: Price Increases - Memory prices have skyrocketed, with consumer-grade memory experiencing substantial increases [2][4]. - A report from TrendForce indicates that DRAM prices rose by 171.8% year-on-year in Q3 [12]. - Specific products, such as a 64GB DDR5 memory kit from Kingston, saw a price increase of 100 euros (50%) within a week, surpassing the price of the 9700X CPU [14]. - Personal transactions have also seen extreme price hikes, with one user reporting a memory kit purchased for $285 now priced at nearly $1600, representing over a 400% increase [17]. Group 2: Market Impact - The price surge is not limited to memory modules; entire systems are affected, with companies like Minisforum raising prices on all models equipped with SSDs and memory due to rising costs [24]. - The smartphone market is also impacted, with flagship models expected to see memory price increases of 20% to 30% next year [24]. - The CEO of Phison Electronics noted that such price increases may become commonplace in the near future [27]. Group 3: Causes of Price Surge - The demand for memory is shifting from traditional consumer markets to AI infrastructure and enterprise clients, leading to a significant increase in demand for server-grade memory and HBM (High Bandwidth Memory) [30][31]. - Major manufacturers like Samsung, SK Hynix, and Micron have paused DDR5 pricing for October, with inventory levels running low [28]. - IT companies are securing long-term DRAM supply contracts to ensure stability, contributing to a "lock-in" trend [32]. - The shift in demand is causing a reduction in DDR4 production as manufacturers prioritize AI-related memory types, leading to a direct impact on consumer-grade memory availability [33][34].
熬夜后走神不是你的错,而是自救机制在“洗脑” | Nature
量子位· 2025-11-06 06:38
Core Insights - A new study from MIT provides scientific evidence that lack of sleep can impair attention due to a physiological mechanism where cerebrospinal fluid (CSF) is involved in "cleaning" the brain during sleep, which is disrupted when sleep is insufficient [2][18] Group 1: Research Findings - The study reveals that during moments of attention lapses, a wave of cerebrospinal fluid flows out of the brain, a process typically occurring during sleep to clear waste accumulated during the day [2][18] - Insufficient sleep leads to the activation of this cleaning process during waking hours, resulting in significant attention deficits [18][21] - The research involved 26 healthy volunteers who underwent attention tests under both sufficient and insufficient sleep conditions, showing that sleep-deprived individuals had longer reaction times and sometimes failed to respond altogether [19][21] Group 2: Mechanism of Action - The study indicates that during deep sleep, a synchronized activity of neurons allows for a significant withdrawal of blood, creating space for cerebrospinal fluid to enter and cleanse the brain [15][26] - In contrast, when awake, the active state of neurons prevents this cleaning process, leading to a buildup of waste and forcing the brain to initiate a similar cleaning mechanism while awake, which detracts from attention resources [26][27] - The research suggests that there may be a central switch in the brain that regulates attention and the flow of cerebrospinal fluid, potentially influenced by norepinephrine [29]
何恺明MIT两名新弟子曝光:首次有女生入组,另一位是FNO发明者,均为华人
量子位· 2025-11-06 04:04
Core Insights - The article highlights the recruitment of two new Chinese students, Hu Keya and Li Zongyi, by AI expert He Kaiming at MIT, emphasizing their impressive academic backgrounds and contributions to the field of AI [1][4]. Group 1: Hu Keya's Background and Achievements - Hu Keya graduated from Shanghai Jiao Tong University and was involved in the Brain-Machine Interface Laboratory, focusing on AI applications in neuroscience [5][7]. - She authored a paper on self-supervised EEG representation learning, which was accepted at the EMBC conference, and presented her work in the U.S. [8][10]. - Hu participated in a project that improved self-supervised learning, leading to a paper accepted at the Cognitive Science 2025 conference [10]. - During her undergraduate studies, she interned at Cornell University, contributing to a project on program synthesis and code repair, resulting in a paper accepted at NeurIPS 2024 [11][12]. - Hu Keya led her team to win the "Best Paper Award" at the ARC Prize 2024 competition, showcasing her innovative approach to AI problem-solving [15][17]. - By the end of her undergraduate studies, she had published four high-impact papers, making her a highly sought-after candidate for PhD programs, ultimately choosing MIT [21][22]. Group 2: Li Zongyi's Contributions - Li Zongyi, known for his work on the Fourier Neural Operator (FNO), published a significant paper during his PhD that enabled the large-scale application of neural operators [27][29]. - The FNO allows neural networks to learn solutions to physical equations efficiently, significantly improving computational speed in various scientific applications [30][34]. - Li Zongyi's research has made him a key figure in the field of neural operators, with over 12,000 citations of his work [36]. - Currently, he is a postdoctoral researcher at MIT and is set to join New York University as an assistant professor in the upcoming fall [38][39]. Group 3: He Kaiming's Research Focus - He Kaiming has indicated that "AI for Science" will be a primary focus of his research in the coming years, aligning with the expertise of his newly recruited team members [46][48]. - The combination of Hu Keya's background in neuroscience and Li Zongyi's expertise in neural operators strengthens the team's capabilities in advancing AI applications in scientific research [48][49].
推翻「预测下一个token」范式!微信AI新研究:把token压缩成连续向量更具性价比
量子位· 2025-11-06 04:04
鱼羊 发自 凹非寺 量子位 | 公众号 QbitAI 大模型一个token一个token生成,效率太低怎么办? 微信AI联手清华大学,提出了一个新的解法: 一个token能装下的信息太少,不如把它们打包成 连续向量 , 让大模型从预测下一个token,转变为预测下一个向量 。 研究团队给这种新范式取名 CALM(连续自回归语言模型) 。 实验表明,将K个词元压缩成一个连续向量,可以将语言模型建模为一系列连续向量,生成步骤减少至原来的1/K。 还有网友提出,CALM像是DeepSeekOCR/Glyph的改进版。 研究人员指出,预测下一个token的现有模型范式,一开始是因为基于字符级运行的模型计算量太大而被提出的。 也就是说,方法背后的关键思想是:提升每个文本单元的信息密度,能够缩短序列长度并显著提升模型效率。 进一步挖掘本质,可以总结出一条提升大模型生成效率的有效途径: 持续提升每个预测单元的语义带宽 。 这样一来,模型就能在平衡性能和计算成本时,实现更高的性价比。 有网友认为,这种方法看上去越来越接近大脑实际处理上下文的方式。 提升每个预测单元的语义带宽 问题在于,如果想让一个token装更多的信息,就得 ...
第一批买机器人做家务的人崩溃了
量子位· 2025-11-05 09:30
Core Viewpoint - The article discusses the challenges and limitations of autonomous home robots, suggesting that remote-controlled services may be a more practical solution for household tasks [3][7][16]. Group 1: Autonomous Robots Issues - Autonomous home robots often cause accidents, such as crashing into mirrors or kitchen cabinets, leading to frustration among users [2][9][11]. - The current technology for home robots lacks sufficient autonomy, making it difficult for them to perform basic tasks like avoiding obstacles or cooking [16][27]. - Users have reported dissatisfaction with the high prices of these robots, which do not meet their expectations for functionality [15][27]. Group 2: Remote-Controlled Solutions - Carmack suggests that selling remote-controlled home robots, like the NEO from 1X Technologies, could be more beneficial than fully autonomous models [3][4]. - The NEO robot, which can perform tasks like folding clothes and fetching water, is operated remotely via a VR headset, allowing for flexibility in task management [5][7]. - Renting the NEO at $500 per month equates to hiring a domestic helper at a rate of $2 per hour, presenting a cost-effective alternative [17]. Group 3: Consumer Concerns - Many consumers lack clear evaluation standards for emerging technologies like home robots, making them susceptible to misleading marketing [24][25]. - The trend of pre-selling home robots, with payments made for products that will not be delivered until years later, raises concerns about consumer trust and product viability [20][21][22]. - The expectation of "hands-free" convenience often leads to disappointment when the actual performance of the robots falls short [25][27].