PPIO's Yao Xin: AI Is Entering an Era of Autonomous Action and Creation, and Agents Need a Brand-New Operating System | MEET2026
量子位 (QbitAI) · 2025-12-15 10:33
Core Insights
- The industry is entering the era of Agentic AI, in which AI applications evolve from answering questions to executing tasks autonomously. This shift requires a new foundational infrastructure layer known as Agent Infra [1][2][3].
- Agent architectures are growing exponentially more complex and place higher demands on the underlying framework. Across technological eras, the operating system has been the crucial middle layer [1][3][18][22].
Group 1: Evolution of AI
- AI is moving from generative capabilities to Agentic AI. Products such as Doubao Phone, which can autonomously place orders and compare prices, illustrate the shift toward intelligent agents that automate tasks [8][12].
- A true intelligent agent must analyze, decide, and execute tasks autonomously, going beyond early-stage tools that merely enhance search or processing [11][13].
Group 2: Agent Infrastructure
- Agent Infra is likened to an operating system for the AI era: it manages model capabilities, tool invocation, and task execution, giving developers resource management and unified scheduling [23][24].
- The core component of Agent Infra is the Runtime, which addresses the adaptability and stability of agents across environments and ensures comprehensive scheduling of their different capabilities [24].
Group 3: PPIO's Role
- PPIO is building a complete AI cloud stack from the ground up, integrating distributed computing resources and creating a GPU inference cloud platform to support Agent Infra [26][28].
- The PPIO Agent Sandbox, designed for task execution, gives agents a secure and efficient cloud environment, supports dynamic tool invocation, and is built for high concurrency and rapid deployment [29][31].
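Xiaomi's Chief Voice Scientist: AI Development Is Essentially Like Biological Evolution, and Without Open Source It Would Be 1,000 Times Slower | MEET2026
量子位 (QbitAI) · 2025-12-15 08:05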
Core Insights
- The evolution of AI closely mirrors biological evolution: trial and error identifies better solutions for specific tasks [7][10].
- AI development is not linear. It follows a pattern of long stagnation punctuated by sudden leaps, similar to "punctuated equilibrium" in biology [7][25].
Group 1: AI Evolution and Open Source
- Open source is a crucial accelerator of AI evolution; without it, research could slow to one-thousandth of its current pace [3][34][35].
- Designing AI "recipes" means experimenting with different variants and publishing the ones that work, which others can then replicate [12][13].
- The time needed to replicate a new idea in AI, analogous to "generation time" in biology, has fallen from about two years to about six months [18][20].
Group 2: Strategies for Survival in AI Competition
- Large companies should adopt a dual strategy: exploit today's leading technologies while exploring unknown territory for the next disruptive opportunity [5][13][45].
- AI models need a balance between "generalists" and "specialists", because different evolutionary strategies suit different environments [44][45].
- Companies should preserve a diversity of model architectures to raise the odds of discovering practical new technologies [45][46].
Group 3: Future Directions and Innovations
- The AI field must keep exploring new ideas across many tasks, because breakthroughs can emerge from unexpected areas [39][42].
- The current focus on the Transformer is likened to a game of "musical chairs": companies must keep up with the prevailing trend while preparing for future shifts [46][47].
- Xiaomi is developing a new model architecture called Zapformer, which aims to improve speech recognition accuracy by 10%-15% and to improve general robustness [53][54][56].
Brin Admits Google Underestimated the Transformer: "And OpenAI Poached Ilya from Us"
量子位 (QbitAI) · 2025-12-15 08:05
Core Insights
- The article traces Google's journey from its founding to its current challenges in AI, highlighting mistakes made and opportunities missed, particularly in relation to OpenAI's rise [1][2][5][26].
Group 1: Google's History and Development
- Sergey Brin and Larry Page founded Google, which grew out of a project called BackRub that evolved into the Google search engine [10][16][19].
- The name "Google" reflects their ambition to organize vast amounts of information; it derives from "googol", the mathematical term for the very large number 10^100 [21].
- Google fostered a strong academic environment, attracted top talent, and focused on foundational research, which laid the groundwork for its later innovations in AI [22][25].
Group 2: AI Strategy and Mistakes
- After the release of the Transformer model, Google underestimated the potential of AI and failed to allocate sufficient resources, allowing OpenAI to seize the opportunity [26][29].
- Despite these setbacks, Google's long-term investment in AI research and development, including its specialized TPU chips, has helped it keep its technological edge [30][29].
Group 3: Future Directions and Recommendations
- Brin stresses the importance of using AI in many aspects of life and encourages students to pursue computer science, because coding skills remain crucial for building better AI [32][35].
- He suggests that quantum computing and materials science are undervalued future technologies that could have significant impact, particularly in combination with AI [37].
- Brin advises against commercializing ideas prematurely, citing Google Glass as an example of why a concept should be refined before it is brought to market [42][45].
量子位 (QbitAI) Is Hiring Editors and Writers
量子位 (QbitAI) · 2025-12-15 08:05
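By the Editorial Team, from Aofeisi
量子位 | WeChat official account QbitAI

The AI boom is still surging, but if you don't yet know how to take part... why not come to 量子位?

We are a content platform centered on tracking new developments in AI. After eight years of accumulation, we have top-tier influence, broad and well-recognized industry resources, and one of the best vantage points for observing and learning from the defining trend of the era.

We are currently hiring in three directions, and we hope you are (or can become) a content expert in one of them:
- AI industry: follows infrastructure-layer innovation, including chips, AI Infra, and cloud computing;
- AI finance: follows venture investment and earnings reports in the AI field and tracks capital flows along the industry chain;
- AI products: follows AI progress in applications and hardware devices.

All positions are full-time and based in Zhongguancun, Beijing. Positions at every level of ability are open; you are welcome to apply in line with your own background and experience.

Positions are open to:
- Experienced hires: editor, lead writer, and editor-in-chief levels, matched to ability;
- Campus hires: new graduates; internships are accepted and can convert to full-time roles.

Position details follow:
AI industry direction
Responsibilities: take part in core interviews, talk with industry experts and leading engineers, and write case studies on AI cloud deployments.
Requirements:
AI finance and business direction

By joining us, you can:
- Stand at the crest of the AI wave: be among the first to encounter and understand the latest AI technologies and products, and build a complete understanding of AI.
- Master new AI tools: all kinds of ...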
Minion Skills: An Open-Source Implementation of Claude Skills
量子位 (QbitAI) · 2025-12-15 08:05
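Contributed by the Minion Agent team
量子位 | WeChat official account QbitAI

Introduction

Claude recently launched an exciting feature: the Skills system. It lets an AI agent load specialized capabilities dynamically, "learning" on demand the skills needed to handle professional documents such as PDF, Excel, and PPT files.

As an open-source enthusiast, I immediately recognized the value of this design and built a complete open-source version in the Minion framework. This article introduces the design philosophy behind Skills and the details of my open-source implementation.

What problem do Skills solve?

Developing AI agents involves one core tension: the context window is finite, while the capabilities users may need are unbounded.

The traditional approach stuffs every tool and every instruction into the system prompt:

System Prompt = base instructions + all tool descriptions + all domain knowledge = 50K+ tokens = high latency + high cost + low efficiency

Worse, most of the time the user needs only a small part of those capabilities. When a user asks "help me process this PDF", the system has still loaded the context for Excel, databases, code, and every other capability.

The core idea of Skills

Minion's open-source implementation

After seeing the Skills design in Claude Code, I decided to implement in the Minion framework a ...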
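
The contrast between loading every capability into the system prompt and loading skills on demand can be sketched as follows. This is a hypothetical illustration only, not the Minion or Claude API: the `Skill` and `SkillRegistry` names, their fields, and the keyword-matching rule are all assumptions made for the example. In a real system the model itself would presumably decide which skill to load from the short descriptions; keyword matching stands in for that step here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Skill:
    """Hypothetical skill record: a cheap description plus costly full instructions."""
    name: str
    description: str   # short text that is always placed in the prompt
    instructions: str  # long text that is placed in the prompt only when needed
    keywords: tuple    # assumption: simple keyword trigger, a stand-in for model choice


class SkillRegistry:
    def __init__(self, base_prompt):
        self.base_prompt = base_prompt
        self.skills = {}

    def register(self, skill):
        self.skills[skill.name] = skill

    def eager_prompt(self):
        """Traditional approach: every skill's full instructions go into the prompt."""
        parts = [self.base_prompt] + [s.instructions for s in self.skills.values()]
        return "\n\n".join(parts)

    def lazy_prompt(self, user_request):
        """On-demand approach: list all descriptions, expand only the matching skills."""
        request = user_request.lower()
        index = [f"- {s.name}: {s.description}" for s in self.skills.values()]
        matched = [s for s in self.skills.values()
                   if any(k in request for k in s.keywords)]
        parts = [self.base_prompt, "Available skills:\n" + "\n".join(index)]
        parts += [s.instructions for s in matched]
        return "\n\n".join(parts)


registry = SkillRegistry("You are a helpful agent.")
registry.register(Skill("pdf", "Read and edit PDF files",
                        "PDF instructions. " * 200, ("pdf",)))
registry.register(Skill("excel", "Read and edit spreadsheets",
                        "Excel instructions. " * 200, ("excel", "xlsx")))

eager = registry.eager_prompt()
lazy = registry.lazy_prompt("Help me process this PDF")
print(f"eager prompt: {len(eager)} chars, lazy prompt: {len(lazy)} chars")
```

For the PDF request, the lazy prompt contains the one-line description of every skill but the full instructions for the PDF skill only, so its size grows with the skills actually used rather than with the number of skills installed.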
Kunlun Wanwei's Fang Han: The General-Purpose Agent Is a False Proposition, and AI Office Still Has Room to Exist | MEET2026
量子位 (QbitAI) · 2025-12-15 05:57
Core Viewpoint
- The current wave of AI agents marks a shift away from general artificial intelligence toward systems that automate verifiable processes, replicating established workflows rather than creating new paradigms [2][12][16].
Group 1: Evolution of AI Agents
- The move from models like ChatGPT to DeepSeek is a leap from retrieving answers to understanding and replicating processes, opening a new phase centered on process generalization [5][18].
- The essence of agents is not general AI but the automation of verifiable processes. They excel at structured decision-making and mathematical tasks but fall short on innovative breakthroughs [12][16].
Group 2: Market and Product Insights
- Kunlun Wanwei has developed Skywork Super Agents, comprising five specialized agents and one general agent. It can generate a 30-page PPT in five minutes, and 40% of daily active users use this feature [11][12].
- The company has a strong international presence: 93% of its revenue comes from overseas markets, which helps it serve diverse global demand for AI products and services [10].
Group 3: Challenges and Opportunities
- Deploying agents in industries such as healthcare and finance is hampered by a lack of high-quality process datasets, which effective application requires [21][24].
- Competition for distribution channels is critical in the agent market, because traditional software vendors may resist new agents that threaten their established ecosystems [26][27].
Group 4: Organizational Transformation
- The rise of agents will fundamentally reshape organizational structures: traditional roles will give way to process architects who design and maintain workflows, increasing efficiency [28][29].
- As repetitive tasks diminish, demand will grow for roles focused on process design and innovation, positioning employees as creators and maintainers of new processes [31].
Musk Goes All Out Promoting Space Data Centers: "The Energy Efficiency Is Far More Attractive Than on Earth"
量子位 (QbitAI) · 2025-12-15 05:57
Core Viewpoint
- The article discusses space data centers as an emerging frontier for AI infrastructure, promoted by key figures such as Elon Musk and backed by other tech giants including Amazon and Google [1][12].
Group 1: Space Data Centers and AI Infrastructure
- Space data centers have become a focal point of discussion in Silicon Valley and beyond, drawing significant interest from major tech leaders [2][12].
- Elon Musk is a prominent advocate: he has indicated that SpaceX plans to deploy data centers in space and has voiced support for Google's similar initiative [4][6].
- Musk argues that the energy available in space is vastly greater than on Earth and suggests that deploying AI systems in space could become the more cost-effective option within 4-5 years [8][27].
Group 2: Advantages of Space Data Centers
- Space offers abundant, stable energy: solar panels in orbit can supply continuous power without interruptions from weather or the day-night cycle [24].
- The article says cooling is more efficient in space because of the extreme cold, allowing heat to be dissipated without complex cooling systems [25].
- Launch costs are falling, with estimates suggesting they could drop to $100 per kilogram in the near future, which makes space data centers more feasible [30].
Group 3: Industry Response and Developments
- Major companies are actively pursuing space data center projects; Starcloud has successfully launched a satellite to train a language model in space [38].
- Google is working on "Project Suncatcher", which aims to build a constellation of solar-powered satellites equipped with its Tensor Processing Units (TPUs) [41][42].
- Jeff Bezos has also said that moving data centers to orbit is a rational approach, predicting that orbital data centers will beat terrestrial AI infrastructure on cost within 20 years [46].
Group 4: Future Prospects and Challenges
- Space data centers could ease the energy shortages projected for data centers on Earth, particularly in the U.S., where AI growth is expected to push electricity demand above supply [33][34].
- Building data centers in space could sidestep the regulatory and environmental constraints that terrestrial data centers face, offering a more agile and sustainable way to meet growing computational demand [36].
- The article concludes that players in China and abroad are recognizing the potential of space data centers, marking a significant shift in the AI infrastructure landscape [50][55].
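Soochow University's First Paper in One of the Four Top Mathematics Journals Settles a Diophantine Approximation Problem Open for 40 Years
量子位 (QbitAI) · 2025-12-15 04:04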
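By Wen Le, from Aofeisi
量子位 | WeChat official account QbitAI

Chinese scholars have produced another paper in one of the four top mathematics journals, and it is the first such paper from Soochow University.

The paper, "Khintchine dichotomy for self-similar measures", has been accepted by the Journal of the American Mathematical Society.

The author is Zhang Han, an associate professor at Soochow University. His co-authors are Timothée Bénard, a researcher at the French National Centre for Scientific Research (CNRS) and Université Sorbonne Paris Nord (LAGA), and He Weikun, an associate researcher at the Academy of Mathematics and Systems Science, Chinese Academy of Sciences.

Annals of Mathematics, Acta Mathematica, Inventiones Mathematicae, and the Journal of the American Mathematical Society are together known as the four top mathematics journals and are recognized as such by the international mathematics community. In a typical year, Chinese research institutions often have no more than 10 papers accepted.

The breakthrough extends Khintchine's theorem, which describes how well rational numbers approximate real numbers, to all self-similar measures.

Let's look at how the extension works.

For example, approximating π by 22/7 gives an error below 0.0015, and approximating it by 355/113 shrinks the error to about three ten-millionths.

Khintchine's theorem in number theory quantifies, in mathematical terms, how possible and how efficient such approximation is. It gives a clear ...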
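
The two approximation errors quoted above are easy to check numerically. The short sketch below is not from the article; the helper name `approximation_error` is chosen for illustration. It also prints the error scaled by q², a common way to judge how good a fraction p/q is relative to the size of its denominator.

```python
import math


def approximation_error(p, q, x=math.pi):
    """Absolute error when the real number x is approximated by the fraction p/q."""
    return abs(x - p / q)


errors = {}
for p, q in [(22, 7), (355, 113)]:
    err = approximation_error(p, q)
    errors[(p, q)] = err
    # q*q*err normalizes the error by the denominator size.
    print(f"{p}/{q}: error = {err:.3e}, q^2 * error = {q * q * err:.4f}")
```

Both fractions satisfy |π − p/q| < 1/q² by a wide margin, which is the kind of inequality that Diophantine approximation studies. Khintchine's classical theorem concerns inequalities of the form |x − p/q| < ψ(q)/q and asks for which approximation functions ψ they have infinitely many solutions for almost every x.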
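Three Undergraduates Lead a Paper from Kaiming He's Group: Staying Focused on Flow Models, Breaking the Generation Efficiency Bottleneck of Normalizing Flows
量子位 (QbitAI) · 2025-12-15 04:04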
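By Yu Yang, from Aofeisi
量子位 | WeChat official account QbitAI

New work from Kaiming He's team continues the group's focus on Flow models. Unlike MeanFlow, which optimized flow matching, this work mainly aims to address the limitations of normalizing flows as generative models.

The paper proposes a new framework called Bidirectional Normalizing Flow (BiFlow). By decoupling the forward process, which maps data to noise, from the reverse process, which turns noise back into images, it overcomes the low efficiency of traditional normalizing-flow generative models.

Notably, the paper's three first authors are undergraduates from Tsinghua's Yao Class and MIT.

BiFlow: the reverse process need not be the exact inverse of the forward process

Normalizing flows (NFs) have become a principled framework for generative modeling.

A standard normalizing flow consists of a forward process and a reverse process: the forward process maps data to noise, and the reverse process generates samples by inverting the forward process.

Traditional NF models have a hard rule: the reverse process must be the exact inverse of the forward process, matching like a key and a lock. This leads to two problems:

- Constrained model design: to guarantee invertibility, many powerful general-purpose architectures (such as the Vision Transformer) cannot be used, and one has to specially ...

BiFlow's core innovation is to break the rule that "the reverse process must be the exact inverse of the forward process".

The design idea is as follows: BiFlow decouples the design of the forward process from that of the reverse process.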
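
The "exact inverse" rule of a standard normalizing flow can be illustrated with a toy affine coupling layer. This is a generic textbook-style construction, not BiFlow and not the paper's code; the function names and the particular scale and shift functions are assumptions chosen for the example.

```python
import math


def scale_and_shift(x1):
    """Toy conditioner: returns a log-scale s and a shift t computed from x1.

    It can be any function of x1, because it is only evaluated, never inverted.
    """
    return math.tanh(x1), 0.5 * x1


def forward(x1, x2):
    """Forward process (data -> latent): x1 passes through, x2 is affinely transformed."""
    s, t = scale_and_shift(x1)
    return x1, x2 * math.exp(s) + t


def inverse(z1, z2):
    """Reverse process: the exact algebraic inverse of `forward`."""
    s, t = scale_and_shift(z1)  # z1 equals x1, so s and t are recovered exactly
    return z1, (z2 - t) * math.exp(-s)


x = (0.3, -1.2)
z = forward(*x)
x_rec = inverse(*z)
print("latent:", z)
print("reconstructed:", x_rec)
```

The round trip recovers the input up to floating-point error, but only because the layer was built so that its inverse exists in closed form. That structural requirement is the design constraint described above; per the article, BiFlow drops the requirement that the reverse model be this exact inverse.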
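After Quietly Topping the World's Hardest SQL Leaderboard for Over Two Months, This Chinese AI Now Goes Open Source with Fanfare!
量子位 (QbitAI) · 2025-12-14 07:12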
Core Viewpoint
- Ant Group's AI division, Ant Financial Technology, has made significant strides in AI data analysis, recently taking top rankings in global SQL benchmarks and announcing the open-source release of its Agentar-SQL series, which includes comprehensive frameworks for real-time text-to-SQL conversion and other data capabilities [2][4][5].
Group 1: Achievements and Innovations
- Ant Group's Agentar-Scale-SQL ranked first on two measures of the BIRD benchmark, with an execution accuracy of 81.67% and an execution efficiency of 77% [5].
- In a trial with a major city commercial bank, the average query accuracy of Ant Group's Agentar SQL tools exceeded 92%, more than a threefold improvement over traditional query methods [7].
- Ant Group's AI solutions have been adopted by 100% of state-owned commercial banks and over 60% of local commercial banks in China, indicating a strong market presence [18].
Group 2: Strategic Focus and Market Approach
- Ant Group's CEO emphasized that the true value of AI lies in addressing real-world industry challenges rather than in technological advancement alone [9].
- The company has adopted a "pay-for-performance" model that lowers the barrier for small and medium-sized institutions to adopt AI by letting them pay based on tangible business outcomes [42][43].
- Ant Group has established deep partnerships with 300 collaborators, serves over 13,000 end customers, and has upgraded its "Xinglan Plan" to strengthen partner capabilities across several dimensions [45][47].
Group 3: Broader Applications and Future Directions
- The AI methodologies developed in the financial sector are being adapted for broader use, for example in public transportation and energy, showing the versatility of Ant Group's AI capabilities [27][30][37].
- Ant Group's AI solutions have gained international recognition: they serve over a hundred overseas financial institutions and were selected for the Hong Kong Monetary Authority's generative AI sandbox project [48][49].
- The company is positioned as a leader in the AI industry, with its technology recognized for robustness and applicability in sectors beyond finance [20].