RAG

OpenAI releases o3-pro; perhaps current RAG is now outdated
Hu Xiu· 2025-06-16 06:33
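A couple of days ago, OpenAI released o3-pro, billed as its strongest reasoning model yet, pushing reasoning capability to a new high. Many people shrugged at the "strongest reasoning" claim, but the news that followed was more exciting.

Alongside the o3-pro launch, OpenAI made a surprising decision: it cut the price of o3 by 80%, bringing it down to roughly the level of GPT-4o. Specifically:
1. Before the change: USD 10 per million input tokens, USD 40 per million output tokens.
2. After the change: about USD 2 per million input tokens, about USD 8 per million output tokens.

This is still on the expensive side compared with DeepSeek, but it is a substantial price cut. For readers without a feel for the numbers: a 10,000-character prompt used to cost 0.72 yuan and now costs only 0.144 yuan.

In addition, o3-pro has a 200k context window and a maximum output of 100k tokens, which means a prompt of at least roughly 150,000 characters can be supplied. To put 150,000 characters in perspective: that is a short novel, enough to keep you reading all evening.

Both the lower pricing and the larger context help with the memory problem in Agent architectures. Put plainly, RAG now has a longer prompt context to work with and can be used in more elaborate ways. Since RAG is a technique involved in 80% of AI applications, today we briefly introduce several ways to use it. AI applications are very ...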
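The per-prompt cost quoted above can be reproduced with a short sketch. The article does not state how it converts characters to tokens or dollars to yuan; the figures work out if one character is counted as one token and the exchange rate is about 7.2 yuan per US dollar. Both constants, and the helper names, are assumptions inferred from the quoted numbers rather than values given in the source.

```python
# Assumptions inferred from the article's figures, not stated in it:
TOKENS_PER_CHAR = 1.0   # assume 1 Chinese character is about 1 token
CNY_PER_USD = 7.2       # assumed exchange rate


def prompt_cost_cny(chars: int, usd_per_million_tokens: float) -> float:
    """Cost in yuan of sending `chars` characters as input tokens."""
    tokens = chars * TOKENS_PER_CHAR
    usd = tokens / 1_000_000 * usd_per_million_tokens
    return usd * CNY_PER_USD


def price_cut(old_price: float, new_price: float) -> float:
    """Fractional reduction from old_price to new_price."""
    return 1 - new_price / old_price


if __name__ == "__main__":
    # Input pricing quoted in the article: USD 10 before, about USD 2 after.
    print(f"before: {prompt_cost_cny(10_000, 10.0):.3f} yuan")  # before: 0.720 yuan
    print(f"after:  {prompt_cost_cny(10_000, 2.0):.3f} yuan")   # after:  0.144 yuan
    print(f"input cut:  {price_cut(10.0, 2.0):.0%}")            # input cut:  80%
    print(f"output cut: {price_cut(40.0, 8.0):.0%}")            # output cut: 80%
```

Real token counts depend on the tokenizer and on the text itself, so these numbers are best read as an order-of-magnitude check on the article's claim.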
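In Depth | Andrew Ng: Voice is a more natural, more lightweight input method, especially suited to Agentic applications; the most critical skill of the future is being able to tell a computer exactly what you want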
Z Potentials· 2025-06-16 03:11
Core Insights
- The discussion at the LangChain Agent Conference highlighted the evolution of Agentic systems and the importance of focusing on the degree of Agentic capability rather than simply categorizing systems as "Agents" [2][3][4]
- Andrew Ng emphasized the need for practical skills in breaking down complex processes into manageable tasks and establishing effective evaluation systems for AI systems [8][10][12]

Group 1: Agentic Systems
- The conversation shifted from whether a system qualifies as an "Agent" to the spectrum of Agentic capabilities, suggesting that all such systems can be called Agentic and differ only in their level of autonomy [4][5]
- There is a significant opportunity in automating simple, linear processes within enterprises, as many workflows remain manual and under-automated [6][7]

Group 2: Skills for Building Agents
- Key skills for building Agents include the ability to integrate various tools like LangGraph and establish a comprehensive data flow and evaluation system [8][9]
- The importance of a structured evaluation process was highlighted, as many teams still rely on manual assessments, which can lead to inefficiencies [10][11]

Group 3: Emerging Technologies
- MCP (Model Context Protocol) is seen as a transformative standard that simplifies the integration of Agents with various data sources, aiming to reduce the complexity of data pipelines [21][22]
- Voice technology is identified as an underutilized component with significant potential, particularly in enterprise applications, where it can lower user interaction barriers [15][19]

Group 4: Future of AI Programming
- The concept of "Vibe Coding" reflects a shift in programming practices in which developers increasingly rely on AI assistants, which makes a solid understanding of programming fundamentals all the more necessary [23][24]
- AI Fund aims to accelerate startup growth, treating speed and deep technical knowledge as the key success factors [26]