Workflow
Retrieval Augmented Generation (RAG)
icon
Search documents
金山办公朱熠锷:从“看到”到“掌握”:AI应用进入“知识增强生成”时代|Alpha峰会
Hua Er Jie Jian Wen· 2025-12-25 08:53
Core Insights - The core challenge of AI applications has shifted from model capability competition to effectively utilizing enterprise private data [3] - The key to AI application value lies in transforming complex, unstructured document data into high-quality, model-understandable knowledge assets [3][4] - The future path involves developing Knowledge Augmented Generation (KAG), which requires systematic governance, modeling, and application of knowledge [3][4] Group 1: Transition from RAG to KAG - AI applications are transitioning from a model-centric approach to a data-centric approach, with data quality being crucial for AI effectiveness [4][5] - The traditional Retrieval-Augmented Generation (RAG) faces limitations as "documents do not equal knowledge" and "semantic similarity does not equal logical relevance" [4][6] - KAG represents a paradigm shift that emphasizes high-quality input through knowledge governance and the integration of multi-modal, structured knowledge assets [7] Group 2: Implementation of KAG - KAG architecture includes a dual-layer structure: a knowledge governance layer for document parsing and knowledge extraction, and a knowledge application layer for integrating various knowledge sources [7][8] - The KAG framework is applied in four key scenarios: knowledge governance, professional intelligent Q&A, intelligent extraction of complex documents, and specialized intelligent writing [8][9] - The intelligent writing process involves two agents working together to generate compliant and accurate professional reports, significantly reducing writing time [9] Group 3: Knowledge Management - Companies must manage knowledge as they do data, establishing a dual lake architecture of "data lake" and "knowledge lake" to transition from digitalization to intelligentization [10] - Effective knowledge management will be the cornerstone for AI to deliver real efficiency improvements in professional fields [10]
KIOXIA AiSAQ Technology Integrated into Milvus Vector Database
Businesswire· 2025-12-17 02:05
Core Insights - Kioxia America, Inc. has integrated its AiSAQ technology into Milvus, enhancing the capabilities of one of the most widely adopted open-source vector databases for AI applications [1][2][3] Group 1: Technology Integration - The integration of Kioxia's AiSAQ technology into Milvus (starting with version 2.6.4) provides developers and enterprises with a cost-effective solution for scaling AI applications without high memory costs [1][2] - AiSAQ technology significantly reduces DRAM requirements while enabling high-quality vector search, making large-scale Retrieval Augmented Generation (RAG) deployments more accessible and affordable [2][3] Group 2: Market Demand and Challenges - As organizations adopt larger AI models and develop complex RAG pipelines, the demand for vector databases is increasing, with DRAM cost becoming a major barrier to growth [2][3] - Milvus now supports SSD-optimized vector indexing due to the integration of AiSAQ, which dramatically reduces memory usage while maintaining high-quality search performance [3][4] Group 3: Future Developments - Kioxia's AiSAQ technology is designed to enhance vector scalability by storing all RAG database elements on SSDs, with tuning options available to prioritize performance or scalability [4][5] - The ongoing development of AiSAQ aims to support trillion-vector scale, further advancing the capabilities of AI applications [4][5]
挖掘“非结构化”数据价值的5种方法
3 6 Ke· 2025-12-09 04:06
Core Insights - The future of data management is shifting towards integrating unstructured data with structured data, emphasizing the need for advanced data platforms that can handle both types effectively [1][15]. Group 1: Unstructured Data Challenges - By 2025, CEOs will prioritize insights from unstructured data, such as vendor contracts in PDF format, over traditional structured data queries [3]. - The current disconnect in data management stems from the lack of efficient connections between vector databases and relational databases, complicating the retrieval of specific information from unstructured sources [4]. - The processing of unstructured data is costly, with estimates suggesting that handling 1 PB of unstructured text for retrieval-augmented generation (RAG) could incur API costs up to $150,000 if not optimized [6]. Group 2: Solutions and Recommendations - Experts recommend building a model routing system that utilizes smaller language models for basic extraction tasks, reserving more complex models for intricate reasoning tasks [6]. - Investment in better data ingestion layers is crucial, as improved parsers yield a return on investment ten times greater than enhancements in language learning models [9]. - The importance of metadata is highlighted, as successful data teams will embed structured attributes into unstructured data before it enters vector storage [10]. Group 3: Evolution of Data Products - Documents are evolving from mere data blocks to data products, with a focus on extracting actionable insights from contracts and other unstructured formats [12]. - The emergence of a "universal data lake" is anticipated, where various data types coexist and are managed under a single directory, enhancing accessibility and usability [12]. - Companies are advised to audit their data directories to ensure that search results yield relevant data formats, indicating the effectiveness of their data management systems [13].
Coveo Introduces RAG-as-a-Service for AWS Agentic AI Services
Prnewswire· 2025-12-01 13:05
Core Insights - Coveo has launched Retrieval Augmented Generation (RAG)-as-a-Service for AWS agentic AI services, enhancing enterprise generative AI with precision, security, and scalability [1][3] - The new offering integrates with AWS services like Amazon Bedrock AgentCore and Amazon Quick Suite, allowing organizations to leverage their knowledge bases effectively [2][3] Product Features - The RAG-as-a-Service is delivered through a fully managed Coveo-hosted MCP Server, providing configurable tools for enterprises [3] - Key functionalities include Passage Retrieval for relevant knowledge, Answer generation from organizational data, Search for ranked results, and Fetch for complete document text [7] Market Positioning - Coveo emphasizes the importance of relevance in AI applications, stating that the enterprise value of large language models (LLMs) relies on their ability to provide contextually grounded responses [3][4] - The company aims to set a competitive standard by tailoring experiences to individual user needs and preferences through advanced AI capabilities [4]
X @Avi Chawla
Avi Chawla· 2025-08-14 06:34
Core Idea - Contextualized chunk embedding models, such as voyage-context-3, process entire documents to embed chunks, leading to document-aware embeddings [1] - This approach enables semantically aware retrieval in Retrieval-Augmented Generation (RAG) systems [1] Technology - Voyage-context-3 is highlighted as an example of a contextualized chunk embedding model [1] - The method contrasts with producing independent chunk embeddings [1] Collaboration - The MongoDB team is acknowledged for their collaboration on this topic [1]
Scaling Enterprise-Grade RAG: Lessons from Legal Frontier - Calvin Qi (Harvey), Chang She (Lance)
AI Engineer· 2025-07-29 16:00
[Music] All right. Uh, thank you everyone. We're excited for to be here and thank you for uh, coming to our talk.Uh, my name is Chong. I'm the CEO and co-founder of LANCB. I've been making data tools for machine learning and data science for about 20 years.I was one of the co-authors of pandas library and I'm working on LANCB today for all of that data that doesn't fit neatly into those pandas data frames. And I'm Calvin. I lead one of the teams at Harvey Aai working on rag um tough rag problems across mass ...
Elastic(ESTC) - 2025 Q4 - Earnings Call Transcript
2025-05-29 22:02
Financial Data and Key Metrics Changes - Total revenue in Q4 was $388 million, growing 16% year-over-year on an as-reported and constant currency basis [30] - Subscription revenue in Q4 totaled $362 million, also growing 16% as reported and 17% in constant currency [30] - Elastic Cloud revenue grew 23% on an as-reported and constant currency basis [30] - Non-GAAP operating margin for Q4 was 15%, with a gross margin of 77% [35][36] - Adjusted free cash flow margin improved by approximately 600 basis points to end the year at 19% [36] Business Line Data and Key Metrics Changes - The number of customers with over $1 million in annual contract value grew approximately 27%, adding about 45 net new customers [34] - Customers with over $100,000 in annual contract value grew approximately 14%, adding about 180 net new customers [34] - Subscription revenue excluding Monthly Cloud was $315 million, growing 19% in Q4 [32] Market Data and Key Metrics Changes - Strong growth was observed in the APJ region, followed by EMEA and The Americas, while some pressure was noted in the U.S. Public sector [34] - Over 2,000 Elastic Cloud customers are using Elastic for Gen AI use cases, with over 30% of these customers spending $100,000 or more annually [12] Company Strategy and Development Direction - The company is focusing on leveraging AI to automate business processes and drive innovation, positioning itself as a strategic partner for enterprises [11][18] - Elastic aims to strengthen its position as the preferred vector database, enhancing its offerings with new technologies like better binary quantization [13][19] - The company is committed to maintaining a balance between growth and profitability while continuing to innovate and expand its product offerings [40][43] Management's Comments on Operating Environment and Future Outlook - Management acknowledged potential uncertainty in the macro environment but expressed confidence in the healthy pipeline and demand signals [39] - The company expects continued growth and strong margins in FY 2026, projecting total revenue in the range of $1.655 billion to $1.670 billion [42] Other Important Information - Elastic Cloud now accounts for over 50% of subscription revenue, with strong growth in cloud adoption [18] - The company announced a strategic collaboration agreement with AWS to enhance solution integrations and accelerate AI innovation [25] Q&A Session Summary Question: Guidance and Metrics - Inquiry about the conservativeness of guidance and leading indicators of business performance [45] - Response highlighted the balance of positive demand signals with macro uncertainty, emphasizing the importance of CRPO and subscription revenue metrics [46][49] Question: Partnerships and Market Opportunities - Question regarding the impact of recent partnerships, particularly with AWS and NVIDIA, on market opportunities [53] - Management noted the growing acceptance of Elastic as a leading vector database and the importance of partnerships for driving cloud adoption [54] Question: Retrieval Augmented Generation (RAG) - Inquiry about the durability of RAG architectures and Elastic's positioning [59] - Management affirmed the critical role of retrieval in enterprise data management and the growing adoption of their vector database for RAG use cases [60][61] Question: Cloud Performance and Consumption Hesitation - Question about the sequential growth in cloud performance and the impact of the leap year [62] - Management clarified that the leap year and fewer days in Q4 affected consumption rates, but normalized growth rates remained strong [64][66] Question: Go-to-Market Strategy and Changes - Inquiry about the effectiveness of go-to-market changes made in the previous fiscal year [69] - Management confirmed that the changes have settled and are yielding positive results, with plans to continue hiring sales capacity [70][72] Question: AI Commitments and Emerging Use Cases - Question about the $1 million AI commitments and emerging use cases [93] - Management clarified that 25% of $1 million customers are using Elastic for AI workloads, with a variety of sophisticated use cases emerging across industries [94][96]