Milvus
Search documents
2026,进入AI记忆元年
3 6 Ke· 2026-01-27 10:28
前不久,LMArena.ai对全球大模型的市场地位变化做了统计后,得到了一个有意思的发现: 自2023年年中起,SOTA模型的迭代周期被快速压缩至35天,曾经的SOTA模型,只要短短5个月就可能跌出Top5,7个月后连Top10的门槛都摸不到。 但SOTA不断更新的背后,模型的确在进步,但曾经ChatGPT、Deepseek这样让人眼前一亮的新产品却越来越少,技术进步已经进入了不断小修小补却始 终难以突破的瓶颈期。 与逐渐偃旗息鼓的模型进化形成鲜明对比的,是过去两年多围绕AI记忆形成的你方唱罢我登场的热闹。 其中,最先一步出发的,是2023年先后涌现出的诸如Milvus、Pinecone、faiss为代表的向量数据库产品。 此后一年,建立在成熟的语义、知识图库以及关键词检索基础上,2024—2025年期间,Letta(MemGPT)、Mem0、MemU、MemOS为代表的各种AI记忆 框架,如雨后春笋般冒出,GitHub上各种Mem"X"系产品,多到可以组成连连看。 热闹很快传导至模型玩家阵营,一周前,Claude被爆要在Cowork中为模型增加记忆能力引发的讨论尚未消退,谷歌又紧随其后,宣布了最新的Nest ...
计算机行业点评:AI投资从硬到软拐点:再谈谁是中国MongoDB
Minsheng Securities· 2025-12-30 05:47
Investment Rating - The report maintains a "Recommended" rating for the industry [1] Core Insights - The report highlights a significant performance improvement in the ArgoDB solution based on the Haiguang 7000 series processor, achieving a 62% enhancement over the previous world record in the TPC-DS benchmark [4] - The trend of using low-cost storage to replace expensive memory in vector databases is identified as a major technological shift [4] - The domestic vector database market is transitioning from technical exploration to large-scale implementation, with positive signals of million-level purchases in key sectors like finance and energy [4] Summary by Sections - **Performance Records**: The ArgoDB solution has set new performance records in both TPC-DS and TPC-C benchmarks, indicating strong technological advancements [4] - **Cost Structure**: The report outlines two billing models for vector databases: annual subscription and pay-as-you-go, with specific pricing examples provided for different regions [4] - **Investment Opportunities**: The report suggests focusing on companies such as StarRing Technology, Dameng Data, Haima Data, Taiji Co., SuperMap Software, and Torus [4]
KIOXIA AiSAQ™ Technology Integrated into Milvus Vector Database
Businesswire· 2025-12-17 02:51
Core Insights - Kioxia Corporation has integrated its KIOXIA AiSAQ™ into the open-source vector database Milvus starting with version 2.6.4 [1] Company Summary - Kioxia Corporation is enhancing its product offerings by integrating KIOXIA AiSAQ™ into Milvus, which is a significant step in expanding its capabilities in the database sector [1]
KIOXIA AiSAQ Technology Integrated into Milvus Vector Database
Businesswire· 2025-12-17 02:05
Core Insights - Kioxia America, Inc. has integrated its AiSAQ technology into Milvus, enhancing the capabilities of one of the most widely adopted open-source vector databases for AI applications [1][2][3] Group 1: Technology Integration - The integration of Kioxia's AiSAQ technology into Milvus (starting with version 2.6.4) provides developers and enterprises with a cost-effective solution for scaling AI applications without high memory costs [1][2] - AiSAQ technology significantly reduces DRAM requirements while enabling high-quality vector search, making large-scale Retrieval Augmented Generation (RAG) deployments more accessible and affordable [2][3] Group 2: Market Demand and Challenges - As organizations adopt larger AI models and develop complex RAG pipelines, the demand for vector databases is increasing, with DRAM cost becoming a major barrier to growth [2][3] - Milvus now supports SSD-optimized vector indexing due to the integration of AiSAQ, which dramatically reduces memory usage while maintaining high-quality search performance [3][4] Group 3: Future Developments - Kioxia's AiSAQ technology is designed to enhance vector scalability by storing all RAG database elements on SSDs, with tuning options available to prioritize performance or scalability [4][5] - The ongoing development of AiSAQ aims to support trillion-vector scale, further advancing the capabilities of AI applications [4][5]
Agentic AI时代,向量数据库成“必选项”
Tai Mei Ti A P P· 2025-12-05 05:18
Core Insights - The emergence of Agentic AI is driving a fundamental shift in the database industry, necessitating the transition from traditional databases to vector databases as essential infrastructure for AI applications [1][3][9]. Group 1: Market Trends and Predictions - Gartner predicts that the market for Agentic AI will exceed $100 billion by 2025, with a compound annual growth rate (CAGR) of over 65% [2]. - By 2028, spending on databases supporting generative AI is expected to reach $218 billion, accounting for 74% of the market [3]. Group 2: Requirements for Vector Databases - Agentic AI introduces four rigid demands on underlying databases: enhanced read/write performance, personalized data storage, cost-performance balance, and multi-modal processing capabilities [5][6][7]. - The frequency of read/write operations in Agentic AI applications is significantly higher than in traditional retrieval-augmented generation (RAG) scenarios, necessitating optimizations in database performance [5]. Group 3: Role of Vector Databases - Vector databases are becoming the core component for Agentic AI, providing efficient retrieval of semantic similarities and managing vast amounts of unstructured data [9][10]. - The core value of vector databases lies in their ability to store and retrieve high-dimensional vectors generated from unstructured data, which is essential for the functioning of Agentic AI [9]. Group 4: Zilliz's Position and Offerings - Zilliz, as a pioneer in vector databases, has developed Milvus, an open-source vector database that supports high-performance read/write operations and can handle billions of vectors [3][12]. - Zilliz Cloud, the commercial version of Milvus, offers a seamless transition for users from the open-source version, allowing for flexible deployment and scalability [12][15]. Group 5: Collaboration with Cloud Providers - Zilliz has partnered with Amazon Web Services (AWS) to leverage cloud-native architecture, enhancing the performance and scalability of its vector database solutions [13][14]. - The collaboration with AWS allows Zilliz to provide optimized solutions that meet diverse user needs, contributing to a significant market presence in the cloud database sector [15][16].
模力工场 020 周 AI 应用榜:灵臂 Lybic 登顶榜首,榜单聚光“Agent 原生工作基建”!
AI前线· 2025-11-19 07:00
Core Insights - The article emphasizes the importance of AI infrastructure (AI Infra) as a comprehensive set of tools necessary for the effective deployment and scaling of AI applications, rather than a single technology [2] - The article highlights the launch of 49 AI Infra tools by the company, encouraging users to explore and contribute to the platform [2] - The article discusses the recent AI Open Source Ecology Conference in Hangzhou, where the company showcased its applications and facilitated discussions among industry experts [2] AI Applications Overview - The 20th weekly AI application ranking showcases developers making strides in integrating AI into real-world business processes, with applications like Lybic enabling agents to understand and interact with graphical user interfaces [6][7] - The top three applications in the ranking demonstrate a complete link from interface operation to algorithm execution and data insights, indicating a trend towards more integrated AI solutions [6][7] - The article identifies key applications such as Lybic, TDgpt, and AskTable, which collectively enhance the capabilities of AI agents in various operational contexts [6][7] Application Features and Developer Insights - Lybic is designed to provide a graphical interface for AI agents, allowing them to understand and operate within various software environments without traditional API or scripting limitations [10][12] - The development team of Lybic emphasizes the need for AI to operate in a real-world environment, addressing the limitations of traditional automation methods [12][13] - Future development for Lybic will focus on stability and reliability, ensuring that AI can effectively handle repetitive tasks and complex workflows [16][17] Trends and Future Directions - The article notes a shift in focus from what large models can do to how they can be effectively integrated into real-world applications, with a clear emphasis on operational efficiency [7][24] - The company aims to establish Lybic as a standard execution layer for AI agents, facilitating seamless integration across various platforms and enhancing task execution capabilities [18][24] - The overarching theme is the transformation of work infrastructure to accommodate AI agents as primary collaborators in business processes, reshaping how tasks are performed [24]
为什么 Claude Code 放弃代码索引,使用 50 年前的 grep 技术?
程序员的那些事· 2025-09-25 02:53
Group 1 - The article discusses the seemingly counterintuitive choice of Claude Code to use a grep-only approach instead of vector indexing, which has sparked debate among developers [3][5]. - Critics argue that this decision represents a technological regression, while supporters highlight its alignment with Unix philosophy and the redefinition of what constitutes a good tool [3][5]. - Claude Code's approach emphasizes real-time search without maintaining a persistent code index, which has been shown to outperform other methods in performance tests [5][49]. Group 2 - The essence of state is explored, distinguishing between stateful and stateless systems, with examples illustrating the impact of state on system design [9][10]. - Historical context is provided, tracing the origins of stateless design from mathematical functions to the Unix pipeline philosophy, which emphasizes simplicity and composability [11][14]. - The advantages of stateless design include composability, natural parallelism, simplicity, and testability, making it a preferred choice in modern computing [30][34][36]. Group 3 - The article discusses scenarios where state is necessary, such as in gaming, user interfaces, and resource management, emphasizing the importance of context in design choices [41][47]. - A mixed strategy is suggested, where stateless computation is combined with stateful storage, allowing for flexibility and efficiency in system architecture [43][46]. - The core insight is that the choice between stateless and stateful design is not a matter of technical belief but an engineering trade-off, focusing on managing necessary state wisely [47]. Group 4 - In the AI era, Claude Code's choice reflects a shift in understanding intelligence, prioritizing predictability and behavior over mere functionality [54]. - The article concludes that simple tools endure, and the design that embraces "forgetfulness" offers greater freedom and adaptability in a rapidly evolving technological landscape [55].
X @Avi Chawla
Avi Chawla· 2025-09-11 06:33
AI Infrastructure Tools - Tensorlake enables transformation of unstructured documents into AI-ready data [1] - Zep facilitates building human-like memory for Agents [1] - Firecrawl empowers LLM applications with clean web data [1] - Milvus provides a high-performance vector DB for scalable vector search [1]
对话Zilliz产品负责人郭人通:向量数据库将成为承接AI上下半场的“桥梁”
Zhong Guo Jing Ying Bao· 2025-04-24 07:48
随着近年来企业数字化转型的深入,海量非结构化数据的处理与价值挖掘成为企业竞争的关键。据 Gartner测算,从2019年到2024年,包括各类文本、图片、视频、音频在内的非结构化数据容量增加了2 倍。企业花费大量成本长期存放这些数据,却常未能带来满意的附加价值。 而在生成式AI出现后,企业数据的灵活管理与价值释放,正在进一步变得便捷。如何借助AI将其转化 为可落地的应用,也成为企业能否赢得AI时代主动权的关键命题。 在近日举办的亚马逊云科技出海大会上,作为开源向量数据库Milvus的缔造者的Zilliz合伙人与产品负责 人郭人通向《中国经营报》记者表示,Zilliz正在通过亚马逊云科技提供的全球基础设施和生成式AI能 力,为各类企业构建多样化、高可用、合规且可扩展的向量数据库解决方案,助力企业高效应对AI时 代的挑战。 "如果把人工智能的发展分为上、下半场,上半场主要是利用大规模数据训练AI能力;而下半场,则是 AI能力反过来深入行业,产生海量关键数据。企业更需关注如何挖掘数据价值,实现AI应用快速落 地。"郭人通表示。 构建AI数据新基建:向量数据库的全球化演进 在郭人通看来,承接上、下半场趋势的"桥梁", ...