向量数据库
Search documents
联想申请数据处理方法专利,支持向量数据跨数据库存储
Jin Rong Jie· 2026-01-09 06:53
声明:市场有风险,投资需谨慎。本文为AI基于第三方数据生成,仅供参考,不构成个人投资建议。 本文源自:市场资讯 国家知识产权局信息显示,联想(北京)有限公司申请一项名为"数据处理方法和电子设备"的专利,公 开号CN121301402A,申请日期为2025年9月。 专利摘要显示,本申请公开了一种数据处理方法和电子设备,电子设备将第一执行数据挂载至数据写入 操作的触发区;第一执行数据为用于将写入的数据转换为向量的执行逻辑数据;响应于数据写入操作, 在将数据写入存储器的过程中,触发第一执行数据的执行,得到数据对应的向量;通过预设存储接口将 向量存储至多个向量数据库中的任意一个;预设存储接口为基于驱动层实现的、用于适配多个向量数据 库的接口。 天眼查资料显示,联想(北京)有限公司,成立于1992年,位于北京市,是一家以从事计算机、通信和 其他电子设备制造业为主的企业。企业注册资本565000万港元。通过天眼查大数据分析,联想(北京) 有限公司共对外投资了107家企业,参与招投标项目5000次,财产线索方面有商标信息1747条,专利信 息5000条,此外企业还拥有行政许可238个。 作者:情报员 ...
英伟达CES发布了什么-星环科技为何受益
2026-01-07 03:05
英伟达 CES 发布了什么?星环科技为何受益 20260106 摘要 英伟达通过 PU 和 SSD 优化,旨在提升 GPU 计算效率,特别是在线学 习等新模型架构的数据处理能力,通过低成本 SSD 替换 DRAM,实现 更高效的数据存储。 英伟达新架构通过提升内存使用效率,打破内存墙,增加热数据需求, 显著提升向量数据库的数据流量,尤其是在 H200 芯片大量采用的情况 下,业务增量可能达数百亿级别。 星环科技作为国内领先的独立第三方向量数据库厂商,受益于英伟达新 技术和按流量计费模式,有望充分利用存算一体化带来的流量增长,实 现业务空间成百倍放大。 向量数据库与传统数据库的主要区别在于按流量计费而非按节点收费, 更适合实时训练和在线学习等应用场景,其商业模式因灵活性和经济性 更具吸引力。 英伟达收购 Groq 并采用 Atrium 方式优化 HBM 交互层,使得未来模型 架构中的固定权重更新更加高效,并促进 SSD 与 HBM 之间的数据传输 速度,大幅提高系统性能。 向量数据库与传统数据库最大的不同在于其按流量计费,而非按节点收费。传 统数据库主要管理冷数据,确保数据不丢失、不变形,并且需要持续付费。而 ...
广发证券:AI推理RAG向量数据库推动SSD需求增长 建议关注产业链核心受益标的
智通财经网· 2025-12-31 01:39
智通财经APP获悉,广发证券发布研报称,RAG架构为大模型提供长期记忆,企业和个性化需求推动了 对RAG存储需求的增长。AI推理中的RAG向量数据库存储介质正在从"内存参与检索"向"全SSD存储架 构"过渡,推动高带宽、大容量SSD的需求将持续增加。建议关注产业链核心受益标的。 火山引擎TOSVectors开启向量存储新范式,对SSD需求提高 根据火山引擎开发者社区公众号,TOS推出Vector Bucket,该架构采用字节自研的Cloud-Native向量索 引库Kiwi与多层级本地缓存协同架构(涵盖DRAM、SSD与远程对象存储)。在大规模、长周期存储和低 频查询的场景下,该架构不仅满足高/低频数据的分层需求,而且显著降低企业大规模使用向量数据的 门槛。TOSVector与火山引擎高性能向量数据库、火山AI agent等产品深度协同,以交互型Agent场景来 看,将高频访问的记忆(如用户的核心偏好、近期的任务执行结果等)存放在向量数据库中,实现毫秒级 的高频检索;将低频访问的记忆(如半年前的交互记录或历史执行结果)沉淀到TOSVector中,允许秒级延 迟,以此换取更低的存储成本和更广阔的记忆空间;以处理 ...
计算机行业点评:AI投资从硬到软拐点:再谈谁是中国MongoDB
Minsheng Securities· 2025-12-30 05:47
Investment Rating - The report maintains a "Recommended" rating for the industry [1] Core Insights - The report highlights a significant performance improvement in the ArgoDB solution based on the Haiguang 7000 series processor, achieving a 62% enhancement over the previous world record in the TPC-DS benchmark [4] - The trend of using low-cost storage to replace expensive memory in vector databases is identified as a major technological shift [4] - The domestic vector database market is transitioning from technical exploration to large-scale implementation, with positive signals of million-level purchases in key sectors like finance and energy [4] Summary by Sections - **Performance Records**: The ArgoDB solution has set new performance records in both TPC-DS and TPC-C benchmarks, indicating strong technological advancements [4] - **Cost Structure**: The report outlines two billing models for vector databases: annual subscription and pay-as-you-go, with specific pricing examples provided for different regions [4] - **Investment Opportunities**: The report suggests focusing on companies such as StarRing Technology, Dameng Data, Haima Data, Taiji Co., SuperMap Software, and Torus [4]
计算机行业周报20251214:谁是中国的MongoDB-20251214
Guolian Minsheng Securities· 2025-12-14 03:12
计算机行业周报 20251214 谁是中国的 MongoDB glmszqdatemark 2025 年 12 月 14 日 市场回顾 本周(12.08-12.12)沪深 300 指数下跌 0.71%,中小板指数下跌 0.13%,创业板指 数上涨 1.75%,计算机(中信)板块下跌 1.44%。板块个股涨幅前五名分别为:开普 云、淳中科技、荣科科技、达华智能、赢时胜;跌幅前五名分别为:佳华科技、恒银 科技、中科曙光、思创医惠、天源迪科。 行业要闻 工信部:14 个行业数字化转型"场景导航图"发布; 谷歌:首款 AI 眼镜将于明年上市。 公司动态 神州信息:12 月 10 日消息,持有神州数码信息服务集团股份有限公股份 389,540,110 股(占公司总股本的 39.9211%)的控股股东神州数码软件有限公司,计划在本公告 披露之日起 15 个交易日后的 3 个月内(即 2025 年 12 月 31 日至 2026 年 3 月 30 日),以集中竞价交易或大宗交易相结合的方式减持其持有的公司不超过 28,827,300 股股份,占公司总股本的 2.9543%,占剔除回购专用证券账户中的股份数量后的公司 总股本 ...
Agentic AI时代,向量数据库成“必选项”
Tai Mei Ti A P P· 2025-12-05 05:18
当OpenAI的GPT-4开始展现出自主任务分解能力,当AutoGPT、Devin等智能体能够独立完成复杂工作 流程,一个根本性问题摆在整个AI产业面前:这些有记忆、会反思、能行动的Agent,究竟该把它们 的"海马体"存放在哪里?传统数据库的磁盘I/O、精确匹配与静态架构,在高频读写、语义模糊、成本 敏感的Agentic AI时代显得格格不入。向量数据库,这个曾被视为AI"锦上添花"的技术,正迅速从幕后 走向台前,成为支撑下一代智能体系统的关键基础设施。 Agentic AI对数据库提出了新要求 生成式AI以内容创造为核心,Agentic AI以自主决策交互为特征,二者的快速演进推动向量数据库从基 础存储检索工具向AI能力基座升级,催生出在数据处理、性能表现、功能适配等多维度的全新需求, 据Gartner预测,2025年Agentic AI市场规模将突破千亿美元,年复合增长率超65%。这一爆发式增长背 后,是向量数据库技术的持续突破。 2023年初,当ChatGPT掀起第一波大模型热潮时,市场对向量数据库的认知还停留在外挂知识库层面。 并且此后很长一段时间里,AI的核心价值体现在内容生成——无论是撰写报告还 ...
百亿向量,毫秒响应:清华研发团队向量数据库 VexDB 首发,攻克模型幻觉难题
AI前线· 2025-09-25 08:04
Core Insights - The article discusses the challenges faced by enterprises in integrating AI technologies into their core business processes, particularly focusing on the "hallucination" problem of generative AI models [2][6][8] - It highlights the urgent need for reliable AI infrastructure, such as vector databases, to mitigate these issues and enhance the trustworthiness of AI applications [6][14][21] Group 1: AI Hallucination Issues - Generative AI models often produce inaccurate information due to their statistical nature, leading to significant risks in sectors like healthcare and finance [6][8] - The hallucination problem has escalated from a technical issue to a critical business risk, affecting user trust and potentially causing severe consequences [8][9] - A benchmark test revealed varying hallucination rates among different models, with some models like DeepSeek-R1 exhibiting a hallucination rate of 14.3% [6][8] Group 2: Vector Database Solutions - The introduction of vector databases, such as VexDB, aims to provide a reliable knowledge base for AI applications, addressing the hallucination problem by enhancing data retrieval processes [4][15][21] - VexDB supports high-dimensional vector data queries with millisecond response times and over 99% accuracy in recall, making it suitable for enterprise-level applications [4][15] - The global vector database market is projected to grow significantly, reaching $2.2 billion in 2024 and expected to grow at a CAGR of 21.9% from 2025 to 2034 [14][16] Group 3: RAG Framework - The RAG (Retrieval-Augmented Generation) framework is emerging as a trend to enhance the reliability of AI applications by integrating external knowledge sources [9][10] - RAG systems improve the accuracy of AI outputs by constraining the generative process within a controlled and trustworthy range [9][10] - Performance bottlenecks in RAG systems, such as data processing and retrieval speed, directly impact user experience and business outcomes [11][12] Group 4: Practical Applications of VexDB - VexDB has been successfully implemented in various industries, including healthcare and telecommunications, demonstrating its capability to enhance AI application efficiency [17][19][21] - In healthcare, a system built on VexDB reduced medical record generation time by over 60%, showcasing its effectiveness in real-world scenarios [17] - In telecommunications, VexDB improved customer conversion rates by 30% and reduced solution delivery time by 60%, enhancing overall user satisfaction [19] Group 5: Future of AI Infrastructure - The evolution of vector databases is shifting from merely enhancing retrieval capabilities to becoming integral components of AI data infrastructure [20][21] - VexDB is positioned to support complex roles in AI lifecycle management, including knowledge asset management and multi-modal semantic connections [20][21] - The adoption of vector databases is expected to rise significantly, with predictions indicating that 30% of companies will utilize them by 2026 [16][21]
国产数据库群雄逐鹿,谁是下一个中国“甲骨文”?
3 6 Ke· 2025-09-23 00:04
Core Insights - The emergence of generative AI is seen as a transformative phase akin to the Fourth Industrial Revolution, significantly impacting various industries, including the database sector [2] - Oracle's stock surged by 36% on September 10, 2025, leading to a market capitalization increase of over $240 billion, attributed to a $300 billion five-year computing power procurement agreement with OpenAI [2] - The global AI server market is projected to grow from $125.1 billion in 2024 to $158.7 billion in 2025, and potentially reach $222.7 billion by 2028, indicating robust growth opportunities in the database industry [2] Industry Trends - The domestic database industry is experiencing new growth opportunities driven by both domestic substitution and the AI revolution [2] - The competition among domestic database providers has intensified, with companies like Nanda Tongyong (GBase) enhancing their product offerings to meet AI demands [4][8] - The shift from structured to unstructured data processing presents challenges for database companies, necessitating improvements in data handling capabilities [7] Company Developments - Nanda Tongyong has upgraded its core products to include vector data management, compute-storage separation, and AI Native capabilities, positioning itself to better meet enterprise needs in the AI era [4][8] - The company has established a comprehensive product line, including GBase 8s, GBase 8c, and GBase 8a, which are being applied in critical sectors such as finance, telecommunications, and energy [9] - GBase Cloud Data Warehouse (GCDW) is designed to efficiently manage and analyze massive datasets, supporting both on-premises and cloud deployments [10][11] Competitive Landscape - The domestic database market is entering a rapid growth phase, with the ability to adapt to AI becoming a critical competitive factor [13] - Nanda Tongyong aims to be the "data cornerstone of the AI era," leveraging advancements in data lake and warehouse integration, vector databases, and AI Native technologies [13][14] - The company has introduced intelligent operation and maintenance tools that significantly enhance efficiency in database management, reducing health check times and improving SQL optimization accuracy [15] Market Position - Nanda Tongyong ranks highly in various industry reports, being recognized as a leader in domestic analytical databases and independent databases [15] - The ongoing acceleration of domestic substitution and AI integration is expected to lead to a reshuffling in the database industry over the next two to three years, with product and operational capabilities being key competitive factors [15][16]
万字长文!RAG实战全解析:一年探索之路
自动驾驶之心· 2025-08-07 09:52
Core Viewpoint - The article discusses the Retrieval Augmented Generation (RAG) method, which combines retrieval-based models and generative models to enhance the quality and relevance of generated text. It addresses issues such as hallucination, knowledge timeliness, and long text processing in large models [1]. Group 1: Background and Challenges - RAG was proposed by Meta in 2020 to enable language models to access external information beyond their internal knowledge [1]. - RAG faces three main challenges: retrieval quality, enhancement process, and generation quality [2]. Group 2: Challenges in Retrieval Quality - Semantic ambiguity can arise from vector representations, leading to irrelevant results [5]. - User input has become more complex, transitioning from keywords to natural dialogue, which complicates retrieval [5]. - Document segmentation methods can affect the matching degree between document blocks and user queries [5]. - Extracting and representing multimodal content (e.g., tables, charts) poses significant challenges [5]. - Integrating context from retrieved paragraphs into the current generation task is crucial for coherence [5]. - Redundancy and repetition in retrieved content can lead to duplicated information in generated outputs [5]. - Determining the importance of multiple retrieved paragraphs for the generation task is challenging [5]. - Over-reliance on retrieval content can exacerbate hallucination issues [5]. - Irrelevance of generated answers to the query is a concern [5]. - Toxicity or bias in generated answers is another issue [5]. Group 3: Overall Architecture - The product architecture consists of four layers, including model layer, offline understanding layer, online Q&A layer, and scenario layer [7]. - The RAG framework is divided into three main components: query understanding, retrieval model, and generation model [10]. Group 4: Query Understanding - The query understanding module aims to improve retrieval by interpreting user queries and generating structured queries [14]. - Intent recognition helps select relevant modules based on user queries [15]. - Query rewriting utilizes LLM to rephrase user queries for better retrieval [16]. - Query expansion breaks complex questions into simpler sub-questions for more effective retrieval [22]. Group 5: Retrieval Model - The retrieval model's effectiveness depends on the accuracy of embedding models [33]. - Document loaders facilitate loading document data from various sources [38]. - Text converters prepare documents for retrieval by segmenting them into smaller, semantically meaningful chunks [39]. - Document embedding models create vector representations of text to enable semantic searches [45]. - Vector databases support efficient storage and search of embedded data [47]. Group 6: Generation Model - The generation model utilizes retrieved information to generate coherent responses to user queries [60]. - Different strategies for prompt assembly are employed to enhance response generation [62][63]. Group 7: Attribution Generation - Attribution in RAG is crucial for aligning generated content with reference information, ensuring accuracy [73]. - Dynamic computation methods can enhance the generation process by matching generated text with reference sources [76]. Group 8: Evaluation - The article emphasizes the importance of defining metrics and evaluation methods for assessing RAG system performance [79]. - Various evaluation frameworks, such as RGB and RAGAS, are introduced to benchmark RAG systems [81]. Group 9: Conclusion - The article summarizes key modules in RAG practice and highlights the need for continuous research and development to refine these technologies [82].
数据治理对人工智能的成功至关重要
3 6 Ke· 2025-07-21 03:09
Group 1 - The emergence of large language models (LLMs) has prompted various industries to explore their potential for business transformation, leading to the development of numerous AI-enhancing technologies [1] - AI systems require access to company data, which has led to the creation of Retrieval-Augmented Generation (RAG) architecture, essential for enhancing AI capabilities in specific use cases [2][5] - A well-structured knowledge base is crucial for effective AI responses, as poor quality or irrelevant documents can significantly hinder performance [5][6] Group 2 - Data governance roles are evolving to support AI system governance and the management of unstructured data, ensuring the protection and accuracy of company data [6] - Traditional data governance has focused on structured data, but the rise of Generative AI (GenAI) is expanding this focus to include unstructured data, which is vital for building scalable AI systems [6] - Collaboration between business leaders, AI technology teams, and data teams is essential for creating secure and effective AI systems that can transform business operations [6]