MIRIX - filings, earnings calls, financial reports, news

MIRIX


机器之心· 2025-11-07 07:17

Core Insights
- The article emphasizes that "memory" is becoming a crucial factor for intelligent agents to achieve long-term intelligence, especially in the context of rapidly evolving large language models [2]
- Mem-α is introduced as a solution to the limitations of existing memory-enhanced agents, which often rely on manual rules and prompts, by incorporating reinforcement learning for autonomous memory management [2][9]

Memory Management Challenges
- Existing memory-enhanced agents face three main challenges: not knowing which information to retain long-term, when to update old memories, and how to allocate different types of memories effectively [8]
- Prior to Mem-α training, models like Qwen3-4B struggled with memory updates, leading to frequent errors in question answering [6]

Mem-α Contributions
- Mem-α transforms memory construction into a sequence decision problem optimized through reinforcement learning, allowing agents to autonomously explore optimal memory management strategies [9]
- The architecture of Mem-α is inspired by cognitive science, featuring a three-layer memory system that enables flexible use of different memory types [15]

Training and Evaluation
- Mem-α's training dataset is constructed from four dimensions, focusing on accurate retrieval, test-time learning, and long-range understanding, while excluding conflict resolution due to the lack of real-world benchmarks [17]
- Experimental results show that Mem-α significantly outperforms existing methods across all evaluation tasks, particularly in accurate retrieval and long-range understanding [22]

Key Findings
- Mem-α demonstrates a strong generalization ability, effectively managing memory usage while maintaining high performance, reducing memory consumption by nearly 50% compared to other models [22]
- The structured memory architecture of Mem-α enhances the organization and retrieval of complex information, outperforming flat memory baselines [24]
- Mem-α exhibits robust extrapolation capabilities, generalizing well to extremely long sequences despite being trained on shorter samples [24]

Ablation Study
- An ablation study reveals that prior to Mem-α, models had low accuracy and struggled with memory management, but after training, accuracy improved significantly, showcasing the effectiveness of reinforcement learning in memory management [25]

Future Implications
- Mem-α indicates a trend where memory management evolves from an engineering problem to a learnable one, suggesting potential applications in multimodal memory and personalized memory strategies [27]
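The framing of memory construction as a sequence of decisions can be illustrated with a toy rollout. This is only a sketch under stated assumptions, not Mem-α's actual method: the `Memory` class, the action tuples, and both policies are hypothetical names invented here, memory is a flat key-value store rather than the three-layer system the article mentions, and the reinforcement learning optimizer itself is omitted. The reward is simply the fraction of questions answerable from the final memory.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    """Hypothetical flat key-value memory (the real system is layered)."""
    entries: dict = field(default_factory=dict)


def apply_action(memory, action):
    """Apply one (kind, key, value) memory operation in place."""
    kind, key, value = action
    if kind in ("insert", "update"):
        memory.entries[key] = value
    elif kind == "delete":
        memory.entries.pop(key, None)
    # any other kind (e.g. "skip") leaves memory unchanged


def rollout(policy, chunks, questions):
    """Feed chunks to the policy one at a time, then score the final memory.

    The returned reward is the signal an RL trainer would optimize;
    the trainer itself is not shown.
    """
    memory = Memory()
    for chunk in chunks:
        for action in policy(memory, chunk):
            apply_action(memory, action)
    correct = sum(1 for key, answer in questions
                  if memory.entries.get(key) == answer)
    return correct / len(questions), memory


def insert_only_policy(memory, chunk):
    """Toy policy that never overwrites an existing memory."""
    return [("insert", k, v) for k, v in chunk if k not in memory.entries]


def insert_or_update_policy(memory, chunk):
    """Toy policy that updates stale memories when new facts arrive."""
    return [("update" if k in memory.entries else "insert", k, v)
            for k, v in chunk]


# The user's city changes between chunks; only the updating policy tracks it.
chunks = [[("city", "Paris")], [("city", "Berlin")]]
questions = [("city", "Berlin")]
stale_reward, _ = rollout(insert_only_policy, chunks, questions)
fresh_reward, _ = rollout(insert_or_update_policy, chunks, questions)
```

In this toy setting the two policies differ only in whether they overwrite old entries, which mirrors the "when to update old memories" challenge listed above; a learned policy would be trained to maximize the reward rather than being hand-written.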

That Day, AI Large Models Remembered the Shackles of "Amnesia" That Had Bound Them

机器之心· 2025-08-31 05:33

Core Insights
- The article discusses the advancements in memory capabilities of large language models (LLMs), highlighting how companies like Google, OpenAI, and Anthropic are integrating memory features into their AI systems to enhance user interaction and continuity in conversations [1][3][10].

Memory Capabilities of LLMs
- Google's Gemini has introduced memory capabilities that allow it to retain information across multiple conversations, making interactions more natural and coherent [1].
- OpenAI's ChatGPT has implemented a memory feature since February 2024, enabling users to instruct the model to remember specific details, which improves its performance over time [3][42].
- Anthropic's Claude has also added memory functionality, allowing it to recall previous discussions when prompted by the user [3][6].

Types of Memory in LLMs
- Memory can be categorized into sensory memory, short-term memory, and long-term memory, with a focus on long-term memory for LLMs [16][17].
- Contextual memory is a form of short-term memory where relevant information is included in the model's context window [18].
- External memory involves storing information in an external database, allowing for retrieval during interactions, which is a common method for building long-term memory [22][23].
- Parameterized memory attempts to encode information directly into the model's parameters, providing a deeper form of memory [24][29].

Innovations in Memory Systems
- New startups are emerging, focusing on memory systems for AI, such as Letta AI's MemGPT and RockAI's Yan 2.0 Preview, which aim to enhance memory capabilities [11][12].
- The concept of hybrid memory systems is gaining traction, combining different types of memory to improve AI's adaptability and performance [37][38].

Notable Memory Implementations
- OpenAI's ChatGPT allows users to manage their memory entries, while Anthropic's Claude retrieves past conversations only when requested [42][44].
- Gemini supports user input for memory management, enhancing its ability to remember user preferences [45].
- The M3-Agent developed by ByteDance, Zhejiang University, and Shanghai Jiao Tong University integrates long-term memory capabilities across multiple modalities, including video and audio [10][70].

Future Trends in AI Memory
- The future of AI memory is expected to evolve towards multi-modal and integrated memory systems, allowing for a more comprehensive understanding of user interactions [97][106].
- There is a growing emphasis on creating memory systems that can autonomously manage and optimize their memory, akin to human cognitive processes [101][106].
- The ultimate goal is to develop AI systems that can exhibit unique personalities and emotional connections through their memory capabilities, potentially leading to the emergence of artificial general intelligence (AGI) [109][110].
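The distinction between external memory (stored outside the model and retrieved on demand) and contextual memory (text placed in the context window) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not how any of the named products work: `ExternalMemory`, `recall`, and `build_prompt` are hypothetical names, and word overlap stands in for the embedding-based retrieval a production system would more likely use.

```python
import re


def tokenize(text):
    """Lowercase a string and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


class ExternalMemory:
    """Hypothetical external store: entries live outside the model."""

    def __init__(self):
        self.entries = []

    def remember(self, text):
        self.entries.append(text)

    def recall(self, query, k=2):
        """Return up to k entries sharing at least one word with the query."""
        query_words = tokenize(query)
        scored = [(len(query_words & tokenize(e)), e) for e in self.entries]
        scored = [pair for pair in scored if pair[0] > 0]
        # Stable sort: ties keep insertion order.
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [entry for _, entry in scored[:k]]


def build_prompt(memory, user_message, k=2):
    """Turn recalled entries into contextual memory by prepending them."""
    recalled = memory.recall(user_message, k)
    lines = ["Relevant memories:"]
    lines += [f"- {entry}" for entry in recalled]
    lines += ["", f"User: {user_message}"]
    return "\n".join(lines)


memory = ExternalMemory()
memory.remember("The user prefers vegetarian recipes")
memory.remember("The user lives in Berlin")
prompt = build_prompt(memory, "Suggest vegetarian recipes for dinner")
```

Here only the entry about vegetarian recipes overlaps with the request, so it is the only memory copied into the prompt; the Berlin entry stays in the external store until a relevant query arrives.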

A World First: "AI Memory" Goes Open Source, and MIRIX Launches an App at the Same Time

36Kr · 2025-07-30 03:32

Core Insights
- MIRIX is the world's first truly multimodal, multi-agent AI memory system, developed by researchers from the University of California, San Diego, and New York University [1][2]
- The introduction of MIRIX marks a significant evolution in AI, transitioning from "dialogue" to "memory" as a necessary path for AI advancement [1]

Performance Metrics
- MIRIX outperforms traditional RAG methods by 35% in accuracy while reducing storage overhead by 99.9% [4][26]
- In the LOCOMO long dialogue task, MIRIX achieved an accuracy of 85.4%, setting a new performance benchmark [4][28]
- Compared to long context methods, MIRIX shows a 410% performance increase and a 93.3% reduction in storage requirements [26]

Application and Usability
- A desktop application for MIRIX has been launched, allowing users to build their own AI personal assistant [4][31]
- The application records users' digital life moments and creates a personalized digital memory [8][31]
- Users can interact with the intelligent agent to retrieve past activities and information [11][31]

Memory Structure
- MIRIX introduces a novel memory architecture divided into six modules: Core Memory, Episodic Memory, Semantic Memory, Procedural Memory, Resource Memory, and Knowledge Vault [14][16][17]
- This structure allows for a more nuanced approach to memory management compared to traditional long-term and short-term memory classifications [14]

Multi-Agent Workflow
- MIRIX employs a multi-agent system to manage its complex memory architecture, featuring a Meta Memory Manager and six sub Memory Managers [18]
- The workflow includes processes for memory updates and retrieval, ensuring efficient information management [22][23]

Dataset and Training
- The development of MIRIX utilized a dataset comprising over 45,000 high-resolution screenshots, creating a challenging benchmark for multimodal understanding [24]
- The dataset includes sequences with nearly 20,000 screenshots, emphasizing the model's long-term memory capabilities [28]

Conclusion
- MIRIX signifies a new development phase for large models, transitioning from "instant dialogue generation" to "long-term memory-driven intelligence" [31]
- The application emphasizes user privacy by storing all memory locally in SQLite [31]
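The arrangement summarized above, with one Meta Memory Manager in front of six sub Memory Managers and local SQLite storage, can be sketched as follows. This is an illustrative sketch only and not MIRIX's actual code or schema: the class names, the single `memory` table, and the keyword search are assumptions made here, and the caller chooses the memory type explicitly, whereas the real system presumably lets its agents decide the routing.

```python
import sqlite3
from enum import Enum


class MemoryType(Enum):
    """The six memory modules named in the article."""
    CORE = "core"
    EPISODIC = "episodic"
    SEMANTIC = "semantic"
    PROCEDURAL = "procedural"
    RESOURCE = "resource"
    KNOWLEDGE_VAULT = "knowledge_vault"


class SubMemoryManager:
    """Hypothetical manager owning one memory type in a shared SQLite table."""

    def __init__(self, conn, memory_type):
        self.conn = conn
        self.memory_type = memory_type

    def add(self, text):
        self.conn.execute(
            "INSERT INTO memory (memory_type, content) VALUES (?, ?)",
            (self.memory_type.value, text),
        )
        self.conn.commit()

    def search(self, keyword):
        rows = self.conn.execute(
            "SELECT content FROM memory "
            "WHERE memory_type = ? AND content LIKE ? ORDER BY id",
            (self.memory_type.value, f"%{keyword}%"),
        )
        return [row[0] for row in rows]


class MetaMemoryManager:
    """Hypothetical coordinator that dispatches to the six sub managers."""

    def __init__(self, db_path=":memory:"):
        # A file path here would keep all memories on the local disk.
        self.conn = sqlite3.connect(db_path)
        self.conn.execute(
            "CREATE TABLE IF NOT EXISTS memory ("
            "id INTEGER PRIMARY KEY AUTOINCREMENT, "
            "memory_type TEXT NOT NULL, content TEXT NOT NULL)"
        )
        self.managers = {t: SubMemoryManager(self.conn, t) for t in MemoryType}

    def update(self, memory_type, text):
        """Route a new memory to the sub manager for its type."""
        self.managers[memory_type].add(text)

    def retrieve(self, keyword):
        """Query every sub manager and keep only the types with hits."""
        hits = {}
        for memory_type, manager in self.managers.items():
            found = manager.search(keyword)
            if found:
                hits[memory_type] = found
        return hits


meta = MetaMemoryManager()
meta.update(MemoryType.EPISODIC, "2025-07-30: read the MIRIX announcement")
meta.update(MemoryType.PROCEDURAL, "How to export notes: File > Export")
hits = meta.retrieve("MIRIX")
```

Keeping every module in one local SQLite database is one simple way to satisfy the privacy property the article describes, since nothing needs to leave the user's machine for a memory to be written or queried.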


Tencent Research Institute AI Express 20250730

腾讯研究院· 2025-07-29 16:01

Group 1
- Anthropic announced a weekly usage limit for Claude Pro and Max users, affecting less than 5% of subscribers [1]
- Some users reported extreme cases where a $200 plan resulted in actual consumption of tens of thousands of dollars due to continuous operation [1]
- Users complained about a lack of transparency regarding usage, leading many to seek alternative products [1]

Group 2
- Microsoft Edge introduced a "Copilot mode" that enhances context awareness across tabs, allowing simultaneous reading and analysis of all open pages [2]
- The new interface features a simplified input box that understands user intent and supports voice control and thematic journey functions [2]
- This feature is currently available for free in all Copilot markets but may be bundled with a subscription service in the future [2]

Group 3
- Wuwen Xinqiong launched a comprehensive AI efficiency enhancement solution, including three core products: Wuqiong AI Cloud, Wujie Intelligent Computing Platform, and Wuyin Terminal Intelligence [3]
- The solution covers 26 provinces and cities with 53 core data centers, integrating over 15 mainstream chip architectures and achieving a total computing power scale exceeding 25,000 P [3]
- Innovations on the edge include the world's first edge intrinsic model "Wuqiong Tianquan," which maintains cloud-level intelligence with 21 billion parameters while controlling memory usage to 7 billion [3]

Group 4
- Step 3 launched a new AI research assistant called "Jieyue Deep Research," capable of completing complex research tasks and generating in-depth professional reports within ten minutes [4][5]
- The assistant achieved a 70% high pass rate in the xbench-DeepSearch evaluation [5]
- It is based on reinforcement learning and multi-agent architecture, enabling autonomous thinking, reasoning, and dynamic tool usage for real-world complex tasks [5]

Group 5
- JD.com upgraded its large model brand to JoyAI, introducing solutions like the JoyAgent intelligent agent platform, JoyInside embedded intelligence, and digital humans [6]
- JoyAgent is the first 100% open-source enterprise-level intelligent agent, receiving over 2,000 GitHub stars and possessing a complete product-level closed-loop capability [6]
- JoyAI's products have been implemented in various scenarios, with digital human services used by more than 20,000 brands and the interactive AI toy Fuzozo selling out during its first pre-sale [6]

Group 6
- Researchers from UC San Diego and NYU launched and open-sourced MIRIX, the world's first multi-modal, multi-agent AI memory system, along with a desktop app [7]
- The system categorizes memory into six modules: core, episodic, semantic, procedural, resource, and knowledge vault, managed by a meta memory manager and six sub memory managers [7]
- MIRIX achieved a 35% higher accuracy than traditional RAG in the ScreenshotVQA test and reduced storage by 99.9%, setting a record of 85.4% in the LOCOMO long dialogue task [7]

Group 7
- The National Satellite Meteorological Center, Nanchang University, and Huawei jointly released the "Fengyu" model, the world's first full-chain space weather AI forecasting model [8]
- The model features a pioneering chain training structure, including solar wind, Earth's magnetic field, and ionosphere models [8]
- In practical tests, "Fengyu" maintained a prediction error of around 10% for global electron density and performed excellently during multiple major magnetic storm events, with 11 national invention patents applied for [8]

Group 8
- Shanghai AI Lab released and open-sourced the "Shusheng" scientific multi-modal large model Intern-S1, which surpasses top closed-source models in scientific capabilities [9]
- The model features a "cross-modal scientific analysis engine" that can accurately interpret complex scientific data such as chemical formulas and protein structures [9]
- The research team proposed a method for synthesizing scientific data that combines general reasoning capabilities with multiple top professional abilities, creatively reducing reinforcement learning training costs [9]

Group 9
- a16z partner Martin Casado stated that the AI large model competition will evolve into an oligopoly similar to the cloud computing battle, creating a new brand effect [10]
- In AI competition, the application layer lacks a technological moat, and rational business decisions will focus on "sacrificing profits for distribution," with value emerging from foundational infrastructure and vertical domain deepening [10]
- AI will not transform ordinary developers into super engineers but will allow "10x engineers to become 2x," simplifying programming by eliminating cumbersome tasks and returning to the essence of creation [10]

Group 10
- Tencent's Robotics X Lab and Futian Lab jointly launched the embodied intelligence open platform Tairos, aimed at enhancing software capabilities for robot developers and application developers [11]
- The platform is based on the SLAP³ technology system, providing three core capabilities: planning large models, multi-modal perception large models, and perception-action joint large models [11]
- Five major trends in the future development of embodied intelligence were identified: integration of virtual and real worlds, reduced technical barriers, intelligent evolution, agentification, and multi-modal perception [11]


腾讯研究院· 2025-07-15 15:09

Group 1
- The U.S. government has granted Nvidia permission to resume sales of the H20 AI chip to China, following a meeting between Jensen Huang and President Trump [1]
- Nvidia reported a record revenue of $26.044 billion for Q1 FY2025, a 262% year-over-year increase, with data center revenue of $22.6 billion being the main growth driver [1]

Group 2
- Meta is building the "Prometheus" AI supercomputer cluster, expected to reach 1GW of computing power by 2026, comparable to the power consumption of a nuclear power plant or a city of one million residents [2]
- The "Hyperion" plan in 2027 aims to deploy over 5GW of computing power, with Meta planning to build a natural gas power plant to ensure supply [2]

Group 3
- Elon Musk launched the Grok 4 "smart companion" feature, which includes animated characters with interactive voice capabilities, although the functionality is still in early stages [3]
- Grok 4 can generate playable HTML5 games and integrate 3D models and textures, showcasing Musk's ambitions in the AI companion and gaming sectors [3]

Group 4
- Amazon introduced a new IDE tool called Kiro, which offers "vibe coding" and "planning" modes, enabling specification-driven development through specs and hooks [4][5]
- Kiro can convert simple requirements into complete specifications, generating technical design diagrams and automating tasks [5]

Group 5
- Google's first Gemini embedding model scored 68.37 in the MTEB evaluation, surpassing OpenAI's score of 58.93, making it the strongest embedding model currently available [6]
- The new model is cost-effective, priced at $0.15 per million tokens, and has an open API for independent creators [6]

Group 6
- The launch of DeepResearch by BitAI features a visual problem chain to display the AI's thought process, providing detailed research reports and interactive web pages [7]
- Free users have a daily limit of 100 searches, while annual members can search up to 500 times per day, making it a cost-effective option compared to other AI services [7]

Group 7
- The MIRIX multi-modal AI memory system, developed by UCSD and NYU, achieved a 35% higher accuracy than traditional RAG methods while reducing storage by 99.9% [8]
- MIRIX is designed with six memory types modeled on human memory and supports multi-modal input, storing memories locally in SQLite databases for privacy protection [8]

Group 8
- Microsoft's AI4S team developed the Orbformer model to balance precision and efficiency in quantum chemistry calculations, achieving chemical accuracy while significantly reducing computational costs [10]
- The model consists of three main modules and has shown improved performance in various chemical tests [10]

Group 9
- An article from The New Yorker discusses the potential of AI companions to alleviate loneliness but warns that complete reliance on them may hinder personal growth and the development of real relationships [11]
- The article suggests that AI should be accessible to those in genuine need, such as the elderly or cognitively impaired, while cautioning against over-reliance for the general population [11]

Group 10
- An OpenAI engineer argues that coding represents only 10-20% of a programmer's core value, with structured communication accounting for 80-90% [12]
- The engineer emphasizes the importance of specifications over code, as specifications capture intent and values more comprehensively [12]