Yan 2.0 Preview

Search documents
那天,AI大模型想起了,被「失忆」所束缚的枷锁
机器之心· 2025-08-31 05:33
Core Insights - The article discusses the advancements in memory capabilities of large language models (LLMs), highlighting how companies like Google, OpenAI, and Anthropic are integrating memory features into their AI systems to enhance user interaction and continuity in conversations [1][3][10]. Memory Capabilities of LLMs - Google's Gemini has introduced memory capabilities that allow it to retain information across multiple conversations, making interactions more natural and coherent [1]. - OpenAI's ChatGPT has implemented a memory feature since February 2024, enabling users to instruct the model to remember specific details, which improves its performance over time [3][42]. - Anthropic's Claude has also added memory functionality, allowing it to recall previous discussions when prompted by the user [3][6]. Types of Memory in LLMs - Memory can be categorized into sensory memory, short-term memory, and long-term memory, with a focus on long-term memory for LLMs [16][17]. - Contextual memory is a form of short-term memory where relevant information is included in the model's context window [18]. - External memory involves storing information in an external database, allowing for retrieval during interactions, which is a common method for building long-term memory [22][23]. - Parameterized memory attempts to encode information directly into the model's parameters, providing a deeper form of memory [24][29]. Innovations in Memory Systems - New startups are emerging, focusing on memory systems for AI, such as Letta AI's MemGPT and RockAI's Yan 2.0 Preview, which aim to enhance memory capabilities [11][12]. - The concept of hybrid memory systems is gaining traction, combining different types of memory to improve AI's adaptability and performance [37][38]. Notable Memory Implementations - OpenAI's ChatGPT allows users to manage their memory entries, while Anthropic's Claude retrieves past conversations only when requested [42][44]. - Gemini supports user input for memory management, enhancing its ability to remember user preferences [45]. - The M3-Agent developed by ByteDance, Zhejiang University, and Shanghai Jiao Tong University integrates long-term memory capabilities across multiple modalities, including video and audio [10][70]. Future Trends in AI Memory - The future of AI memory is expected to evolve towards multi-modal and integrated memory systems, allowing for a more comprehensive understanding of user interactions [97][106]. - There is a growing emphasis on creating memory systems that can autonomously manage and optimize their memory, akin to human cognitive processes [101][106]. - The ultimate goal is to develop AI systems that can exhibit unique personalities and emotional connections through their memory capabilities, potentially leading to the emergence of artificial general intelligence (AGI) [109][110].
计算机行业周报:国产AI开源模型爆发,RockAI颠覆端侧架构-20250803
SINOLINK SECURITIES· 2025-08-03 14:03
Investment Rating - The report suggests a focus on leading domestic generative AI model companies such as iFlytek, and highlights potential in AI hardware applications with recommendations for Hikvision, Hongsoft Technology, and Hesai [3] Core Insights - The AI industry is expected to see significant growth, particularly in the AI application sector, with a projected increase in profitability driven by cost savings from AI integration [5][12] - The report anticipates a stable performance in the AI industry chain and stablecoin-related sectors, with further advancements expected in the second half of the year [5][12] - The report identifies high-growth areas for 2025, including AI computing power and lidar technology, while noting stable growth in software outsourcing, financial IT, quantum computing, and data elements [12][13] Summary by Sections Weekly Insights - The report discusses the advancements in AI models, including the release of the GLM-4.5 series and the Qwen3-235B-Thinking model, which have shown significant improvements in performance metrics [12] - It highlights the expected increase in investor focus on fundamentals as the market approaches the mid-year reporting period [12] Industry Performance Review - From July 28 to August 1, 2025, the computer industry index (Shenwan) decreased by 0.20%, outperforming the CSI 300 index by 1.55 percentage points [14] - The report lists the top-performing companies in the computer sector during this period, indicating a mixed performance across the industry [15] Key Events Ahead - Upcoming events include the 40th China Computer Application Conference and the 2025 World Robot Conference, which are expected to present opportunities within the industry [24][25]
岩山科技:旗下RockAI亮相WAIC 2025,赋予AI离线记忆能力
Zheng Quan Shi Bao Wang· 2025-07-28 09:00
Core Insights - The World Artificial Intelligence Conference (WAIC 2025) showcased over 800 companies, highlighting the rapid development of AI technology across various sectors, including model applications, intelligent agents, embodied intelligence, and AI hardware [1] - RockAI's new generation model, Yan 2.0 Preview, features "native memory" and "offline intelligence," allowing devices to operate without internet connectivity and perform complex tasks autonomously [1][2] - The model's advancements include support for video modalities and a memory module that enables autonomous decision-making, moving beyond traditional AI limitations [2] Company Developments - RockAI is targeting a diverse customer base, including consumer electronics brands, ODM manufacturers, and companies in need of high-performance, low-power solutions [3] - A memorandum of understanding was signed with AMD to collaborate on AI PC technology, marking a significant step in the commercialization of the Yan architecture and positioning Chinese AI technology on the global stage [3] - RockAI aims for general artificial intelligence (AGI) by focusing on collective learning and collaborative evolution, allowing multiple intelligent units to solve complex problems through local interactions [3][4] Technological Impact - The technology showcased by RockAI at WAIC 2025 represents a shift from one-time delivery of smart hardware to continuous evolution throughout the product lifecycle, potentially paving the way for AGI [4]
腾讯研究院AI速递 20250728
腾讯研究院· 2025-07-27 10:15
Group 1: AI Model Developments - GPT-5, codenamed "Lobster," has been quietly launched on the WebDev Arena testing platform, showing performance significantly surpassing Grok-4 [1] - The new Step 3 foundational model by Jieyue Xingchen is a native multimodal reasoning model with a total parameter count of 321 billion and an active parameter count of 38 billion, achieving high inference efficiency [2] - RockAI showcased the Yan 2.0 Preview model, which operates offline and incorporates a "native memory module" for continuous learning and evolution [7] Group 2: AI Applications and Products - Tencent unveiled the "Hunyuan 3D World Model 1.0," the first open-source 3D world generation model, enabling quick generation of interactive 3D scenes [3] - Alibaba previewed its self-developed "Quark AI Glasses," which integrate various functionalities from the Alibaba ecosystem and are set to be released within the year [4][5] - Lovart launched the ChatCanvas feature, combining visual understanding and multimodal design, allowing users to perform advanced design operations on a smart canvas [6] Group 3: Marketing and Robotics Innovations - The Navos AI Agent by Taidong Technology can generate marketing materials in 5 minutes and execute cross-national campaigns within 72 hours, addressing localization cost challenges [8] - Unitree Technology introduced the humanoid robot Unitree R1, priced from 39,900 yuan, featuring 26 degrees of freedom and advanced capabilities [10] Group 4: AI Ethics and Future Perspectives - Geoffrey Hinton emphasized the potential for large models to achieve "immortality" while warning of the risks associated with AI surpassing human intelligence [11] - Hinton suggested separating the research on making AI "smarter" from making it "kinder," advocating for shared "kindness technology" to mitigate future AI risks [12]
在WAIC现场,全球首个拥有「原生记忆力」的大模型亮相,但不是Transformer
机器之心· 2025-07-26 09:32
Core Viewpoint - Google is initiating a transformation in AI architecture, moving beyond the limitations of the existing attention mechanisms with the introduction of the MoR architecture, indicating a consensus on the need for architectural innovation in the AI field [2][3]. Group 1: RockAI's Innovations - RockAI has developed the Yan architecture, which is a non-Transformer framework that significantly reduces computational complexity, allowing for offline operation on low-power devices like Raspberry Pi [5][10]. - The Yan 2.0 Preview model possesses native memory capabilities, enabling it to remember interactions over time, unlike traditional models that forget previous conversations [6][12]. - The architecture allows for end-to-end memory integration, making it easier for users as it does not require manual management of external knowledge bases [19][22]. Group 2: Challenges with Transformer Models - Transformer models face issues such as data scarcity and high computational requirements, making it difficult to deploy on low-power devices without significant modifications [9][10]. - The reliance on massive datasets for pre-training is becoming increasingly challenging due to the difficulty in acquiring high-value data [9]. - Current models often lack the ability to learn and update parameters during inference, limiting their adaptability [9][10]. Group 3: Vision for AI Development - RockAI aims to create intelligent devices that can learn and evolve independently, moving away from static models that require cloud connectivity [24][25]. - The concept of collective intelligence is emphasized, where individual devices with learning capabilities can share knowledge and evolve together, contributing to the development of AGI [26][29]. - The company’s mission is to ensure that every device possesses its own intelligence, promoting offline smart capabilities [27][28]. Group 4: Market Reception and Future Directions - RockAI's approach has gained recognition at industry events, with hardware manufacturers showing interest in their technology due to its unique memory capabilities [34][35]. - The company is committed to continuing its challenging yet correct technological path, focusing on enhancing autonomous learning capabilities [36][37]. - RockAI's long-term vision reflects a commitment to fundamental technological advancements, akin to the journeys of leading AI research labs [37][38].