多模态长期记忆

Search documents
嚯!刚刚,张麻子陪我玩黑猴了
量子位· 2025-08-18 04:00
Core Viewpoint - The article discusses the launch and features of DouDou AI 1.0, an AI gaming companion designed to enhance the gaming experience by providing real-time assistance and emotional support during gameplay [8][56]. Group 1: Product Features - DouDou AI 1.0 utilizes real-time VLM (Visual Language Model) technology to understand game visuals and provide context-aware assistance [11][45]. - The AI can recognize various game scenarios and initiate conversations, offering gameplay tips, strategies, and emotional encouragement [12][19]. - It supports a wide range of game genres, from hardcore action games like "Black Myth: Wukong" to casual games like "Stardew Valley" [13][21]. Group 2: User Experience - Users have reported a highly engaging experience, with the AI providing real-time feedback and suggestions during gameplay, enhancing both performance and enjoyment [6][26]. - The AI's ability to analyze gameplay and provide post-game reviews adds a layer of depth to the gaming experience [28][29]. - Emotional support is a key feature, with the AI offering encouragement and maintaining a positive atmosphere during gameplay [24][36]. Group 3: Broader Applications - Beyond gaming, DouDou AI 1.0 is positioned as a versatile companion for various activities, including watching shows, shopping, and online learning [43][62]. - The AI's capabilities extend to real-time screen reading and analysis, making it applicable in multiple contexts beyond just gaming [45][46]. Group 4: Technical Insights - The AI's multi-modal perception capabilities allow it to integrate visual and auditory inputs for a more human-like interaction [47][48]. - DouDou AI 1.0 incorporates long-term memory features, enabling it to learn user preferences and provide personalized recommendations over time [52][53]. - Data security measures are in place, limiting the AI's visual recognition capabilities to specific applications, ensuring user privacy [54].
全球首次,「AI记忆」开源落地,MIRIX同步上线APP
3 6 Ke· 2025-07-30 03:32
加利福尼亚大学圣迭戈分校博士生王禹和纽约大学教授陈溪联合推出并开源了 MIRIX,全球首个真正意义上的多模态、多智能体AI记忆系 统。MIRIX团队同步上线了一款桌面端APP,可直接下载使用! 还记得第一次用 GPT 写邮件的惊喜吗?却也一定遇到过今天的 AI「忘性」——聊得再深入,窗口一关,历史烟消云散。 因此,研究人员认为:从「对话」到「记忆」,将是AI进化的必经之路。 研究人员推出并开源MIRIX,全球首个真正意义上的多模态、多智能体AI记忆系统。 在ScreenshotVQA这一需要深度多模态理解的挑战性基准上,MIRIX的准确率比传统RAG方法高出35%,存储开销降低99.9%,与长文本方法相比超出 410%,开销降低93.3%。 在LOCOMO长对话任务中,MIRIX以85.4%的成绩显著超越所有现有方法,树立了新的性能标杆。 与此同时,研究人员在Mac端上线了一款应用产品,通过这款开箱即用的应用程序,终于可以为每个人构建专属于自己的AI个人助理。 桌面端APP使用场景 直接访问官方网站,即可直接下载APP: 论文链接:https://arxiv.org/abs/2507.07957 官方网站:h ...