Tencent Research Institute AI Express 20250928
Tencent Research Institute · 2025-09-27 16:01
Group 1: OpenAI's New Feature
- OpenAI launched a new ChatGPT feature, "Pulse," initially available to Pro users, that delivers personalized content based on the user's chat history and feedback [1]
- The feature is built on an intelligent agent that runs searches asynchronously and can connect to Gmail and Google Calendar to make more relevant suggestions [1]
- Pulse presents content as thematic cards that users can like or dislike, marking a shift from passive to proactive personalized service [1]

Group 2: Thinking Machines' Research
- Thinking Machines, valued at 84 billion, released its second research paper, "Modular Manifolds," which improves training stability and efficiency by constraining and optimizing different layers of the network [2]
- Researcher Jeremy Bernstein introduced the modular manifold method to address instability caused by extreme weight values during neural network training, backed by theoretical analysis and experimental validation [2]
- The company's founders, including Mira Murati, have publicly supported the research, which follows a first paper focused on reducing nondeterminism in large model inference [2]

Group 3: Google's Gemini Robotics
- Google DeepMind introduced the Gemini Robotics 1.5 series, comprising Gemini Robotics 1.5 and Gemini Robotics-ER 1.5, aimed at making robots more intelligent [3]
- Gemini Robotics 1.5 is an advanced vision-language-action model that translates visual information and commands into robot actions, while Gemini Robotics-ER 1.5 is a powerful vision-language model for reasoning about the physical world [3]
- The two models work together so that robots can perform complex tasks such as waste sorting and luggage packing, supporting "think before acting" behavior and skill transfer across different robot forms [3]

Group 4: Kimi's New Agent Model
- Kimi launched a new agent model, "OK Computer," based on Kimi K2 and capable of complex tasks such as website building, PPT creation, and processing datasets with millions of rows [4]
- While running, the model maintains a Todo List progress report and autonomously searches the web, generates materials, and writes code, ultimately producing interactive and reusable results [4]
- For design tasks it plans and implements functionality on its own; for analysis tasks it collects data automatically, produces visual charts, and supports a range of content outputs and edits [4]

Group 5: Tencent's 3D Component Generation Model
- Tencent's Hunyuan 3D team introduced what it describes as the industry's first native 3D component generation model, Hunyuan3D-Part, consisting of the P3-SAM (3D segmentation) and X-Part (component generation) modules [5][6]
- The model generates high-quality, production-ready, structurally sound component-based 3D content, meeting the gaming and 3D printing industries' need for decomposable 3D shapes [6]
- It optimizes the whole pipeline from semantic feature and bounding box detection to part generation, significantly outperforms existing work on multiple benchmarks, and is open-sourced with an online demo portal [6]

Group 6: AI in Film Production
- The AI short film "Nine Skies," produced by Hong Kong's ManyMany Creations, was selected for the Busan International Film Festival's "Future Images" AI film summit [7]
- The summit showcased four other AI short films that use AI as a narrative tool to explore themes such as feminism and the "banality of evil," moving beyond mere technical demonstration [7]
- Bona Film Group established the first AI production center in China, using AI to cut film production cycles from several years to 1.5-2 years while significantly lowering costs [7]

Group 7: Apple's MCP Support
- Developer beta code for Apple's iOS 26.1, iPadOS 26.1, and macOS Tahoe 26.1 indicates that MCP support is coming to App Intents, which would allow AI models such as ChatGPT and Claude to interact directly with applications on Apple devices [8]
- MCP (Model Context Protocol), proposed by Anthropic, serves as a "universal interface" through which AI models communicate securely with external services, and has already been adopted by Notion, Google, Figma, and OpenAI [8]
- Apple is building system-level support for MCP rather than leaving it to individual applications, reflecting a strategic shift from "fully self-developed" to platform-oriented [8]

Group 8: Project Imaging-X
- Project Imaging-X, initiated by Shanghai AI Lab and other institutions, systematically reviews more than 1,000 medical imaging datasets from 2000 to 2025 and finds the medical data landscape fragmented and specialized [9]
- The research shows a large gap in data volume between medical imaging and general vision, with pathology data dominating and classification and segmentation the predominant tasks [9]
- The project proposes a metadata-driven fusion paradigm (MDFP) that integrates datasets in four phases: metadata unification, semantic alignment, fusion blueprint, and index sharing. It also provides an interactive data discovery portal to support the development of medical foundation models [9]

Group 9: Sequoia's AI Productivity Paradox
- Sequoia's latest research describes a "GenAI gap": only 5% of companies derive significant value from AI, while 95% fail to benefit because of static tools and disconnected processes [10]
- The study identifies three main reasons for AI failures in enterprises: AI tools cannot learn from user feedback, 95% of custom AI solutions fail to scale from pilot to deployment, and a "shadow AI economy" emerges as employees turn to personal AI services [10]
- AI is replacing junior positions (ages 22-25) at scale. It mainly replaces "book knowledge," while expert experience becomes a new competitive advantage [10]
Hands-On with Kimi's New Agent Model "OK Computer": It's Quite OK
QbitAI · 2025-09-27 01:30
Core Viewpoint
- Kimi has launched a new Agent model named OK Computer, which shows advanced capabilities in web development, data processing, and content generation [1][4][6].

Group 1: Design Tasks
- The new Agent can autonomously create a Pygame-themed webpage with sections on the history of Pygame, game showcases, core features, and development tutorials, demonstrating that it can design and implement content independently [9][10][12].
- The model generates a Todo List to track task progress, marking completed items so that users can monitor the workflow [16].
- It autonomously searches the web and generates the materials needed for the webpage, showing its self-sufficiency in the design process [17].

Group 2: Generation Tasks
- The Agent was asked to write a children's story and turn it into a picture book, which involved story writing, image generation, and audio production and highlights its multi-modal content creation capabilities [20][21].
- It also produced an editable PowerPoint presentation on China's top ten original musicals, demonstrating its proficiency with presentation materials [22][24][26].

Group 3: Analysis Tasks
- The Agent handles data analysis tasks by searching for financial data and visualizing it, relieving users of the data collection and analysis work [29][30].
- It can also analyze lengthy Excel documents and present the data clearly, indicating that it manages complex data sets effectively [31][32].
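Rebuilding the Supply Chain with AI: JD.com Releases Large-Model Deployment Results; Sci-Tech Innovation AI ETF (588790) Pulls Back More Than 1%
Sina Finance (Xin Lang Cai Jing) · 2025-09-26 02:22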
Core Insights
- The overall AI penetration rate in China remains relatively low, indicating significant growth potential for the industry as policies are implemented and model capabilities continue to evolve [5]
- The market is currently focused on AI-related hardware. Downstream applications lack breakout products and clear business models, so company performance has limited visibility [5]
- The Sci-Tech Innovation AI ETF (588790) has grown significantly in both scale and shares, reflecting strong investor interest in the AI sector [6]

Event Updates
- JD.com has entered the "mass production phase" of AI applications, launching three major AI products and showcasing four application scenarios at its global technology conference [4]
- China has added three U.S. companies to its export control list and another three to its unreliable entity list, indicating ongoing regulatory tensions [4]

ETF Performance
- The Sci-Tech Innovation AI ETF has recently declined 1.72%, with a weekly increase of 4.19% as of September 25, 2025 [3]
- The ETF has a turnover rate of 2.43% and a transaction volume of 171 million yuan, ranking first among comparable funds by average monthly trading volume [3]

Industry Developments
- Over 70% of China's state-owned energy enterprises have integrated Alibaba's AI technology, covering sectors including electricity, oil, and coal [4]
- Intel showcased its next-generation Xeon processors at the 2025 Cloud Summit, using new manufacturing processes and advanced packaging technologies to improve performance and efficiency [4]

Market Composition
- The top ten weighted stocks in the Sci-Tech Innovation AI Index account for 71.66% of the index, with companies such as Cambricon and Lanke Technology leading the list [7]
- The ETF closely tracks the Sci-Tech Innovation AI Index, which includes 30 large-cap companies providing essential resources and technology for the AI industry [6]
Kimi Tests New Agent Mode OK Computer
Beijing Business Today (Bei Jing Shang Bao) · 2025-09-25 14:11
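Beijing Business Today (reporter Wei Wei): On September 25, Moonshot AI's Kimi began a grayscale (limited-rollout) test of its new Agent mode, OK Computer. OK Computer continues the "model as Agent" philosophy, further improving agent and tool-calling capabilities through end-to-end training of the Kimi K2 model. After a user submits a request, Kimi can operate its own virtual computer to complete complex tasks such as multi-function website development, large-scale data analysis, image and video generation, and high-quality PPT creation. Users who have previously tipped Kimi will be the first to receive access. ...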
Kimi Releases New Agent Mode OK Computer
Sina Finance (Xin Lang Cai Jing) · 2025-09-25 08:04
Core Insights
- Moonshot AI (月之暗面) has launched a new Agent mode called "OK Computer" and begun a grayscale (limited-rollout) test [1]
- "OK Computer" continues the "model as agent" philosophy, enhancing the capabilities of the Kimi K2 model through end-to-end training [1]
- After a user issues a request, Kimi operates its own virtual computer to perform complex tasks such as multi-function website development, large-scale data analysis, image and video generation, and high-quality PPT creation [1]
- Users who have previously tipped Kimi will be the first to receive access [1]