Workflow
《GTA》
icon
Search documents
谢赛宁也玩MC?开源全新世界模型生成多人一致的游戏视角
机器之心· 2026-03-07 04:20
Core Insights - The article discusses the significant role of video games in advancing AI technology, particularly in training AI to understand physical interactions and world models [1] - The research team led by Xie Saining is exploring new directions in world models using the game "Minecraft" [3] Group 1: Video World Model Development - The Solaris model is the first multiplayer video world model that generates consistent first-person views for multiple players [5] - The research team identified that existing video world models only handle single-player perspectives, which do not accurately reflect real-world interactions [7] - SolarisEngine, a custom-built data collection system, supports coordinated multi-agent interactions and visual capture in games like "Minecraft" [7][14] Group 2: Data Collection and Model Training - The team collected a dataset of 12.6 million frames from 9,240 task rounds, focusing on various tasks such as building, combat, and exploration [16] - The dataset is the first of its kind with action annotations suitable for training world models [17] - The model utilizes a combination of flow matching and diffusion forcing to predict future observations based on players' historical actions [19] Group 3: Model Architecture and Improvements - The model architecture includes enhancements such as expanded action space and multi-player self-attention layers to facilitate information exchange among players [20][22] - The improvements allow the model to generalize to any number of players, although it has currently been trained with data from two players [20] Group 4: Evaluation Metrics and Results - The Solaris Eval dataset was created to test five collaborative abilities, including movement, positioning, consistency, memory, and building capabilities [24][28] - The results indicate that the Solaris model outperforms previous methods in visual quality and quantitative metrics across various evaluation categories [27][29]
Genie 3 引发游戏股暴跌,但游戏的真正灵魂 AI 永远得不到
3 6 Ke· 2026-02-04 03:55
Core Viewpoint - The release of Google DeepMind's third-generation visual language model, Genie 3, has led to a significant drop in the stock prices of major gaming companies, raising concerns about the future of AAA game development [1][3]. Group 1: Market Reaction - Following the announcement of Genie 3, Unity's stock plummeted by over 24%, with other major companies like Take-Two, Nintendo, and CD Projekt Red also experiencing declines [1]. - The market's reaction is driven by the perception that Genie 3 can generate high-quality 3D game worlds quickly, potentially threatening traditional game development practices [3]. Group 2: Misconceptions about Game Development - The belief that visual detail generation equates to complete world-building is a misunderstanding; creating immersive game worlds requires more than just visual fidelity [4]. - Game development involves intricate world-building that cannot be replicated solely through AI-generated visuals, as the essence of a game's "life" comes from its depth and complexity [4][12]. Group 3: Limitations of Genie 3 - Genie 3 operates on a frame generation model that lacks the logical consistency and long-term memory required for a cohesive gaming experience, with a memory window of only one minute [8][10]. - The model's probabilistic nature means it cannot maintain the structured, deterministic environments that traditional game engines provide, leading to potential immersion-breaking inconsistencies [11][10]. Group 4: The Importance of IP and Emotional Connection - Players often value the emotional connection to game IPs, which cannot be generated by AI; successful IPs require time, consistency, and creator investment [22][26]. - The long-term development of iconic IPs, such as Mario and GTA, illustrates that emotional resonance and narrative depth are crucial for player engagement, which AI cannot replicate [22][23][27]. Group 5: Future of AI in Game Development - While AI tools like Genie 3 can enhance efficiency in game development, the creative direction and integration of AI-generated content will still rely on human designers [29][30]. - AI is positioned as a powerful tool for game developers, but the true artistry and cultural significance of games will continue to depend on human creativity and oversight [31].
“GTA之父”丹·豪瑟承认其个人新作使用AI,但作用有限
Sou Hu Cai Jing· 2025-11-24 14:10
Group 1 - The core viewpoint of the article is that the gaming industry has significant potential for innovation, particularly in creating "living narrative experiences," despite the commercial pressures that can influence artistic direction [3][4]. - Dan Houser, co-founder of Rockstar and current producer at Absurd Ventures, emphasizes the dual nature of new developments in the gaming industry, which can either lead to exciting advancements or become mere cash cows [3]. - The new project from Absurd Ventures, titled "Absurdaverse," integrates both gaming and novel storytelling, showcasing a unified worldview while narrating different stories [4]. Group 2 - Houser indicates that their studio's new game will take several years to complete and highlights ongoing efforts to incorporate AI into gaming [6]. - He expresses skepticism about the current capabilities of AI, stating that it does not perform as well as large companies claim and cannot solve all problems [6]. - Houser notes that while AI can excel in certain tasks, many challenges remain, and the true potential of AI in gaming will require time to fully realize [6].
Take-Two CEO 泽尔尼克:AI 不可能生成一款堪比《GTA》的游戏
Sou Hu Cai Jing· 2025-10-29 23:40
Core Insights - Take-Two's CEO Strauss Zelnick stated that generative AI currently has a limited impact on the development of large games like Grand Theft Auto due to its fundamental lack of creativity [1][3] - Zelnick acknowledged that while AI has entered the game production process, its role in enhancing development efficiency remains quite limited [1] Group 1 - Zelnick does not deny the value of AI but believes that the changes it brings to production processes are not as significant as many perceive [3] - One reason for this limitation is intellectual property issues, while a deeper barrier is that AI is inherently "backward-looking," relying on vast databases of historical data [3] - Zelnick emphasized that even if AI were unrestricted, it would not be possible to generate a game comparable to GTA with a complete marketing plan at the push of a button [3] Group 2 - The essence of AI is a combination of large datasets, significant computing power, and large language models, which are fundamentally retrospective in nature [3] - AI can assist in areas that rely on historical data, but it is incapable of true creativity, which is essential in certain business aspects of Take-Two [3] - Take-Two is committed to creating long-lasting series, with Grand Theft Auto being a prime example, showcasing the creativity of its team and Rockstar Games' pursuit of perfection [3]