Workflow
《我的世界》(Minecraft)
icon
Search documents
Meta公司28岁亿万富豪:下一个比尔·盖茨将在这类少年中产生
财富FORTUNE· 2025-10-16 13:06
Core Insights - Wang Tao, at 28, leads one of Silicon Valley's most ambitious AI projects after becoming the youngest self-made billionaire at 24, emphasizing the importance of programming for the Alpha generation [1][2] - He believes that all code written by engineers will soon be replaced by AI-generated code, urging a shift in focus from traditional programming to mastering AI tools [2][4] - Wang compares the current moment to the pre-PC revolution, suggesting that those who invest time in learning AI tools will gain a significant advantage in the future economy [4][5] Company Ambitions - Meta's infrastructure, scale, and product distribution capabilities are described as unparalleled, with a business model capable of supporting the construction of multi-billion dollar computing systems [5][6] - The lab is intentionally kept small, with a focus on high talent density, aiming to outperform larger competitors [6] - The lab is structured around three pillars: research, product, and infrastructure, with the goal of achieving superintelligence through advanced models [6][7] Programming Evolution - Ambient programming, which allows users to generate and iterate code through natural language prompts, is gaining traction among Silicon Valley executives [8][9] - This approach is seen as a cultural mission for the future, emphasizing the importance of intuitive experience gained from challenging AI tools rather than just the code itself [9][10] - The role of engineers is evolving significantly, reflecting a shift in how programming is approached in the industry [10]
梦里啥都有?谷歌新世界模型纯靠「想象」训练,学会了在《我的世界》里挖钻石
机器之心· 2025-10-02 01:30
Core Insights - Google DeepMind's Dreamer 4 supports the idea that agents can learn skills for interacting with the physical world through imagination without direct interaction [2][4] - Dreamer 4 is the first agent to obtain diamonds in the challenging game Minecraft solely from standard offline datasets, demonstrating significant advancements in offline learning [7][21] Group 1: World Model and Training - World models enable agents to understand the world deeply and select successful actions by predicting future outcomes from their perspective [4] - Dreamer 4 utilizes a novel shortcut forcing objective and an efficient Transformer architecture to accurately learn complex object interactions while allowing real-time human interaction on a single GPU [11][19] - The model can be trained on large amounts of unlabeled video data, requiring only a small amount of action-paired video, opening possibilities for learning general world knowledge from diverse online videos [13] Group 2: Experimental Results - In the offline diamond challenge, Dreamer 4 significantly outperformed OpenAI's offline agent VPT15, achieving success with 100 times less data [22] - Dreamer 4's performance in acquiring key items and the time taken to obtain them surpassed behavior cloning methods, indicating that world model representations are superior for decision-making [24] - The agent demonstrated a high success rate in various tasks, achieving 14 out of 16 successful interactions in the Minecraft environment, showcasing its robust capabilities [29] Group 3: Action Generation - Dreamer 4 achieved a PSNR of 53% and SSIM of 75% with only 10 hours of action training, indicating that the world model absorbs most knowledge from unlabeled videos with minimal action data [32]
港科广×腾讯联手打造《我的世界》神操作,400张截图就能让AI挖矿通关,成本降至5%|EMNLP 2025
量子位· 2025-09-04 04:41
Core Insights - The article presents the innovative VistaWise framework developed by a joint team from Hong Kong University of Science and Technology (Guangzhou) and Tencent, which aims to enhance the capabilities of AI in complex open-world environments like Minecraft [2][6]. Group 1: Framework Overview - VistaWise integrates "cross-modal knowledge graphs" and "lightweight visual fine-tuning" to enable AI agents to operate effectively in open-world scenarios [3][6]. - The framework achieved a 33% success rate in the "diamond acquisition" task, surpassing previous state-of-the-art (SOTA) methods by 8 percentage points, with all nine sub-tasks exceeding a 73% success rate [4][18]. Group 2: Methodology - The research team utilized only 471 game screenshots and a consumer-grade GPU with 24 GB VRAM for visual model fine-tuning, significantly reducing training costs and complexity [6][17]. - A lightweight knowledge graph was constructed from text guides and encyclopedic knowledge, which was injected into the large model to minimize hallucinations [7][11]. - The "retrieval-based pooling" mechanism allows the model to quickly access task-relevant information, enhancing efficiency [13]. Group 3: Performance Metrics - VistaWise's training data volume was reduced by five orders of magnitude (471 vs. 160 million frames), and GPU memory requirements decreased by 87.5% (24 GB vs. 192 GB) [18]. - Compared to multi-modal large models, VistaWise's approach resulted in a 30.7% reduction in token usage while maintaining performance levels [18]. Group 4: Decision-Making Process - The decision-making cycle of VistaWise consists of four steps: perception, retrieval, reasoning, and execution [15][20]. - The system operates entirely on a local setup with an 8 GB GPU during the inference phase, demonstrating its efficiency and accessibility [17].
《我的世界》成为AI新「考场」?高三生用游戏评测AI:DeepSeek-R1位列第三
3 6 Ke· 2025-03-25 12:45
Core Insights - A high school student, Adi Singh, has developed a new AI evaluation benchmark called MC-Bench, utilizing the game Minecraft to assess AI models' capabilities in a more intuitive manner [1][2][10] - Traditional standardized tests often give AI models an unfair advantage, as they are optimized for specific tasks, leading to discrepancies in real-world performance [2][8] - MC-Bench allows users to vote on AI-generated architectural designs in Minecraft, providing a crowdsourced method for evaluating AI performance [5][9] Group 1: MC-Bench Overview - MC-Bench is designed to evaluate AI models by having them create structures in Minecraft based on user prompts, such as "a crystal-clear wine glass filled with deep red wine" [2][5] - The evaluation process involves user voting to select the best creations, with results revealed only after voting concludes [5][10] - The project has garnered attention from major AI companies like OpenAI, Google, and Anthropic, which provide computational resources but are not officially collaborating [10][13] Group 2: Advantages of Game-Based Evaluation - Minecraft serves as a familiar and visually engaging platform, making it easier for the general public to understand and participate in AI assessments [7][8] - The game environment allows for a controlled testing space, enabling the evaluation of AI's reasoning and planning abilities in a safe manner [7][8] - Game-based assessments can simulate real-world complexities, test AI's decision-making skills, and provide a repeatable environment for comparison [7][8] Group 3: Current Status and Future Plans - As of now, MC-Bench primarily tests basic construction abilities of AI models, tracking their progress since the GPT-3 era [10][16] - Future plans include expanding the benchmark to more complex tasks that require long-term planning and goal-oriented actions [10][16] - The leaderboard of MC-Bench shows that Claude 3.7 Sonnet ranks first, while DeepSeek-R1 is currently in third place, indicating the platform's effectiveness in reflecting user experiences with these models [14][16]