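Musk's Grok-4 crushes GPT-5 at selling goods and generating revenue! AI sales leaderboard revealed: is the endpoint of AGI selling potato chips?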
Sou Hu Cai Jing·2025-08-22 09:56

Core Insights
- The article discusses how AI models perform on a benchmark called "Vending Bench," in which Elon Musk's Grok-4 outperformed OpenAI's GPT-5 at selling goods, moving nearly twice as many units and earning about 31% more revenue [2][4].

Group 1: Performance Metrics
- Grok-4 reached a net worth of $4,694.15 with 4,569 units sold, while GPT-5 reached a net worth of $3,578.90 with 2,471 units sold [3][6].
- Grok-4 sold roughly $1,100 more in goods than GPT-5, while also showing greater stability [2][4].
- Vending Bench evaluates an AI's ability to manage long-term business tasks, and Grok-4 led on both wealth creation and units sold [5][15].

Group 2: Benchmarking and AI Capabilities
- Vending Bench assesses AI agents that run a vending machine business, requiring them to make long-term decisions that affect future outcomes [15][21].
- The benchmark highlights the difficulty of maintaining consistent performance over extended periods, which is crucial for real-world AI applications [30][31].
- The results show significant performance variability across AI models, with some, such as Claude 3.5 Sonnet, outperforming others in long-term asset accumulation [25][39].
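
The headline comparison under Performance Metrics can be checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the reported net worth figures are plain US dollar amounts, and the dictionary keys and the `compare` helper are hypothetical names introduced here, not part of Vending Bench.

```python
# Figures as reported in the summary (assumption: net worth is in plain US dollars).
grok4 = {"net_worth": 4694.15, "units_sold": 4569}
gpt5 = {"net_worth": 3578.90, "units_sold": 2471}


def compare(a, b):
    """Hypothetical helper: how far model `a` is ahead of model `b`."""
    return {
        # Absolute gap in net worth, in dollars.
        "net_worth_gap": round(a["net_worth"] - b["net_worth"], 2),
        # Relative gap in net worth, as a percentage of b's net worth.
        "net_worth_increase_pct": round((a["net_worth"] / b["net_worth"] - 1) * 100, 1),
        # Ratio of units sold (2.0 would mean exactly double).
        "units_ratio": round(a["units_sold"] / b["units_sold"], 2),
    }


result = compare(grok4, gpt5)
print(result)
```

Under these assumptions the net worth gap comes to $1,115.25, about 31% above GPT-5, and the unit ratio is about 1.85. Both are in line with the summary's "roughly $1,100 more" and "approximately double" wording.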