VibeThinker
Search documents
啊?微博7800美元训的大模型,数学能力超了DeepSeek-R1
量子位· 2025-11-18 05:02
Core Insights - Weibo has launched its first self-developed open-source large model, VibeThinker, which has only 1.5 billion parameters but outperformed the much larger DeepSeek R1 model with 671 billion parameters in benchmark tests [1][7] - The cost of a single post-training session for VibeThinker is only $7,800, significantly lower than competitors like DeepSeek and MiniMax, which have costs in the hundreds of thousands [2][10] - This breakthrough may shift the AI industry focus from a "scale competition" to an "efficiency revolution" [3][9] Industry Disruption - The AI industry has traditionally viewed parameter count as the primary measure of model capability, with a belief that complex reasoning requires over 100 billion parameters [5][6] - VibeThinker challenges this notion by demonstrating that a smaller model can achieve superior performance through optimized model structure and training methods, specifically the "Spectrum to Signal Principle" (SSP) [7][8] - The model's performance in high-difficulty mathematical tests has garnered significant attention, with endorsements from platforms like HuggingFace [7] Cost Revolution - VibeThinker's training cost is a fraction of what is typical in the industry, with the total cost being approximately $7,800 for the entire post-training process [10][13] - This cost efficiency allows for broader access to advanced AI capabilities, enabling smaller companies and research institutions to participate in AI innovation [13] Application and Ecosystem Development - Weibo is actively integrating AI technology across various business scenarios, enhancing user experience and content production efficiency [15][20] - The company plans to leverage its unique data assets to create a model that better understands public sentiment and social needs [17][18] - VibeThinker is expected to drive multiple AI applications within Weibo, enhancing user experience and potentially creating a new "social super-ecosystem" [19][20]
微博自研VibeThinker开源模型:15亿参数超越千亿级对手,训练成本仅7800美元
Xin Lang Ke Ji· 2025-11-17 11:40
Core Insights - Weibo AI has introduced its self-developed open-source large model, VibeThinker, which has only 1.5 billion parameters but outperformed models with hundreds of times more parameters in benchmark tests [1][2][3] - The training cost for VibeThinker is only $7,800, significantly lower than competitors, indicating a shift from a "scale competition" to an "efficiency revolution" in the AI industry [1][5][6] Model Performance - VibeThinker achieved impressive results in high-difficulty mathematical tests, surpassing models like DeepSeek-R1 with 671 billion parameters and MiniMax-M1 with 456 billion parameters [2][3] - The model's performance in LiveCodeBench v6 matched or exceeded that of larger models, demonstrating the potential of smaller models in complex reasoning tasks [3] Cost Efficiency - The total training cost for VibeThinker was approximately $7,800, which is 30 to 60 times more cost-effective than other models that require hundreds of thousands of dollars for similar performance [6][7] - This cost advantage allows smaller companies and research institutions to participate in AI innovation, promoting a more inclusive AI research environment [7][8] Application and Ecosystem - Weibo is actively integrating AI technology across various business scenarios, launching features like Weibo Smart Search and AI Interaction Accounts to enhance user experience [8][9] - The development of VibeThinker marks a new phase in Weibo's AI strategy, focusing on leveraging unique data assets to create a model that better understands public sentiment and social needs [9][10] Future Prospects - VibeThinker is expected to drive the growth of Weibo's AI applications, enhancing user experience and potentially creating a new "social super-ecosystem" that combines social attributes with intelligent services [10][11] - The technological advancements of VibeThinker are anticipated to significantly reduce the operational costs of AI applications on the Weibo platform, allowing for scalable AI capabilities without excessive resource burdens [11]