Open Source Model

The Industry Reacts to gpt-oss!
Matthew Berman· 2025-08-06 19:22
Wow, I thought the open-source model from OpenAI was going to be popular, but it really struck a chord in the industry. Let me break down all of the industry reactions for you right now. First, of course, Sam Altman's tweet: "gpt-oss is out. We made an open model that performs at the level of o4-mini and runs on a high-end laptop (WTF) and a smaller one that runs on a phone," which is just crazy to think about. "Super proud of the team. Big triumph of technology." I'm so happy that they put all of this research and ef ...
X @Sam Altman
Sam Altman· 2025-08-05 17:03
Model Release
- The company released gpt-oss, an open-source model [1]
- The model performs at the level of o4-mini [1]
- The model can run on a high-end laptop [1]
- A smaller version of the model can run on a phone [1]
We're in an AI gold rush right now. The gap between what models can do vs what products exist is mas
Garry Tan· 2025-06-21 23:13
Model Capabilities & Product Development
- Current products lag significantly behind the capabilities of the latest models, which leaves substantial room for innovation [2]
- Even without further model improvements, a vast number of new products remain to be built [2]

Pricing & Performance
- The cost of the o3 model decreased fivefold within a week, suggesting a rapid decline in price per performance [2]
- The industry anticipates a significant drop in price per performance [3]

Open Source Initiatives
- An open-source model is soon to be released and is expected to surpass current expectations [3]
- The open-source model will let users run powerful models locally, with capabilities that are expected to surprise them [3]
Alibaba's "Tongyi Qianwen" (Qwen) Becomes a Foundation for AI Development in Japan
Nikkei Chinese Edition (日经中文网)· 2025-05-07 02:45
Core Insights
- Alibaba Cloud's AI model "Qwen" ranks 6th among 113 models in the "AI Model Scoring" list published by Nikkei, surpassing China's DeepSeek model [1][3]
- The open-source nature of Qwen has led to its adoption by various emerging companies in Japan, including ABEJA, which developed the "QwQ-32B Reasoning Model" based on Qwen [3][4]
- Qwen's performance in logical reasoning and mathematics has been highlighted, showcasing its capabilities beyond basic language skills [3]

Group 1: Model Performance and Adoption
- Qwen's "Qwen2.5-Max" model ranks 6th in a comprehensive performance evaluation conducted by NIKKEI Digital Governance, demonstrating strong performance in grammar, logical reasoning, and mathematics [3]
- The open-source model "Qwen2.5-32B" ranks 26th, outperforming Google's "Gemma-3-27B" and Meta's "Llama-3-70B-Instruct" [3]
- Japanese companies are increasingly utilizing Qwen, with ABEJA's model based on Qwen ranking 21st overall [3][4]

Group 2: Global Recognition and Future Plans
- Qwen has gained significant attention outside Japan, with over 100,000 derivative models developed on the "Hugging Face" platform [5]
- Alibaba Cloud is considering providing debugging and customization services for Japanese companies, allowing them to utilize Qwen without transferring data overseas [5]
- Alibaba Cloud aims to increase the number of projects using Qwen in Japan to over 1,000 within three years [6]

Group 3: Research and Evaluation Methodology
- The AI model scoring evaluation involved over 6,000 questions across 15 categories, assessing language ability and ethical considerations [7]
- The evaluation was conducted in collaboration with Weights & Biases, focusing on models' performance in Japanese [7]