Workflow
模型迭代
icon
Search documents
26天倒计时:OpenAI即将关停GPT-4.5Preview API
3 6 Ke· 2025-06-18 07:34
Core Insights - OpenAI announced the removal of the GPT-4.5 Preview API effective July 14, which will impact developers who have integrated it into their products [2][3] - The removal was planned since the release of GPT-4.1 in April, and GPT-4.5 was always considered an experimental product [5] - OpenAI is focusing on promoting more scalable and cost-effective models, as evidenced by the recent 80% price reduction of the o3 API [8] Pricing and Cost Considerations - The pricing for GPT-4.5 API was significantly high at $75 per million input tokens and $150 per million output tokens, making it commercially unviable [6] - The cost of NVIDIA H100 GPUs, approximately $25,000, and their high power consumption further complicate the financial feasibility of maintaining such models [6] Strategic Implications - The rapid exit of GPT-4.5 highlights the challenges of model iteration speed and external computing costs as critical factors for OpenAI's business model [11] - OpenAI's strategy appears to be consolidating resources towards models that offer better scalability and cost control, while discontinuing less successful or ambiguous products [8]
AI算力大集群:继续Scaling
2025-06-15 16:03
Summary of Key Points from the Conference Call Industry Overview - The conference call focuses on the AI computing power industry, particularly the demand for AI computing clusters and the implications for major tech companies like Microsoft, Meta, and Amazon [1][2][3]. Core Insights and Arguments 1. **AI Computing Demand Trends**: There is a significant expected growth in AI computing demand, particularly in training and inference. The market has shown a discrepancy in expectations, especially before the earnings reports of major companies [2][3]. 2. **Optimistic Outlook for AI Computing Clusters**: The outlook for AI computing clusters is optimistic, with anticipated increases in inference demand in the first half of 2025 and training demand in the second half [1][3]. 3. **U.S.-China AI Development Gap**: The gap in AI development between the U.S. and China may widen, depending on the evolution of large model iterations over the next year. The U.S. is expected to continue advancing parameter optimization, while China may rely on software algorithm innovations [1][5][8]. 4. **Role of Clusters in AI Model Iteration**: Clusters play a crucial role in AI model iterations, especially for large-scale computational tasks. The emergence of technologies like DeepSpeed indicates a shift towards reduced dependency on large clusters [7][9]. 5. **Impact of DeepSpeed**: The introduction of DeepSpeed marks the end of the computing inflation logic and initiates a new deflation logic, reducing the overall reliance on large clusters [9][10]. 6. **Market Focus on Optical Interconnect Technology**: There has been a notable increase in market attention towards optical interconnect technologies and related companies due to the growing demand for large clusters [11][12]. 7. **Changes in Major Tech Companies' Cluster Needs**: Major tech companies have shifted their needs away from large clusters, with many opting for strategies that do not require significant investments in large-scale computing resources [12][24]. 8. **Future Model Iteration Paths**: The next year is expected to see a return to pre-training phases, which will require substantial computational resources. Different companies will adopt varied strategies for this transition [14][15]. 9. **Meta's Data Strategy**: Meta's strategy involves leveraging its vast data resources, but merely increasing data volume has not significantly improved model performance. The acquisition of Skillz AI aims to enhance data quality [16][18]. 10. **Challenges in Large-Scale Cluster Construction**: The construction of large clusters faces various bottlenecks, including data and storage walls, which require hardware upgrades or algorithm optimizations to overcome [32][37]. Other Important but Potentially Overlooked Content - **Market Expectations for 2025**: The A-share market is expected to experience fluctuations in AI computing, with downward expectations in the first half of 2025 and upward expectations in the second half, driven by actual demand and supply chain recovery [40]. - **Technological Innovations**: Innovations in communication technologies, such as Broadcom's "Fat Cat" technology, are crucial for enhancing data synchronization and load balancing in training processes [36]. - **Scalability Trends**: There is an anticipated increase in the demand for scale-up solutions, which enhance the computational capacity of individual nodes, as opposed to scale-out solutions [38][39]. This summary encapsulates the key points discussed in the conference call, highlighting the trends, challenges, and strategic directions within the AI computing power industry.