Core Viewpoint
- OpenAI has launched the GPT-5 model, which is now available to both free and paid users, enhancing capabilities in reasoning, coding, and writing [1][3][6].

Group 1: Model Features
- GPT-5 is an integrated model that automatically determines when to engage in deep thinking versus providing quick responses, eliminating the need for manual model switching [6][7].
- The model supports multi-modal inputs and outputs, including text, images, voice, and real-time video streams, allowing for interactive explanations and visualizations [7].
- It has achieved a SWE-Bench Verified score of 74.9%, generating over 200 lines of interactive code with audio elements in just a few minutes [7].

Group 2: Performance Metrics
- GPT-5 has the highest Arena score to date, ranking first in text, web development, and visual fields, as well as in high-difficulty prompts, programming, mathematics, creativity, and long queries [20][21].
- The model's hallucination rate has significantly decreased to 4.8% overall, with a low of 1.6% in medical scenarios, thanks to the introduction of a universal validator for self-checking [7].

Group 3: Competitive Landscape
- The rapid development of AI technologies is highlighted, with OpenAI's GPT-3.5 and GPT-4 models previously setting benchmarks in generative AI [14].
- Competitors like Google DeepMind's Genie 3 and Anthropic's Claude 4 have also made significant advancements, with Genie 3 capable of generating interactive 3D worlds in real-time [16][18].
- Elon Musk has noted that Grok 4 outperformed GPT-5 in specific evaluations, indicating a competitive landscape where multiple models are vying for superiority [22][24].
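Stumbled on GPT-5 in the middle of the night, and free users can try it too~ Yesterday's features haven't even been tried yet, and today they're already outdated~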
菜鸟教程 (Runoob) · 2025-08-08 01:56