陶哲轩用GPT5-Pro跨界挑战，3年无解的难题，11分钟出完整证明

Core Insights - The collaboration between Terence Tao and GPT-5 Pro successfully addressed a three-year-old unsolved problem in differential geometry, showcasing the potential of AI in academic research [1][10]. Group 1: Problem Solving Process - The original problem involved determining if a smooth topological sphere in three-dimensional space, with principal curvature absolute values not exceeding 1, encloses a volume at least equal to that of a unit sphere [3]. - Tao's initial approach was to restrict the problem to star-shaped regions and utilize integral inequalities, but he sought AI assistance for complex calculations [4]. - GPT-5 Pro completed all calculations in 11 minutes and 18 seconds, providing a complete proof for the star-shaped case using various inequalities, some of which Tao was familiar with, while others were new to him [5]. Group 2: AI's Performance Evaluation - AI demonstrated effectiveness in small-scale problems, contributing useful ideas and only minor errors, but it reinforced Tao's incorrect intuition on medium-scale strategies [11][12]. - In large-scale understanding, AI was beneficial in accelerating research and helping Tao abandon unsuitable methods [14]. - Tao's experience highlighted the necessity of human expertise for further advancements in complex problems, indicating that AI's role is more supportive than substitutive [11][16]. Group 3: Historical Context and Evolution of AI Tools - Tao's exploration of AI's potential in mathematics began with the release of ChatGPT, where initial interactions yielded disappointing results due to a lack of depth in understanding mathematical problems [21][22]. - The introduction of GPT-4 marked a turning point, as it significantly improved efficiency in handling statistical data and mathematical tasks, leading to a more optimistic view of AI's integration into research [22][29]. - Tao's ongoing experiments with AI tools have shown that while AI can assist in numerical searches and problem-solving, it still requires careful oversight to mitigate issues like hallucinations or irrelevant outputs [29][31].