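News Flash | Lord of the Rings-Scale Text Throughput: The Secret Behind the Efficiency Breakthrough of Google's Newly Released Gemini 2.5 Pro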

Core Insights
- Google has launched Gemini 2.5, its next-generation family of AI reasoning models, which go through a "thinking" process before answering a question [1]
- The new family includes Gemini 2.5 Pro Experimental, described as Google's smartest model to date. It is available in Google AI Studio and to subscribers of the $20-per-month Gemini Advanced plan [2]
- All future Google AI models will have reasoning capabilities built in, following the trend started by OpenAI's o1 model in September 2024 [3]

Performance Metrics
- On the Aider Polyglot code-editing evaluation, Gemini 2.5 Pro scored 68.6%, outperforming the top models from OpenAI, Anthropic, and DeepSeek [4]
- On SWE-bench Verified, Gemini 2.5 Pro scored 63.8%, ahead of OpenAI's o3-mini and DeepSeek's R1 but behind Anthropic's Claude 3.7 Sonnet, which scored 70.3% [4]
- On Humanity's Last Exam, Gemini 2.5 Pro scored 18.8%, outperforming most competitors' flagship models [4]

Technical Specifications
- Gemini 2.5 Pro has a context window of 1 million tokens, enough to process approximately 750,000 words at once, which is more than the entire Lord of the Rings series [5]
- Google plans to double the input limit to 2 million tokens soon [5]

Future Developments
- Google has not disclosed API pricing for Gemini 2.5 Pro but plans to share more information in the coming weeks [6]
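
The context-window figures above imply a ratio of roughly 0.75 words per token. As a rough sketch, the same ratio can be applied to the planned 2-million-token window. The ratio is inferred only from the article's numbers, and the helper name `approx_words` is made up for this illustration. Real ratios vary by tokenizer and language.

```python
# Assumption: words-per-token ratio inferred from the article's figures
# (1,000,000 tokens ~ 750,000 words). This is not an official constant.
WORDS_PER_TOKEN = 750_000 / 1_000_000


def approx_words(tokens: int) -> int:
    """Rough word capacity for a context window of `tokens` tokens."""
    return round(tokens * WORDS_PER_TOKEN)


current_window = approx_words(1_000_000)  # window size reported in the article
planned_window = approx_words(2_000_000)  # planned size, same assumed ratio
print(current_window, planned_window)
```

Under this assumption, the doubled window would hold on the order of 1.5 million words. That is an estimate, not a figure Google has published.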