X @Demis Hassabis - Reportify

Model Performance
- Gemini 2.5 Deep Think achieves state-of-the-art performance across challenging benchmarks [1]
- The model excels on LiveCodeBench V6, a benchmark that evaluates competitive coding performance [1]
- The model demonstrates expertise in various domains, including science, as measured by Humanity's Last Exam [1]

Technology & Innovation
- Google DeepMind highlights how Gemini 2.5 Deep Think compares with other models when evaluated without tool use [1]