Workflow
FACTS Benchmark Suite
icon
Search documents
X @Demis Hassabis
Demis Hassabis· 2025-12-15 13:36
RT Google DeepMind (@GoogleDeepMind)We’ve developed the FACTS Benchmark Suite with @GoogleResearch. 📊It’s the industry’s first comprehensive test evaluating LLM factuality across four dimensions: internal model knowledge, web search, grounding, and multimodal inputs. https://t.co/hmc8ePBsP9 ...