
Google DeepMind unveils SimpleQA Verified, a new LLM factuality benchmark
Google DeepMind has launched SimpleQA Verified, a benchmark that measures how accurately AI models answer short, fact-seeking questions. It consists of 1,000 challenging questions spanning a range of topics, and an improved automated grader, itself an AI model, checks every answer. The latest models, such as Gemini 3 Pro, score highest, though some models still struggle with numerical answers or with particular topics. A public leaderboard lets anyone compare how the models perform. The benchmark is fast to run, openly available, and helps show whether a model really knows its facts.