AI is dumber than you think

15 November 2024

OpenAI recently introduced SimpleQA, a new benchmark for evaluating the factual accuracy of large language models (LLMs) that underpin generative AI (genAI). Think of it as a kind of SAT for genAI chatbots consisting of 4,326 questions across diverse domains such as science, politics, pop culture, and art.

Source: ComputerWorld

Date:

15 November 2024

Categorie(s):

NEWS