Introducing SimpleQA
From OpenAI News
· 2024-10-30
SimpleQA is a factuality benchmark that measures the ability of language models to answer short, fact-seeking questions.