Curator

Introducing SimpleQA

来自 OpenAI News · 2024-10-30

LLM推理模型评测

A factuality benchmark called SimpleQA that measures the ability for language models to answer short, fact-seeking questions.