Measuring the performance of our models on real-world tasks
来自 OpenAI News
· 2025-09-25
精选
OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.