measuring-the-performance-of-our-models-on-real-world-tasks.log
|src: openai.com

Measuring the performance of our models on real-world tasks

OpenAI introduces GDPval, a new evaluation that measures model performance on real-world economically valuable tasks across 44 occupations.