Curator

OpenAI News

The OpenAI blog

https://openai.com/news →
Why we no longer evaluate SWE-bench Verified

2026-02-23

SWE-bench Verified is increasingly contaminated and mismeasures frontier coding progress. Our analysis shows flawed tests and training leakage. We recommend SWE-bench Pro.

Our First Proof submissions

2026-02-20

We share our AI model’s proof attempts for the First Proof math challenge, testing research-grade reasoning on expert-level problems.

Introducing OpenAI for India

2026-02-19

OpenAI for India expands AI access across the country—building local infrastructure, powering enterprises, and advancing workforce skills.

Introducing EVMbench

2026-02-18

OpenAI and Paradigm introduce EVMbench, a benchmark evaluating AI agents’ ability to detect, patch, and exploit high-severity smart contract vulnerabilities.

Scaling social science research

2026-02-13

GABRIEL is a new open-source toolkit from OpenAI that uses GPT to turn qualitative text and images into quantitative data, helping social scientists analyze research at scale.

Introducing GPT-5.3-Codex-Spark

2026-02-12

Introducing GPT-5.3-Codex-Spark—our first real-time coding model. 15x faster generation, 128k context, now in research preview for ChatGPT Pro users.

Bringing ChatGPT to GenAI.mil

2026-02-09

OpenAI for Government announces the deployment of a custom ChatGPT on GenAI.mil, bringing secure, safety-forward AI to U.S. defense teams.

Introducing Trusted Access for Cyber

2026-02-05

OpenAI introduces Trusted Access for Cyber, a trust-based framework that expands access to frontier cyber capabilities while strengthening safeguards against misuse.

Introducing OpenAI Frontier

2026-02-05

OpenAI Frontier is an enterprise platform for building, deploying, and managing AI agents with shared context, onboarding, permissions, and governance.