OpenAI unveils GPT-5 with native multimodal reasoning and a 1M token context

OpenAI today announced GPT-5, its most capable model to date, in a livestream from San Francisco. The new system unifies what were previously separate reasoning, vision, and code models into a single API endpoint, and dramatically extends usable context to 1 million tokens.

What’s new

Native multimodal input across text, images, audio and video frames
Persistent memory that carries across sessions, opt-in per workspace
First-class agentic tool-use (browser, code, files) at the model layer
New `reasoning_effort` parameter that trades latency for depth

On internal benchmarks, OpenAI claims a 34% reduction in hallucination rate compared to GPT-4o on long-form factual tasks, and substantial improvements on SWE-bench Verified.

ℹ️ Note: Pricing starts at $5 per million input tokens and $15 per million output tokens, with a flat-rate Pro tier for unlimited reasoning.

Why it matters

GPT-5 collapses the toolchain. Developers who previously stitched together GPT-4o, o1, and Whisper now get one endpoint that decides, internally, how much to "think" before answering. That alone should shorten many AI application stacks by hundreds of lines of orchestration code.

GPT-5 is the first model where we feel comfortable putting it directly in front of users without a safety harness.

— Sam Altman, CEO of OpenAI

Developer access

The API is rolling out today to existing paid customers. ChatGPT Plus, Team and Enterprise users get access in the chat product within 48 hours.

from openai import OpenAI
client = OpenAI()
resp = client.responses.create(
    model="gpt-5",
    input="Plan a 7-day Tokyo itinerary for a vegetarian foodie.",
    reasoning_effort="medium"
)
print(resp.output_text)

Early access partners include Stripe, Shopify, Figma and Notion, all of whom have updated their AI features today.

What’s new

Why it matters

Developer access

Leave a Comment Cancel Reply