# OpenAI o3
Here's a detailed comparison of OpenAI's o3 and o3-mini models, highlighting their capabilities, pricing, and access requirements:
| Feature | o3 | o3-mini |
|---|---|---|
| Release Date | April 16, 2025 | January 31, 2025 |
| Model Type | Flagship reasoning model with advanced capabilities in coding, math, science, and vision | Cost-efficient reasoning model optimized for STEM tasks; does not support vision |
| Context Window | Up to 200,000 tokens | Up to 200,000 tokens |
| Output Limit | Up to 100,000 tokens per response | Up to 100,000 tokens per response |
| Pricing (per 1M tokens) | Input: $10.00<br>Cached input: $2.50<br>Output: $40.00 | Input: $1.10<br>Cached input: $0.275<br>Output: $4.40 |
| Reasoning Effort Levels | Fixed reasoning effort | Selectable levels: low, medium, high |
| Supported Features | Web browsing<br>Image generation and visual understanding<br>Function calling<br>Structured outputs<br>Streaming responses<br>Developer messages | Function calling<br>Structured outputs<br>Streaming responses<br>Developer messages<br>Search integration for web-sourced answers |
| API Access | Available via the Responses API | Available via the Chat Completions, Assistants, and Batch APIs (see the example below the table) |
| Access Requirements | API usage tiers 4 and 5<br>Tiers 1–3 can gain access through API organization verification | API usage tiers 1 through 5<br>Available to ChatGPT Plus, Team, and Pro users<br>Limited access for free ChatGPT users |
| Performance Benchmarks | GPQA Diamond: 87.7% accuracy<br>Codeforces Elo rating: 2727<br>ARC-AGI benchmark: 87.5% accuracy | AIME 2024 (high effort): 83.6% accuracy<br>GPQA Diamond (high effort): 77.0% accuracy<br>Codeforces Elo rating (high effort): 2073<br>SWE-bench Verified (high effort): 48.9% accuracy |
| Safety Features | Incorporates deliberative alignment for enhanced safety and adherence to human-written safety specifications | Uses deliberative alignment; shows significant improvements over previous models in safety and jailbreak evaluations |
| Latency | Higher latency due to complex reasoning | Approximately 24% faster than o1-mini, with an average response time of 7.7 seconds at medium reasoning effort |
| Use Cases | Tasks requiring deep reasoning, complex problem-solving, and multimodal understanding, including visual tasks | STEM tasks such as math, coding, and science, where cost-efficiency and speed are prioritized over multimodal capabilities |
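To make the API access and reasoning-effort rows concrete, here is a minimal sketch that calls o3 through the Responses API and o3-mini through the Chat Completions API with a selectable reasoning effort. It assumes the official `openai` Python SDK (v1.x), an `OPENAI_API_KEY` environment variable, and an account tier that grants access to both models; the prompts are placeholders.

```python
# Minimal sketch (not production code): assumes the official `openai` Python SDK
# and that your usage tier grants access to the models listed in the table.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# o3 is accessed via the Responses API.
o3_response = client.responses.create(
    model="o3",
    input="Summarize the key differences between a mutex and a semaphore.",
)
print(o3_response.output_text)

# o3-mini is accessed via the Chat Completions API and accepts a selectable
# reasoning effort level: "low", "medium", or "high".
o3_mini_completion = client.chat.completions.create(
    model="o3-mini",
    reasoning_effort="high",
    messages=[
        {"role": "developer", "content": "You are a concise math tutor."},
        {"role": "user", "content": "Prove that the sum of two even integers is even."},
    ],
)
print(o3_mini_completion.choices[0].message.content)
```

Note that o-series models take "developer" messages in place of the traditional system role, which is the "Developer messages" feature listed in the table.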
Notes:

- **Cached input tokens:** These are repeated inputs that the model has previously processed, allowing for reduced costs.
- **Batch API discount:** Using the Batch API can provide up to a 50% discount on input and output token costs (see the cost sketch below).
- **Access tiers:** Access to these models may depend on your API usage tier. o3 is available to users in tiers 4 and 5, while o3-mini is accessible to tiers 1 through 5.
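To make the pricing notes concrete, the back-of-the-envelope calculator below applies the per-1M-token prices from the table. The token counts are hypothetical, and it assumes (simplistically) that the Batch API discount applies uniformly to the whole token bill; actual billing rules may differ.

```python
# Back-of-the-envelope cost estimate in USD, using the per-1M-token prices
# listed in the comparison table. Token counts below are hypothetical.

PRICES = {
    "o3":      {"input": 10.00, "cached_input": 2.50,  "output": 40.00},
    "o3-mini": {"input": 1.10,  "cached_input": 0.275, "output": 4.40},
}

def estimate_cost(model: str, input_tokens: int, cached_tokens: int,
                  output_tokens: int, batch: bool = False) -> float:
    """Estimate request cost; `cached_tokens` is the cached portion of the input."""
    p = PRICES[model]
    uncached = max(input_tokens - cached_tokens, 0)
    cost = (uncached * p["input"]
            + cached_tokens * p["cached_input"]
            + output_tokens * p["output"]) / 1_000_000
    # Simplifying assumption: the Batch API's up-to-50% discount is applied to
    # the full token bill for this estimate.
    return cost * 0.5 if batch else cost

# Example: 50k input tokens (20k of them cached) and 10k output tokens.
for model in ("o3", "o3-mini"):
    print(f"{model}: ${estimate_cost(model, 50_000, 20_000, 10_000):.4f}")
print(f"o3-mini via Batch API: "
      f"${estimate_cost('o3-mini', 50_000, 20_000, 10_000, batch=True):.4f}")
```

With these example counts, o3-mini comes out roughly an order of magnitude cheaper than o3, which matches the per-token price ratio in the table.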
For more detailed information and to access these models, visit the [OpenAI API Pricing page](https://openai.com/api/pricing/).