🧮
AI Model Scoreboard
Independent rankings
SCORESMETHODOLOGY
AI Model Scoreboard
© 2026 · Evidence-first · Open metrics · Community supported
SupportDonateChangelogHealth: Checking

Data is aggregated from public sources; please verify before making critical decisions.

AI Model Scoreboard is informational only and does not provide investment, compliance, or security advice.

← Back to v4

MODEL: openai OpenAI: o3 Deep Research

Provider: openaiSource: fullmodelKey: openai/o3-deep-research-2025-06-26
Overall
74.0 / 100
Updated Jan 26

Status

adopted

Reasons (from decisions.json):

No published decision record.

A) Absolute Metrics (must exist for all models)

Context length
200,000
Max output tokens
Missing
Missing max output tokens => affects score: performance signals reduced.
Pricing (per 1M tokens)
in: USD 0.00 out: USD 0.00
Modalities
text+image+file->text
Tool / JSON support
Missing
Missing tool/JSON support => affects score: tooling capability signals reduced.
Training cutoff (evidence)
Missing
Missing training cutoff => affects score: openness signals reduced. status: ok.
Release date (evidence)
2025-10-10T20:54:21.000Z (status: ok)

Score Summary

Overall: 74 / 100

Category Scores (0–100)

  • performance73
  • safety80
  • adoption66
  • openness58
  • cost100

Top drivers

  • evidence_audit_missing_source_link — Evidence indicates missing/failed; penalty applied per policy. missing_evidence — Evidence indicates missing/failed; penalty applied per policy.
  • missing_minor_incidents — Evidence indicates missing/failed; penalty applied per policy.
  • missing_major_incidents — Evidence indicates missing/failed; penalty applied per policy.
  • missing_critical_incidents — Evidence indicates missing/failed; penalty applied per policy.
  • evidence_paper_not_found — Evidence indicates missing/failed; penalty applied per policy. missing_evidence — Evidence indicates missing/failed; penalty applied per policy.

C) Evidence (4 tiles/cards; reasons required if not ok)

Official Page

✅ ok
refs:
  • https://openai.com
  • https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
extracted:
{
  "url": "https://openai.com"
}
reasons:
  • ok — Evidence indicates missing/failed; penalty applied per policy.
  • openrouter_model_page — Evidence indicates missing/failed; penalty applied per policy.
  • provider_fallback — Evidence indicates missing/failed; penalty applied per policy.
How this affected scoring

Evidence verified => no penalty applied for this signal.

Dev Activity

⚠️ missing source link
refs:
  • missing:github_repo
reasons:
  • A source was referenced but no valid link was provided; penalty applied per policy.
  • No public repository link found; development-activity score reduced.
How this affected scoring

missing source link => dev-activity score capped / penalty applied.

Paper

⚠️ not found
refs:
  • arxiv_query:"OpenAI: o3 Deep Research"
reasons:
  • No known paper source found; transparency score reduced.
  • No verifiable source found; penalty applied per policy.
How this affected scoring

not found => transparency score reduced / penalty applied.

Audit

⚠️ missing source link
refs:
  • missing:audit_link
reasons:
  • A source was referenced but no valid link was provided; penalty applied per policy.
  • No known audit source found; audit score reduced.
How this affected scoring

missing source link => audit and safety scores reduced / penalty applied.

D) Full Breakdown (every item must show score + inputs + used evidence + why)

ItemScoreInputs (raw)Evidence & Why
General benchmarks75

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Coding benchmarks81

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Math & chat benchmarks64

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Safety documentation0

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Alignment disclosure0

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Misuse policy coverage100

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
External audit & red teaming—

No input data provided.

Why: evidence_audit_missing_source_link — Evidence indicates missing/failed; penalty applied per policy. missing_evidence — Evidence indicates missing/failed; penalty applied per policy.

Evidence not provided.

Missing evidence link (spec violation).

Transparency updates100

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Minor incidents100

No input data provided.

Why: missing_minor_incidents — Evidence indicates missing/failed; penalty applied per policy.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Major incidents100

No input data provided.

Why: missing_major_incidents — Evidence indicates missing/failed; penalty applied per policy.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Critical incidents100

No input data provided.

Why: missing_critical_incidents — Evidence indicates missing/failed; penalty applied per policy.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Model documentation67

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Training data disclosure67

No input data provided.

Why: No reason provided; score derived from available signals.

Evidence: Official page: https://openai.com
Evidence: Official page: https://openrouter.ai/models/openai/o3-deep-research-2025-06-26
Paper / technical report—

No input data provided.

Why: evidence_paper_not_found — Evidence indicates missing/failed; penalty applied per policy. missing_evidence — Evidence indicates missing/failed; penalty applied per policy.

Evidence not provided.

Missing evidence link (spec violation).

External review & transparency—

No input data provided.

Why: evidence_audit_missing_source_link — Evidence indicates missing/failed; penalty applied per policy. missing_evidence — Evidence indicates missing/failed; penalty applied per policy.

Evidence not provided.

Missing evidence link (spec violation).

Missing or failed evidence inputs trigger fixed penalties per policy. No placeholder states are hidden.

References (deduped list)

official_page

  • https://openai.com
  • https://openrouter.ai/models/openai/o3-deep-research-2025-06-26

repo/dev

No references recorded.

paper

No references recorded.

audit

No references recorded.

other

No references recorded.