OpenAI

gpt-realtime-mini

A cost-efficient realtime model for voice and streaming interactions over WebRTC, WebSocket, or SIP.

Overall score

realtime, voice, streaming

Context window

32K tokens

Speed

Fast

Input pricing

$0.60 / 1M text input tokens

Output pricing

$2.40 / 1M text output tokens
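The two rates above can be turned into a rough budget estimate. The sketch below is illustrative only: the function name and the example workload are hypothetical, and it covers text tokens only, since the listing gives no audio or image rates.

```python
# Rates copied from the listing above (USD per 1M text tokens).
INPUT_USD_PER_M = 0.60
OUTPUT_USD_PER_M = 2.40


def estimate_text_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for text tokens only."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


# Hypothetical workload: 2M text input tokens and 500K text output tokens.
print(f"${estimate_text_cost(2_000_000, 500_000):.2f}")  # prints "$2.40"
```

Output tokens cost four times as much as input tokens at these rates, so response length tends to dominate the bill for chatty agents.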

Score breakdown

Capability: 79

Use-case fit: 88

Cost efficiency: 89

Speed: 98

Reliability: 84

Agent readiness: 87

Ecosystem: 90

Scores combine benchmark signals, product experience, and editorial weighting. Use them as a practical guide, not an absolute truth claim.

Best for

Agent automation

Works with

Voice agents, WebRTC apps, Telephony

Modalities

text, audio, image

Sources & trust

Officially verified core fields

Official link, Summary, Description, Modalities, Modality profile, Context window, Max output, Pricing, Pricing page

Editorial fields such as shortlist guidance, strengths, caveats, and scoring remain clearly separated from official provider data.

OpenAI official

Official site · Tier 5 · Apr 9, 2026

Official link

gpt-realtime-mini model docs

Official docs · Tier 5 · Apr 9, 2026

Summary, Description, Modalities, Modality profile, Context window, Max output

OpenAI API pricing

Pricing page · Tier 5 · Apr 9, 2026

Pricing, Pricing page

gpt-realtime-mini VerdictLens review

Manual review · Tier 3 · Apr 9, 2026

Best-fit guidance, Works-with guidance, Strengths, Caveats, Overall score, Score breakdown

Last verified: Apr 9, 2026

Strengths

Low-latency audio and text interaction is the main strength.
Useful when UX speed matters more than deep long-form reasoning.

Things to watch

Smaller context window (32K tokens) and output limits than general frontier text models.
Not the right default for research-heavy or long-document tasks.
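One common way to live with a small context window is to drop the oldest conversation turns before each request. The sketch below is a minimal illustration under stated assumptions: the 4-characters-per-token estimate is a crude heuristic rather than the model's tokenizer, "32K" is read as 32,000, and the function names are hypothetical.

```python
CONTEXT_WINDOW_TOKENS = 32_000  # "32K tokens" from the listing, read as 32,000 (assumption)
CHARS_PER_TOKEN = 4             # assumption: rough heuristic, not the real tokenizer


def estimate_tokens(text: str) -> int:
    """Crude token estimate based on character count."""
    return max(1, len(text) // CHARS_PER_TOKEN)


def trim_history(turns: list[str], budget: int = CONTEXT_WINDOW_TOKENS) -> list[str]:
    """Keep the most recent turns whose estimated size fits within the budget."""
    kept: list[str] = []
    used = 0
    # Walk from newest to oldest so recent context survives.
    for turn in reversed(turns):
        cost = estimate_tokens(turn)
        if used + cost > budget:
            break
        kept.append(turn)
        used += cost
    kept.reverse()  # restore chronological order
    return kept
```

In practice the budget would be set below the full window to leave room for instructions and the model's reply.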

Best for

Agent automation & operations

Prioritize tool reliability, composability, secret handling, and robust state management across long-running flows.