Models

Browse AI models

A cleaner model directory for discovery, shortlisting, and jumping to official vendor pages without losing the ranking context.

Discovery → shortlist → compare

Built to help teams narrow choices, not just browse inventory.

Each row keeps best-fit workflows, compatibility cues, trust signals, and compare entry points visible so the directory can lead naturally into shortlists and later routing flows.

Compare shortlist

Choose up to three models from the directory, then compare them side by side.

2/3
GPT-5.4 Pro
OpenAI
Claude 3.7 Sonnet
Anthropic
Compare selected
OpenAI

GPT-5.4 Pro

Score
93

Top-tier generalist model with excellent reasoning depth, strong coding reliability, and mature agent tooling support.

Best for
Coding · Research · Agent automation
Works with
Codex CLI · LangGraph · MCP servers
Pricing
$12 / 1M tok · Balanced
Last verified
Apr 8, 2026
Anthropic

Claude 3.7 Sonnet

Score
91

Highly trusted reasoning and coding model with exceptional writing quality and calm, consistent outputs.

Best for
Coding · Research
Works with
Claude Code · LangGraph · Playwright
Pricing
$3 / 1M tok · Balanced
Last verified
Apr 8, 2026
Google

Gemini 2.5 Pro

Score
90

Powerful multimodal model with strong long-context analysis, research workflows, and broad document understanding.

Best for
Research · Agent automation
Works with
Notebook workflows · Document analysis · n8n
Pricing
$3.5 / 1M tok · Balanced
Last verified
Apr 8, 2026
OpenAI

GPT-5 Mini

Score
87

Lean, affordable OpenAI model tuned for responsive assistants, classification, and operational agent tasks.

Best for
Agent automation · Research
Works with
n8n · Zapier AI Actions · 1Password CLI
Pricing
$1.1 / 1M tok · Fast
Last verified
Apr 8, 2026
Google

Gemini 2.5 Flash

Score
86

Low-latency multimodal workhorse for assistant surfaces, routing layers, and lightweight agent execution.

Best for
Agent automation · Research
Works with
Routing layers · Multimodal inboxes · n8n
Pricing
$0.7 / 1M tok · Fast
Last verified
Apr 8, 2026
DeepSeek

DeepSeek R1

Score
85

High-value reasoning model that punches above its price tier for technical problem solving and analytical depth.

Best for
Coding · Research
Works with
Cost-sensitive reasoning stacks · batch analysis · fallback reasoning lanes
Pricing
$0.55 / 1M tok · Deliberate
Last verified
Apr 8, 2026
Anthropic

Claude 3.5 Haiku

Score
84

Fast and lightweight model ideal for summarization, routing, support tasks, and budget-sensitive assistants.

Best for
Research · Agent automation
Works with
LangGraph · SerpApi · Slack assistants
Pricing
$0.8 / 1M tok · Fast
Last verified
Apr 8, 2026
Alibaba Cloud

Qwen 3 Max

Score
84

Ambitious frontier contender with strong multilingual performance and good enterprise utility across Asian markets.

Best for
Research · Agent automation
Works with
multilingual support workflows · enterprise copilots · Asian-market products
Pricing
$2.4 / 1M tok · Balanced
Last verified
Apr 8, 2026
xAI

Grok 3 Beta

Score
83

Strong real-time orientation and increasingly capable reasoning model with distinctive web freshness advantages.

Best for
Research
Works with
Fresh web scans · Market monitoring · social-context workflows
Pricing
$5 / 1M tok · Balanced
Last verified
Apr 8, 2026
Perplexity

Sonar Reasoning Pro

Score
83

Research-first model experience optimized for answer grounding, current web synthesis, and citation-friendly output.

Best for
Research
Works with
citation-heavy briefs · market scans · answer-grounding workflows
Pricing
$2 / 1M tok · Balanced
Last verified
Apr 8, 2026
Mistral

Mistral Large 2

Score
82

European flagship model with solid reasoning, concise style, and attractive deployment flexibility.

Best for
Research · Coding
Works with
EU deployment needs · internal copilots · API-first stacks
Pricing
$2 / 1M tok · Balanced
Last verified
Apr 8, 2026
Cohere

Command A

Score
81

Enterprise-oriented model with strong retrieval posture, business language handling, and dependable workflow integration.

Best for
Research · Agent automation
Works with
RAG stacks · enterprise search · business writing workflows
Pricing
$2 / 1M tok · Balanced
Last verified
Apr 8, 2026
Meta

Llama 4 Maverick

Score
80

Flexible open-weight option with broad community experimentation and strong customization potential.

Best for
Agent automation · Research
Works with
self-hosted inference · vector retrieval · custom fine-tuning
Pricing
Self-host / variable · Balanced
Last verified
Apr 8, 2026