VerdictLens — Browse AI models

Models

Browse AI models

A cleaner model directory for discovery, shortlisting, and jumping to official vendor pages without losing the ranking context.

Discovery → shortlist → compare

Built to help teams narrow choices, not just browse inventory.

Each row keeps best-fit workflows, compatibility cues, trust signals, and compare entry points visible so the directory can lead naturally into shortlists and later routing flows.

Compare shortlist

Choose up to three models from the directory, then compare them side by side.

2/3

GPT-5.4 Pro

OpenAI

Claude 3.7 Sonnet

Anthropic

Compare selected

SearchProviderBest for

Name	Best for	Works with	Pricing	Trust signals	View details
GPT-5.4 Pro OpenAI Score 93 Top-tier generalist model with excellent reasoning depth, strong coding reliability, and mature agent tooling support.	CodingResearchAgent automation	Codex CLILangGraphMCP serversPlaywright	$12 / 1M tok $48 / 1M tok Balanced	GPT-5.4 Pro official ↗ Last verified: Apr 8, 2026	View details
Claude 3.7 Sonnet Anthropic Score 91 Highly trusted reasoning and coding model with exceptional writing quality and calm, consistent outputs.	CodingResearch	Claude CodeLangGraphPlaywrightMCP servers	$3 / 1M tok $15 / 1M tok Balanced	Claude 3.7 Sonnet official ↗ Last verified: Apr 8, 2026	View details
Gemini 2.5 Pro Google Score 90 Powerful multimodal model with strong long-context analysis, research workflows, and broad document understanding.	ResearchAgent automation	Notebook workflowsDocument analysisn8nVertex AI stacks	$3.5 / 1M tok $10.5 / 1M tok Balanced	Gemini 2.5 Pro official ↗ Last verified: Apr 8, 2026	View details
GPT-5 Mini OpenAI Score 87 Lean, affordable OpenAI model tuned for responsive assistants, classification, and operational agent tasks.	Agent automationResearch	n8nZapier AI Actions1Password CLIMCP servers	$1.1 / 1M tok $4.4 / 1M tok Fast	GPT-5 Mini official ↗ Last verified: Apr 8, 2026	View details
Gemini 2.5 Flash Google Score 86 Low-latency multimodal workhorse for assistant surfaces, routing layers, and lightweight agent execution.	Agent automationResearch	Routing layersMultimodal inboxesn8nVertex AI stacks	$0.7 / 1M tok $2.8 / 1M tok Fast	Gemini 2.5 Flash official ↗ Last verified: Apr 8, 2026	View details
DeepSeek R1 DeepSeek Score 85 High-value reasoning model that punches above its price tier for technical problem solving and analytical depth.	CodingResearch	Cost-sensitive reasoning stacksbatch analysisfallback reasoning lanes	$0.55 / 1M tok $2.2 / 1M tok Deliberate	DeepSeek R1 official ↗ Last verified: Apr 8, 2026	View details
Claude 3.5 Haiku Anthropic Score 84 Fast and lightweight model ideal for summarization, routing, support tasks, and budget-sensitive assistants.	ResearchAgent automation	LangGraphSerpApiSlack assistants	$0.8 / 1M tok $4 / 1M tok Fast	Claude 3.5 Haiku official ↗ Last verified: Apr 8, 2026	View details
Qwen 3 Max Alibaba Cloud Score 84 Ambitious frontier contender with strong multilingual performance and good enterprise utility across Asian markets.	ResearchAgent automation	multilingual support workflowsenterprise copilotsAsian-market products	$2.4 / 1M tok $9 / 1M tok Balanced	Qwen 3 Max official ↗ Last verified: Apr 8, 2026	View details
Grok 3 Beta xAI Score 83 Strong real-time orientation and increasingly capable reasoning model with distinctive web freshness advantages.	Research	Fresh web scansMarket monitoringsocial-context workflows	$5 / 1M tok $15 / 1M tok Balanced	Grok 3 Beta official ↗ Last verified: Apr 8, 2026	View details
Sonar Reasoning Pro Perplexity Score 83 Research-first model experience optimized for answer grounding, current web synthesis, and citation-friendly output.	Research	citation-heavy briefsmarket scansanswer-grounding workflows	$2 / 1M tok $8 / 1M tok Balanced	Sonar Reasoning Pro official ↗ Last verified: Apr 8, 2026	View details
Mistral Large 2 Mistral Score 82 European flagship model with solid reasoning, concise style, and attractive deployment flexibility.	ResearchCoding	EU deployment needsinternal copilotsAPI-first stacks	$2 / 1M tok $6 / 1M tok Balanced	Mistral Large 2 official ↗ Last verified: Apr 8, 2026	View details
Command A Cohere Score 81 Enterprise-oriented model with strong retrieval posture, business language handling, and dependable workflow integration.	ResearchAgent automation	RAG stacksenterprise searchbusiness writing workflows	$2 / 1M tok $8 / 1M tok Balanced	Command A official ↗ Last verified: Apr 8, 2026	View details
Llama 4 Maverick Meta Score 80 Flexible open-weight option with broad community experimentation and strong customization potential.	Agent automationResearch	self-hosted inferencevector retrievalcustom fine-tuning	Self-host / variable Self-host / variable Balanced	Llama 4 Maverick official ↗ Last verified: Apr 8, 2026	View details

OpenAI

GPT-5.4 Pro

Score

Top-tier generalist model with excellent reasoning depth, strong coding reliability, and mature agent tooling support.

Best for

Coding · Research · Agent automation

Works with

Codex CLI · LangGraph · MCP servers

Pricing

$12 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Anthropic

Claude 3.7 Sonnet

Score

Highly trusted reasoning and coding model with exceptional writing quality and calm, consistent outputs.

Best for

Coding · Research

Works with

Claude Code · LangGraph · Playwright

Pricing

$3 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Google

Gemini 2.5 Pro

Score

Powerful multimodal model with strong long-context analysis, research workflows, and broad document understanding.

Best for

Research · Agent automation

Works with

Notebook workflows · Document analysis · n8n

Pricing

$3.5 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

OpenAI

GPT-5 Mini

Score

Lean, affordable OpenAI model tuned for responsive assistants, classification, and operational agent tasks.

Best for

Agent automation · Research

Works with

n8n · Zapier AI Actions · 1Password CLI

Pricing

$1.1 / 1M tok · Fast

Last verified

Apr 8, 2026

Official link ↗View details

Google

Gemini 2.5 Flash

Score

Low-latency multimodal workhorse for assistant surfaces, routing layers, and lightweight agent execution.

Best for

Agent automation · Research

Works with

Routing layers · Multimodal inboxes · n8n

Pricing

$0.7 / 1M tok · Fast

Last verified

Apr 8, 2026

Official link ↗View details

DeepSeek

DeepSeek R1

Score

High-value reasoning model that punches above its price tier for technical problem solving and analytical depth.

Best for

Coding · Research

Works with

Cost-sensitive reasoning stacks · batch analysis · fallback reasoning lanes

Pricing

$0.55 / 1M tok · Deliberate

Last verified

Apr 8, 2026

Official link ↗View details

Anthropic

Claude 3.5 Haiku

Score

Fast and lightweight model ideal for summarization, routing, support tasks, and budget-sensitive assistants.

Best for

Research · Agent automation

Works with

LangGraph · SerpApi · Slack assistants

Pricing

$0.8 / 1M tok · Fast

Last verified

Apr 8, 2026

Official link ↗View details

Alibaba Cloud

Qwen 3 Max

Score

Ambitious frontier contender with strong multilingual performance and good enterprise utility across Asian markets.

Best for

Research · Agent automation

Works with

multilingual support workflows · enterprise copilots · Asian-market products

Pricing

$2.4 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

xAI

Grok 3 Beta

Score

Strong real-time orientation and increasingly capable reasoning model with distinctive web freshness advantages.

Best for

Research

Works with

Fresh web scans · Market monitoring · social-context workflows

Pricing

$5 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Perplexity

Sonar Reasoning Pro

Score

Research-first model experience optimized for answer grounding, current web synthesis, and citation-friendly output.

Best for

Research

Works with

citation-heavy briefs · market scans · answer-grounding workflows

Pricing

$2 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Mistral

Mistral Large 2

Score

European flagship model with solid reasoning, concise style, and attractive deployment flexibility.

Best for

Research · Coding

Works with

EU deployment needs · internal copilots · API-first stacks

Pricing

$2 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Cohere

Command A

Score

Enterprise-oriented model with strong retrieval posture, business language handling, and dependable workflow integration.

Best for

Research · Agent automation

Works with

RAG stacks · enterprise search · business writing workflows

Pricing

$2 / 1M tok · Balanced

Last verified

Apr 8, 2026

Official link ↗View details

Llama 4 Maverick

Score

Flexible open-weight option with broad community experimentation and strong customization potential.

Best for

Agent automation · Research

Works with

self-hosted inference · vector retrieval · custom fine-tuning

Pricing

Self-host / variable · Balanced

Last verified

Apr 8, 2026

Official link ↗View details