Discover which AI models are strongest at specific tasks. Rankings combine benchmark performance with pricing, access, and market context.
Start with the skill cards below if you want the short answer.
Code generation, debugging, and software engineering
Complex reasoning, problem solving, and critical thinking
Mathematical problem solving and computation
Web browsing, form filling, and browser-based tasks
Text generation, translation, and communication
Autonomous task execution and tool orchestration
Factual knowledge retrieval and question answering
Code generation, debugging, and software engineering· ranked by average benchmark score
Top Performer
Avg score 90.2 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for GPT-4o
Value 72 · Trust 100
Open Weights Leader
Avg score 89.0 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | GPT-4oOpenAI | 90.2 | 1 | $244M | $5.00/M Cheapest verified | Subscribe |
2 | GPT-4oOpenAI | 90.2 | 1 | $230M | $2.50/M Cheapest verified | Subscribe |
3 | GPT-4oOpenAI | 90.2 | 1 | $226M | $2.50/M Cheapest verified | Subscribe |
4 | GPT-4oOpenAI | 90.2 | 1 | $275M | $2.50/M Cheapest verified | Subscribe |
5 | Nemotron-Cascade-2-30B-A3BNVIDIA | 89.3 | 1 | $23M | --- | Start Free Trial |
6 | Llama 3.1 405B InstructMeta | 89.0 | 1 | $25M | Free Free | Start Free Trial |
7 | meta-llama-3.1-405b-instructMeta | 89.0 | 1 | $32M | Free Free | Start Free Trial |
8 | llama-3.1-405b-instructMeta | 89.0 | 1 | $36M | Free Free | Start Free Trial |
9 | Llama 3.3 70BMeta | 88.4 | 1 | $36M | Free Free | Start Free Trial |
10 | Mistral Large 2Mistral AI | 85.5 | 1 | $189M | --- | View |
Complex reasoning, problem solving, and critical thinking· ranked by average benchmark score
Top Performer
Avg score 96.5 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for GPT-5.5
Value 72 · Trust 100
Open Weights Leader
Avg score 93.5 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | Grok-3xAI | 96.5 | 1 | $159M | --- | Subscribe |
2 | GPT-5.5OpenAI | 93.6 | 1 | $368M | $5.00/M Cheapest verified | Subscribe |
3 | GPT-5.5OpenAI | 93.6 | 1 | $312M | $5.00/M Cheapest verified | Subscribe |
4 | Llama 3.3 70BMeta | 93.5 | 1 | $36M | Free Free | Start Free Trial |
5 | Llama 4 MaverickMeta | 93.5 | 1 | $36M | Free Free | Start Free Trial |
6 | Mistral Large 2Mistral AI | 93.0 | 1 | $189M | --- | View |
7 | Gemini 2.0 FlashGoogle | 93.0 | 1 | $28M | --- | Subscribe |
8 | GPT-5.4OpenAI | 92.8 | 1 | $153M | $30.00/M Cheapest verified | Subscribe |
9 | GPT-5.4OpenAI | 92.8 | 1 | $119M | --- | Subscribe |
10 | GPT-5.4OpenAI | 92.8 | 1 | $191M | $2.50/M Cheapest verified | Subscribe |
Mathematical problem solving and computation· ranked by average benchmark score
Top Performer
Avg score 97.5 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for Claude Opus 4.6
Value 54 · Trust 100
Open Weights Leader
Avg score 97.5 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | R1 0528DeepSeek | 97.5 | 1 | $50M | $0.5000/M Cheapest verified | View |
2 | DeepSeek-R1DeepSeek | 97.3 | 2 | $61M | Free Free | Start Free Trial |
3 | deepseek-r1DeepSeek | 97.3 | 1 | --- | Free Free | Start Free Trial |
4 | R1DeepSeek | 97.3 | 1 | $52M | $0.7000/M Cheapest verified | Start Free Trial |
5 | o4-miniOpenAI | 94.0 | 1 | $158M | $1.10/M Cheapest verified | Get API Access |
6 | o4-miniOpenAI | 94.0 | 1 | $122M | --- | Get API Access |
7 | DeepSeek V3DeepSeek | 90.2 | 1 | $51M | $0.3200/M Cheapest verified | Start Free Trial |
8 | deepseek-v3DeepSeek | 90.2 | 1 | --- | Free Free | Start Free Trial |
9 | DeepSeek-V3DeepSeek | 85.1 | 2 | $63M | Free Free | Start Free Trial |
10 | Claude Opus 4.6Anthropic | 85.0 | 1 | $292M | $5.00/M Cheapest verified | Subscribe |
Web browsing, form filling, and browser-based tasks· ranked by average benchmark score
Top Performer
Avg score 74.3 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Start Free Trial for DeepSeek-V3.2
Value 67 · Trust 60
Open Weights Leader
Avg score 74.3 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | DeepSeek-V3.2DeepSeek | 74.3 | 1 | $43M | Free Free | Start Free Trial |
2 | DeepSeek V3.2DeepSeek | 74.3 | 1 | $42M | $0.2520/M Cheapest verified | Start Free Trial |
Text generation, translation, and communication· ranked by average benchmark score
Top Performer
Avg score 91.5 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for GPT-5.2
Value 72 · Trust 100
Open Weights Leader
Avg score 91.0 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | Claude Opus 4.6Anthropic | 91.5 | 1 | $292M | $5.00/M Cheapest verified | Subscribe |
2 | R1 0528DeepSeek | 91.0 | 1 | $50M | $0.5000/M Cheapest verified | View |
3 | Claude 4 OpusAnthropic | 91.0 | 1 | $365M | --- | Subscribe |
4 | R1DeepSeek | 90.8 | 1 | $52M | $0.7000/M Cheapest verified | Start Free Trial |
5 | deepseek-r1DeepSeek | 90.8 | 1 | --- | Free Free | Start Free Trial |
6 | DeepSeek-R1DeepSeek | 90.8 | 1 | $61M | Free Free | Start Free Trial |
7 | Claude 4 SonnetAnthropic | 90.4 | 1 | $228M | --- | Subscribe |
8 | GPT-5.2OpenAI | 89.6 | 1 | $126M | --- | Subscribe |
9 | GPT-5.2OpenAI | 89.6 | 1 | $156M | $1.75/M Cheapest verified | Subscribe |
10 | GPT-5.2OpenAI | 89.6 | 1 | --- | --- | Subscribe |
Autonomous task execution and tool orchestration· ranked by average benchmark score
Top Performer
Avg score 95.6 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for GPT-5.1
Value 72 · Trust 100
Open Weights Leader
Avg score 87.1 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | GPT-5.1OpenAI | 95.6 | 1 | --- | --- | Subscribe |
2 | GPT-5.1OpenAI | 95.6 | 1 | $124M | --- | Subscribe |
3 | GPT-5.1OpenAI | 95.6 | 1 | $153M | $1.25/M Cheapest verified | Subscribe |
4 | Qwen-Image-Edit-2511-Multiple-Angles-LoRAfal | 87.1 | 1 | --- | Free Free | View |
5 | GPT-5.5OpenAI | 86.5 | 3 | $312M | $5.00/M Cheapest verified | Subscribe |
6 | GPT-5.5OpenAI | 86.5 | 3 | $368M | $5.00/M Cheapest verified | Subscribe |
7 | GPT-5OpenAI | 84.3 | 2 | $147M | $1.25/M Cheapest verified | Subscribe |
8 | GPT-5.2OpenAI | 81.8 | 2 | $156M | $21.00/M Cheapest verified | Subscribe |
9 | GPT-5.2OpenAI | 81.8 | 2 | --- | --- | Subscribe |
10 | GPT-5.2OpenAI | 81.8 | 2 | $155M | $1.75/M Cheapest verified | Subscribe |
Factual knowledge retrieval and question answering· ranked by average benchmark score
Top Performer
Avg score 91.5 across 1 benchmark.
Best Budget Pick
Free access path · Free
Best Access Path
Subscribe for GPT-5.2
Value 72 · Trust 100
Open Weights Leader
Avg score 91.0 with public weights.
| # | Model | Avg Score | Benchmarks | Est. Value | Cheapest Verified | Try It |
|---|---|---|---|---|---|---|
1 | Claude Opus 4.6Anthropic | 91.5 | 1 | $292M | $5.00/M Cheapest verified | Subscribe |
2 | R1 0528DeepSeek | 91.0 | 1 | $50M | $0.5000/M Cheapest verified | View |
3 | Claude 4 OpusAnthropic | 91.0 | 1 | $365M | --- | Subscribe |
4 | R1DeepSeek | 90.8 | 1 | $52M | $0.7000/M Cheapest verified | Start Free Trial |
5 | deepseek-r1DeepSeek | 90.8 | 1 | --- | Free Free | Start Free Trial |
6 | DeepSeek-R1DeepSeek | 90.8 | 1 | $61M | Free Free | Start Free Trial |
7 | Claude 4 SonnetAnthropic | 90.4 | 1 | $228M | --- | Subscribe |
8 | GPT-5.2OpenAI | 89.6 | 1 | $126M | --- | Subscribe |
9 | GPT-5.2OpenAI | 89.6 | 1 | $156M | $1.75/M Cheapest verified | Subscribe |
10 | GPT-5.2OpenAI | 89.6 | 1 | --- | --- | Subscribe |
Browse the marketplace for fine-tuned models, API access, and specialized solutions tailored to your use case.