Speech-to-text, TTS, and audio analysis
- Models: 15
- Benchmarks: 3
- Providers: 6
- Open Weight: 7
- Deployable Now: 14 (7 self-host)
- Structured benchmark-backed rows: 15
- Signal-backed rows without full normalized tables: 0
- Benchmark-expected rows still catching up: 0

| # | Model | Popularity | Quality | Params | SWE-Bench | GAIA | Aider Polyglot | $/M In | Speed | Open |
|---|---|---|---|---|---|---|---|---|---|---|
| 19 | TTS-1 HD (OpenAI; Structured) | 50.0 | 13.7 | Undisclosed | 42.4 | — | — | $30.00/M (cheapest verified) | — |
| 5 | GPT-4o Mini Transcribe (OpenAI; Subscription, Structured) | 49.3 | 28.5 | Undisclosed | 42.4 | 60.6 | — | $0.1500/M (cheapest verified) | — |
| 21 | Qwen3-TTS-12Hz-1.7B-CustomVoice (Qwen; Self-Host, Structured) | 47.9 | 52.4 | 2B | 69.6 | 44.2 | 8.0 | Free | — |
| 22 | Qwen3-ASR-1.7B (Qwen; Self-Host, Structured) | 46.8 | 52.5 | 2B | 69.6 | 44.2 | 8.0 | Free | — |
| 11 | qwen3-tts (Qwen; Self-Host, Structured) | 46.1 | 52.3 | Unknown | 69.6 | 44.2 | — | Free | — |
| 18 | Voxtral-Mini-4B-Realtime-2602 (Mistral AI; Open Weights, Structured) | 46.0 | 48.6 | 4B | 42.4 | — | — | Free | — |
| 17 | GPT-4o Transcribe Diarize (OpenAI; Subscription, Structured) | 45.1 | 27.2 | Undisclosed | — | 60.6 | — | $2.50/M (cheapest verified) | — |
| 6 | GPT-4o Mini TTS (OpenAI; Subscription, Structured) | 44.9 | 19.1 | Undisclosed | 42.4 | 60.6 | — | $0.1500/M (cheapest verified) | — |
| 4 | Gemini 3.1 Flash TTS | 44.7 | 40.1 | Undisclosed | — | 63.1 | — | — | — |
| 20 | cohere-transcribe-03-2026 (Cohere; Open Weights, Structured) | 43.2 | 49.0 | Unknown | 42.4 | — | — | Free | — |
| 23 | Voxtral-4B-TTS-2603 (Mistral AI; Open Weights, Structured) | 42.7 | 43.4 | 4B | 42.4 | — | — | Free | — |
| 1 | GPT Audio (OpenAI; Subscription, Structured) | 42.3 | 18.6 | Undisclosed | 42.4 | — | — | $2.50/M (cheapest verified) | — |
| 3 | GPT Audio Mini (OpenAI; Subscription, Structured) | 41.6 | 19.2 | Undisclosed | 42.4 | — | — | $0.6000/M (cheapest verified) | — |
| 13 | GPT Realtime Mini (OpenAI; Subscription, Structured) | 41.6 | 19.2 | Undisclosed | 42.4 | — | — | $0.6000/M (cheapest verified) | — |
| 24 | granite-4.0-1b-speech (ibm-granite; Open Weights, Structured) | — | — | 1B | 42.4 | — | — | Free | — |