Speech-to-text, TTS, and audio analysis
15
Models
4
Benchmarks
6
Providers
7
Open Weight
14
Deployable Now
7 self-host
15
Structured benchmark-backed rows
0
Signal-backed rows without full normalized tables
0
Benchmark-expected rows still catching up
| # | Model | Value | Quality | Params | GAIA | SWE-Bench | Aider Polyglot | LiveBench Language | $/M In | Speed | Open |
|---|---|---|---|---|---|---|---|---|---|---|---|
18 | Qwen3-TTS-12Hz-1.7B-CustomVoice Qwen Open Weights |
| 51.1 |
| 51.1 |
| 2B |
| 44.2 |
| 69.6 |
| 8.0 |
| — |
Free Free |
| — |
19 | Qwen3-ASR-1.7B Qwen Open WeightsStructured | 50.6 | 50.6 | 2B | 44.2 | 69.6 | 8.0 | — | Free Free | — |
22 | cohere-transcribe-03-2026 Cohere Open WeightsStructured | 47.8 | 47.8 | Unknown | — | 42.4 | — | — | Free Free | — |
21 | Voxtral-Mini-4B-Realtime-2602 Mistral AI Open WeightsStructured | 47.7 | 47.7 | 4B | — | 42.4 | — | — | Free Free | — |
5 | qwen3-tts Qwen Open WeightsStructured | 46.7 | 46.7 | Unknown | 44.2 | 69.6 | — | — | Free Free | — |
1 | GPT Audio Mini OpenAI SubscriptionStructured | 43.6 | 18.5 | Undisclosed | — | 42.4 | — | — | $0.6000/M Cheapest verified | — |
24 | Voxtral-4B-TTS-2603 Mistral AI Open WeightsStructured | 43.5 | 43.5 | 4B | — | 42.4 | — | — | Free Free | — |
4 | Gemini 3.1 Flash TTS | 39.8 | 39.8 | Undisclosed | 63.1 | — | — | — | — | — |
8 | GPT-4o Mini Transcribe OpenAI SubscriptionStructured | 30.5 | 30.5 | Undisclosed | 60.6 | 42.4 | — | 57.1 | $0.1500/M Cheapest verified | — |
2 | GPT Audio OpenAI SubscriptionStructured | 26.5 | 18.0 | Undisclosed | — | 42.4 | — | — | $2.50/M Cheapest verified | — |
10 | GPT-4o Transcribe Diarize OpenAI SubscriptionStructured | 19.8 | 19.8 | Undisclosed | 60.6 | — | — | — | $2.50/M Cheapest verified | — |
6 | GPT-4o Mini TTS OpenAI SubscriptionStructured | 18.7 | 18.7 | Undisclosed | 60.6 | 42.4 | — | — | $0.1500/M Cheapest verified | — |
15 | GPT Realtime Mini OpenAI SubscriptionStructured | 18.5 | 18.5 | Undisclosed | — | 42.4 | — | — | $0.6000/M Cheapest verified | — |
20 | TTS-1 HD OpenAI Structured | 13.5 | 13.5 | Undisclosed | — | 42.4 | — | — | $30.00/M Cheapest verified | — |
23 | granite-4.0-1b-speech ibm-granite Open WeightsStructured | — | — | 1B | — | 42.4 | — | — | Free Free | — |