Speech-to-text, TTS, and audio analysis
10
Models
3
Benchmarks
6
Providers
7
Open Weight
9
Deployable Now
7 self-host
10
Structured benchmark-backed rows
0
Signal-backed rows without full normalized tables
0
Benchmark-expected rows still catching up
| # | Model | Adoption | Quality | Params | GAIA | SWE-Bench | Aider Polyglot | $/M In | Speed | Open |
|---|---|---|---|---|---|---|---|---|---|---|
11 | qwen3-tts Qwen Open Weights |
| 64.7 |
| 52.3 |
| Unknown |
| 44.2 |
| 69.6 |
| — |
Free Free |
| — |
19 | TTS-1 HD OpenAI Structured | 63.0 | 13.7 | Undisclosed | — | 42.4 | — | $30.00/M Cheapest verified | — |
22 | Qwen3-ASR-1.7B Qwen Open WeightsStructured | 62.4 | 52.5 | 2B | 44.2 | 69.6 | 8.0 | Free Free | — |
21 | Qwen3-TTS-12Hz-1.7B-CustomVoice Qwen Open WeightsStructured | 62.4 | 52.4 | 2B | 44.2 | 69.6 | 8.0 | Free Free | — |
1 | GPT Audio OpenAI SubscriptionStructured | 61.8 | 18.6 | Undisclosed | — | 42.4 | — | $2.50/M Cheapest verified | — |
17 | GPT-4o Transcribe Diarize OpenAI SubscriptionStructured | 59.0 | 27.2 | Undisclosed | 60.6 | — | — | $2.50/M Cheapest verified | — |
20 | cohere-transcribe-03-2026 Cohere Open WeightsStructured | 56.4 | 49.0 | Unknown | — | 42.4 | — | Free Free | — |
23 | Voxtral-4B-TTS-2603 Mistral AI Open WeightsStructured | 55.7 | 43.4 | 4B | — | 42.4 | — | Free Free | — |
14 | qwenasr twangodev Open WeightsStructuredArchived | — | — | Unknown | — | 69.6 | — | Free Free | — |
24 | granite-4.0-1b-speech ibm-granite Open WeightsStructured | — | — | 1B | — | 42.4 | — | Free Free | — |