Qwen3 VL 32B Instruct

Name: Qwen3 VL 32B Instruct
Price: 0.104 USD
Availability: InStock
Author: Qwen

Vision & ImageOpen Weights

Qwen

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

Running this yourself: likely needs a rented cloud gpu.

Model updates refreshed4h agoJul 5, 2026news + changelog

View Updates Start Free Trial

---

Quality Score

---

Arena ELO

32B

Parameters

262K

Context

Benchmarks and Competitive Signal

Structured

Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

GAIAreasoning

44.2

Similar Models

Discussion (0)

Loading comments...

Official Benchmark Evidence

These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.