Track, rank, and compare every AI model in the world.

Platform

Models
Deploy
Leaderboards
Compare
News
Marketplace
Workspace
Deployments
Discover Watchlists
Pricing

Company

About
Roadmap
Contact
FAQ
Providers
API
Terms
Privacy

Qwen3 235B A22B by Qwen | AI Market Cap

Back to Models

Qwen3 235B A22B

Name: Qwen3 235B A22B
Price: 0.455 USD
Availability: InStock
Rating: 50.2 (1 reviews)
Author: Qwen

#34Large Language ModelsOpen Weights

Qwen

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

Running this yourself: likely needs a high-memory cloud gpu.

Model updates refreshed4h agoJul 4, 2026news + changelog

View Updates Start Free Trial

50.2

Quality Score

1367

Arena ELO

235B

Parameters

131K

Context

Similar Models

Discussion (0)

Loading comments...

Downloads

Likes

Apr 2025

Released

Benchmarks

high

Open Source

medium

Research

low

What Changed Recently

Recent launch, pricing, benchmark, and API signals linked to this model or its provider.

BenchmarksToday

Qwen3-235B-A22B - Arena-Hard-Auto

Arena-Hard-Auto official Gemini-2.5 judged score 58.4 with CI -1.9/2.1

View source

BenchmarksToday

Qwen3-235B-A22B - LiveCodeBench

LiveCodeBench pass@1 80.4 across 1055 tasks

View source

Research Papers1

HF PapersQwenresearch1mo ago

Other

ollama-libraryQwenopen_sourceopen sourceToday

Qwen3 235B A22B is now available on Ollama

77.0

Benchmarks3mo ago

https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct - SWE-Bench Verified

SWE-Bench Verified resolved rate 69.6

View source

Benchmarks3mo ago

https://huggingface.co/Qwen/Qwen3-Coder-480B-A35B-Instruct - SWE-Bench Verified

SWE-Bench Verified resolved rate 69.6

View source

Benchmarks10mo ago

Qwen3 - GAIA

GAIA score 44.2 from WA0824

View source

ACC: Compiling Agent Trajectories for Long-Context Training

Recent development of agents has renewed demand for long-context reasoning capacity of LLMs. However, training LLMs for this capacity requires costly long-document curation or heuristic context synthesis. We observe that agents produce massive trajectories when solving problems, invoking tools and receiving environment observations across many turns. The evidence needed to answer the original question is thus scattered throughout these turns, requiring integration of distant context segments. Nevertheless, standard agent SFT masks tool responses and only trains turn-level tool selection, creating a supervision blind spot where these scattered signals go unused. We propose Agent Context Compilation (ACC), which converts trajectories from search, software engineering, and database querying agents into long-context QA pairs that combine the original question with tool responses and environment observations gathered across multiple turns, training the model to answer directly without tool use. This makes the dependencies between the question and the evidence explicit, enabling direct supervision of long-context reasoning over distant segments without additional annotation. ACC is a simple but effective approach that can be combined with any existing long-context extension or training method, providing scalable supervised fine-tuning data. We validate ACC on long-range dependency modeling tasks through MRCR and GraphWalks, challenging benchmarks requiring cross-turn coreference resolution and graph traversal over extended contexts. Training Qwen3-30B-A3B with ACC achieves 68.3 on MRCR (+18.1) and 77.5 on GraphWalks (+7.6), results comparable to Qwen3-235B-A22B, while preserving general capabilities on GPQA, MMLU-Pro, AIME, and IFEval. Further mechanism analysis reveals that the ACC-trained model exhibits task-adaptive attention restructuring and expert specialization.

View Source

#huggingface#daily-papers

Qwen3 235B A22B is now available through local Ollama runtime. 40K context window listed. Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models.

View Source

#deployability#ollama#qwen

arena-hard-autoToday