DeepSeek-V3

Name: DeepSeek-V3
Rating: 76.9 (28000 reviews)
Author: DeepSeek

#71 | Large Language Models | Open Weights


DeepSeek-V3 is a 685B-parameter Mixture-of-Experts model that achieves GPT-4-level performance at significantly lower cost. It activates 37B parameters per token.

Running this yourself: likely needs a high-memory cloud GPU.
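To make the hardware note concrete, here is a rough back-of-the-envelope sketch of the memory needed for the weights alone. It uses the 685B and 37B figures from this listing; the storage precisions (BF16, FP8, 4-bit) and the helper names are assumptions for illustration, and the estimate ignores KV cache, activations, and runtime overhead.

```python
# Figures taken from the listing above.
TOTAL_PARAMS_B = 685   # total parameters, in billions
ACTIVE_PARAMS_B = 37   # parameters activated per token, in billions

# Assumed storage precisions (bytes per parameter); not stated in the listing.
PRECISIONS = {"BF16": 2.0, "FP8": 1.0, "4-bit": 0.5}


def weight_memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Weights-only footprint in GB (1 GB = 1e9 bytes).

    Billions of parameters times bytes per parameter gives GB directly.
    """
    return params_billion * bytes_per_param


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the parameters that is used for any single token."""
    return active_b / total_b


if __name__ == "__main__":
    for name, bpp in PRECISIONS.items():
        gb = weight_memory_gb(TOTAL_PARAMS_B, bpp)
        print(f"{name}: ~{gb:.0f} GB of weights")
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Active per token: {share:.1%}")
```

Although only about 5% of the parameters are active for a given token, a Mixture-of-Experts model generally needs every expert resident in memory, so the footprint scales with the total parameter count rather than the active count.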


Quality Score: 76.9
Arena ELO: 1334
Parameters: 685B
Context: 128K

Benchmarks and Competitive Signal

Structured

Use this section to answer one simple question first: how much outside evidence do we have that this model performs well? Structured benchmark scores appear first, then official provider evidence, then live arena signal.

This model has normalized benchmark rows, so scores here are directly comparable across benchmark sources.

BigCodeBench (code)


Official Benchmark Evidence

These are recent benchmark or leaderboard claims from official provider sources. They are useful for freshness and context, but they are not treated the same as normalized independent benchmark rows.

deepseek-v3.2-reasoner - SWE-Bench Verified

Benchmarks | swe-bench | Jul 5, 2026

SWE-Bench Verified resolved rate 60.0


DeepSeek-V3 - LiveCodeBench

Benchmarks | livecodebench | Jul 5, 2026

LiveCodeBench pass@1 49.6 across 1055 tasks
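As a quick sanity check on what this score means in absolute terms, the sketch below converts the pass@1 percentage into an approximate count of solved tasks. It assumes pass@1 here is simply the fraction of the 1055 tasks solved with one sample each; the listing does not say how the score was computed, and the function name is hypothetical.

```python
def implied_solved(pass_at_1_percent: float, num_tasks: int) -> int:
    """Approximate number of tasks solved, assuming pass@1 is a plain
    solved/total percentage with one attempt per task."""
    return round(pass_at_1_percent / 100 * num_tasks)


if __name__ == "__main__":
    solved = implied_solved(49.6, 1055)
    print(f"Roughly {solved} of 1055 tasks solved on the first attempt")
```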


deepseek-v3 - LiveBench Scores

Benchmarks | livebench | Jul 5, 2026

language: 0.5 | coding: 0.3 | instruction_following: 1.0 | Overall: 0.6
