qwen3guard-gen-4b

Name: qwen3guard-gen-4b
Author: ditto--ai

SpecializedOpen Weights

ditto--ai

A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.

Running this yourself: consumer gpu should be enough.

Model updates refreshed3w agoJun 10, 2026news + changelog

View Updates Self-Host

Archived

This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.

---

Quality Score

---

Arena ELO

Parameters

---

Context

Similar Models

Discussion (0)

Loading comments...