ditto--ai
A 4B-parameter safety and content moderation model that classifies user prompts and assistant responses as Safe, Unsafe, or Controversial with fine-grained category labels and refusal detection. Supports 119 languages.
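One way an application might consume the three-way verdict is sketched below. It is only an illustration: the `Verdict` labels come from the description above, while `ModerationResult`, `route`, the `strict` flag, and the action names are hypothetical and are not part of the model's actual output format or API.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List, Optional


class Verdict(Enum):
    # The three labels named in the model description.
    SAFE = "Safe"
    UNSAFE = "Unsafe"
    CONTROVERSIAL = "Controversial"


@dataclass
class ModerationResult:
    # Hypothetical container; the real output schema is not given here.
    verdict: Verdict
    categories: List[str] = field(default_factory=list)
    is_refusal: Optional[bool] = None  # only meaningful for assistant responses


def route(result: ModerationResult, strict: bool = False) -> str:
    """Map a verdict to an application-level action (assumed policy)."""
    if result.verdict is Verdict.UNSAFE:
        return "block"
    if result.verdict is Verdict.CONTROVERSIAL:
        # The middle label lets the caller pick a stricter or looser policy.
        return "block" if strict else "review"
    return "allow"
```

The point of a third label is that the same classification can back both a strict and a permissive deployment: only the handling of `CONTROVERSIAL` changes.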
Running this yourself: a consumer GPU should be enough.
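A rough back-of-the-envelope check of the hardware claim, assuming the memory footprint is dominated by the weights. The 4B parameter count is from this page; the precisions listed are common choices, not confirmed formats for this model, and the estimate ignores activations, KV cache, and framework overhead.

```python
# Weights-only memory estimate for a 4B-parameter model.
PARAMS = 4_000_000_000  # "4B" from the model listing

# Bytes per parameter for common precisions (assumed, not model-specific).
BYTES_PER_PARAM = {"fp32": 4.0, "fp16/bf16": 2.0, "int8": 1.0, "4-bit": 0.5}


def weight_memory_gib(params: int, bytes_per_param: float) -> float:
    """Return the size of the weights alone, in GiB."""
    return params * bytes_per_param / 2**30


for name, size in BYTES_PER_PARAM.items():
    print(f"{name:>9}: {weight_memory_gib(PARAMS, size):.1f} GiB")
```

At 16-bit precision the weights come to roughly 7.5 GiB, and at 4-bit to under 2 GiB, which is consistent with fitting on a typical consumer GPU once some headroom is left for inference.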
This model is still tracked for research and discovery, but it is excluded from default public rankings until it returns to active status.
Quality Score: n/a
Arena ELO: n/a
Parameters: 4B
Context: n/a
Downloads: 0
Likes: 0
Released: Apr 2026