Stability AI
stable-audio-3-medium is a proprietary Stability AI specialized model.
---
Quality Score
---
Arena ELO
Undisclosed
Parameters
---
Context
Sign in to join the discussion
39.7K
Downloads
150
Likes
May 2026
Released
Launches
4
Open Source
1
Research
1
General
5
Recent launch, pricing, benchmark, and API signals linked to this model or its provider.
Confidence-based loss weighting is usually avoided in generative models because it accelerates errors when the model is confidently wrong, but this intuition breaks down in supervised diffusion training. We introduce the Eisbach log-barrier, a parameter-free weight derived from the entropy of the DiT output's spatial energy distribution: high entropy damps the gradient, while low entropy preserves it. Applied to LoRA fine-tuning of Stable Audio 3 Medium on MusicCaps, it unexpectedly yields stronger thematic development, clearer acoustic differentiation, and higher textural diversity than unweighted training, the opposite of mode collapse. This works because in supervised diffusion the gradient direction is locked to ground truth, so confidence only scales the step size, and because temporal entropy downweights flat samples while preserving high-contrast ones. The result is an online, self-referential data curriculum that emerges purely from the forward pass, with analyzed noise-level dynamics and testable predictions.