
© 2026 AI Market Cap. All rights reserved.

stable-audio-3-medium by Stability AI | AI Market Cap

stable-audio-3-medium

Specialized · Proprietary

Stability AI

stable-audio-3-medium is a proprietary Stability AI specialized model.

Model updates refreshed 11m ago · Jul 5, 2026 · news + changelog

Quality Score: ---

Arena ELO: ---

Parameters: Undisclosed

Context: ---

Similar Models

gemma-4-26B-A4B #258

MiniMax · Undisclosed

Downloads: 39.7K

Likes: 150

Released: May 2026

Launches: 4 (high)

Open Source: 1 (medium)

Research: 1 (low)

General: 5 (low)

What Changed Recently

Recent launch, pricing, benchmark, and API signals linked to this model or its provider.

Launches · Stability AI · Today

Introducing Stability AI Solutions: Generative AI Solutions to Accelerate Enterprise Creative Production

Launches · Stability AI · Today

Stability AI and Arm Collaborate to Release Stable Audio Open Small, Enabling Real-World Deployment for On-Device Audio Generation

Launches · Stability AI · Today

Universal Music Group and Stability AI Announce Strategic Alliance to Co-Develop Professional AI Music Creation Tools

Launches · Stability AI · Today

Introducing Brand Studio: The creative production platform powered by your brand

Open Source · Stability AI · Today

Meet Stable Audio 3.0, the model family built for artistic experimentation with open-weight models

Social & Blog Posts (10)

Blog · Stability AI · announcement · general · Today

Stable Diffusion Now Optimized for AMD Radeon™ GPUs and Ryzen™ AI APUs

#stability ai #blog #update

Blog · Stability AI · announcement · general · Today

Stable Diffusion 3.5 Models Optimized with TensorRT Deliver 2X Faster Performance and 40% Less Memory on NVIDIA RTX GPUs

#stability ai #blog #update

Blog · Stability AI · announcement · general · Today

Stable Video 4D 2.0: New Upgrades for High-Fidelity Novel-Views and 4D Generation from a Single Video

#stability ai #blog #update

Blog · Stability AI · launch · Today

Introducing Stability AI Solutions: Generative AI Solutions to Accelerate Enterprise Creative Production

#stability ai #blog #launch

Blog · Stability AI · launch · Today

Stability AI and Arm Collaborate to Release Stable Audio Open Small, Enabling Real-World Deployment for On-Device Audio Generation

#stability ai #blog #launch

Blog · Stability AI · announcement · general · Today

Stability AI and NVIDIA Bring Faster Performance and Simplified Enterprise Deployment with the Stable Diffusion 3.5 NIM

#stability ai #blog #update

Blog · Stability AI · launch · Today

Universal Music Group and Stability AI Announce Strategic Alliance to Co-Develop Professional AI Music Creation Tools

#stability ai #blog #launch

Blog · Stability AI · announcement · general · Today

Stability AI Introduces Stable Audio 2.5, the First Audio Model Built for Enterprise Sound Production at Scale

#stability ai #blog #update

Blog · Stability AI · launch · Today

Introducing Brand Studio: The creative production platform powered by your brand

#stability ai #blog #launch

Blog · Stability AI · open source · Today

Meet Stable Audio 3.0, the model family built for artistic experimentation with open-weight models

#stability ai #blog #open-source

70.9

Phi-4-reasoning-vision-15B #309

DeepSeek-OCR-2 #316

DeepSeek · Unknown

Research Papers (1)

HF Papers · Stability AI · research · 1mo ago

Entropy as a Structural Prior: How a Log-Barrier on DiT Belief Space Drives Musical Diversity and Development

Confidence-based loss weighting is usually avoided in generative models because it accelerates errors when the model is confidently wrong, but this intuition breaks down in supervised diffusion training. We introduce the Eisbach log-barrier, a parameter-free weight derived from the entropy of the DiT output's spatial energy distribution: high entropy damps the gradient, while low entropy preserves it. Applied to LoRA fine-tuning of Stable Audio 3 Medium on MusicCaps, it unexpectedly yields stronger thematic development, clearer acoustic differentiation, and higher textural diversity than unweighted training, the opposite of mode collapse. This works because in supervised diffusion the gradient direction is locked to ground truth, so confidence only scales the step size, and because temporal entropy downweights flat samples while preserving high-contrast ones. The result is an online, self-referential data curriculum that emerges purely from the forward pass, with analyzed noise-level dynamics and testable predictions.

#huggingface #daily-papers
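The abstract above describes a confidence weight derived from the entropy of the model output's energy distribution, where high entropy damps the gradient and low entropy preserves it. The abstract does not give the formula, so the following is only a rough sketch of one way such a weight could look. The function names, the use of squared values as "energy", and the specific form `-log(normalized entropy)` are all assumptions made for illustration, not the paper's actual Eisbach log-barrier.

```python
import math


def normalized_entropy(values):
    """Shannon entropy of the energy distribution of `values`, scaled to [0, 1].

    Assumption: "energy" is taken to be the squared value of each element,
    normalized so the energies sum to 1.
    """
    energies = [v * v for v in values]
    total = sum(energies)
    n = len(energies)
    if n < 2 or total == 0.0:
        return 1.0  # treat degenerate or silent input as maximally flat
    entropy = 0.0
    for e in energies:
        p = e / total
        if p > 0.0:
            entropy -= p * math.log(p)
    return entropy / math.log(n)


def log_barrier_weight(values, eps=1e-8):
    """Hypothetical weight: -log(h), with h the normalized entropy.

    h near 1 (flat, high entropy) gives a weight near 0 (gradient damped);
    h near 0 (peaked, low entropy) gives a large weight (gradient preserved).
    `eps` is only a numerical guard against log(0), not a tuned parameter.
    """
    h = normalized_entropy(values)
    return -math.log(min(max(h, eps), 1.0))


def weighted_loss(prediction, target):
    """Mean squared error scaled by the entropy-derived weight.

    In a real training loop the weight would presumably be treated as a
    constant (no gradient through it), so it only scales the step size.
    """
    mse = sum((p - t) ** 2 for p, t in zip(prediction, target)) / len(prediction)
    return log_barrier_weight(prediction) * mse


if __name__ == "__main__":
    flat = [1.0, 1.0, 1.0, 1.0]      # uniform energy: high entropy
    peaked = [10.0, 0.1, 0.1, 0.1]   # energy concentrated in one element
    print(log_barrier_weight(flat), log_barrier_weight(peaked))
```

Under these assumptions a perfectly flat sample receives a weight of zero and a high-contrast sample keeps a positive weight, which mirrors the abstract's remark that flat samples are downweighted while high-contrast ones are preserved.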