AudioVisual-Caption
ASID-Captioner-3B is a open-weight AudioVisual-Caption specialized model.
Running this yourself: consumer gpu should be enough.
---
Quality Score
Arena ELO
3B
Parameters
Context
Sign in to join the discussion
3.1K
Downloads
33
Likes
Feb 2026
Released