Name: Stable Audio 2.0
Author: Stability AI

Overview

Stable Audio 2.0 by Stability AI is a music and audio generation model that produces full tracks with coherent musical structure up to 3 minutes long at 44.1kHz stereo. It introduced audio-to-audio generation — upload and transform audio samples using natural language prompts. Trained exclusively on a licensed dataset from AudioSparx.

Key Strengths

Full tracks: Up to 3 minutes with coherent structure — intros, development, and outros
44.1kHz stereo: High-fidelity, production-ready audio quality
Audio-to-audio: Upload audio and transform it with text prompts (style transfer, remixing)
Sound effects: Generate SFX from text descriptions
Variations: Create variations of existing audio
Licensed training data: Trained on AudioSparx library with opt-out respect and fair compensation

Prompting Tips

Describe Musical Attributes

Focus on genre, instruments, mood, and production style:

Cinematic orchestral score, building tension, low strings and
brass, timpani rolls, gradually increasing intensity,
dramatic and epic, suitable for a movie trailer

For Sound Effects

Thunder rolling across a mountain valley with echoes,
heavy rain on a metal roof, occasional wind gusts

Audio-to-Audio Transformation

Upload source audio and describe the transformation:

Transform this acoustic guitar recording into a synthwave version
with pulsing bass, retro synth pads, and drum machine

Don't Use Lyrics or Structure Tags

Stable Audio generates from descriptions, not lyrics. No [Verse]/[Chorus] tags.

Parameters

Parameter	Description
duration	Up to 180 seconds (3 minutes)
steps	Diffusion steps 50-200 (default 100)
cfg_scale	Guidance scale 1-15 (default 7.0)
negative_prompt	What to avoid in the audio
seed	Reproducibility seed

Known Limitations

No vocal/lyrics generation — instrumental and SFX only
Less versatile than Suno for full songs with vocals
Requires more diffusion steps for high quality (slower)
Negative prompt effectiveness is limited
Maximum 3 minutes per generation

Best For

Background music for video and film production
Sound effect generation
Audio style transfer and remixing
Production-quality instrumental tracks
Licensed, IP-safe audio generation

Pricing

Available on stability.ai
Stable Audio 2.5 enterprise pricing available
API access for production workloads
Self-hosted options for enterprise

Stable Audio 2.0