MusicGen by Meta AI is an open-source music generation model that creates music from text descriptions or melody inputs. Part of Meta's AudioCraft family, it uses a two-stage architecture combining EnCodec neural audio compression with a transformer-based language model. Trained on 20,000 hours of licensed music, it produces coherent audio at 32 kHz, with stereo variants also available.
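To make the two-stage design concrete, here is a minimal sketch of the token budget the transformer works with. The figures (50 EnCodec frames per second, 4 parallel codebooks) are taken from the MusicGen paper's 32 kHz configuration; this is an illustration of the scale, not code from the library.

```python
# MusicGen's language model predicts EnCodec codebook tokens, not raw samples.
# Assumed figures from the 32 kHz MusicGen configuration:
FRAME_RATE = 50      # EnCodec frames per second of audio
NUM_CODEBOOKS = 4    # parallel residual codebooks per frame

def token_budget(duration_s: float) -> int:
    """Total codebook tokens the LM must generate for a clip."""
    frames = int(FRAME_RATE * duration_s)
    return frames * NUM_CODEBOOKS

# A 30 s clip is 1500 frames, i.e. 6000 tokens across the four codebooks.
```

This is why generation length is bounded: token count grows linearly with duration, and the transformer's context has to hold the whole sequence.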
MusicGen is conditioned on text descriptions, not lyrics, so don't use [Verse]/[Chorus] tags:
```
Upbeat jazz with piano and saxophone, walking bass line,
brushed drums, warm and lively, 130 bpm, reminiscent of
a smoky New York club
```
Keep descriptions under 500 characters with key musical attributes:
```
Ambient electronic, soft pads, gentle arpeggios,
reverb-heavy, dreamy and floating, 72 bpm
```
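A simple way to keep prompts within that budget is to assemble them from short attribute phrases. The helper below is a sketch of that workflow; `MAX_PROMPT_CHARS` reflects the practical limit suggested above, not a limit enforced by the model itself, and `build_prompt` is a hypothetical name.

```python
MAX_PROMPT_CHARS = 500  # practical guideline from above, not a hard model limit

def build_prompt(*attributes: str) -> str:
    """Join musical attribute phrases into one comma-separated description."""
    prompt = ", ".join(a.strip() for a in attributes if a.strip())
    if len(prompt) > MAX_PROMPT_CHARS:
        raise ValueError(
            f"prompt is {len(prompt)} chars; keep it under {MAX_PROMPT_CHARS}"
        )
    return prompt

# build_prompt("Ambient electronic", "soft pads", "72 bpm")
# → "Ambient electronic, soft pads, 72 bpm"
```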
Upload a reference audio file to guide the harmonic and melodic structure (melody conditioning) while the text prompt controls the style.
| Parameter | Description |
|---|---|
| duration | Audio length in seconds (up to 30 s per generation) |
| temperature | Sampling randomness, 0-1.5 (default 1.0) |
| top_k | Number of highest-probability tokens kept for sampling (default 250) |
| top_p | Nucleus sampling threshold (default 0.0, i.e. disabled) |
| cfg_coef | Classifier-free guidance weight (default 3.0) |
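To show how these parameters interact, here is a minimal pure-Python sketch of temperature plus top-k sampling over a logit vector, and of how a classifier-free guidance weight combines conditioned and unconditioned logits. This illustrates the standard techniques the table refers to, not MusicGen's actual implementation; `sample_next_token` and `apply_cfg` are hypothetical names.

```python
import math
import random

def sample_next_token(logits, temperature=1.0, top_k=250, seed=0):
    """Temperature-scale logits, keep the top_k highest, sample one index."""
    scaled = [l / max(temperature, 1e-6) for l in logits]
    if top_k < len(scaled):
        cutoff = sorted(scaled, reverse=True)[top_k - 1]
        scaled = [s if s >= cutoff else float("-inf") for s in scaled]
    m = max(scaled)  # subtract max for numerical stability
    weights = [math.exp(s - m) for s in scaled]
    return random.Random(seed).choices(range(len(weights)), weights=weights)[0]

def apply_cfg(cond_logits, uncond_logits, cfg_coef=3.0):
    """Classifier-free guidance: push logits toward the text-conditioned ones."""
    return [u + cfg_coef * (c - u) for c, u in zip(cond_logits, uncond_logits)]
```

Lower temperature or smaller top_k makes output more predictable; a higher guidance weight makes the audio follow the text prompt more literally at some cost to diversity.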