Stable Audio 2.5 | Models | Pixio

Stable Audio

Stable Audio on Pixio (e.g. Stable Audio 2.5) lets you create or transform audio: text-to-audio, inpainting (edit parts of a clip), or audio-to-audio for sound design and music. Use it when you need prompt-driven music or sound design with the option to edit existing clips (inpaint) or transform them (audio-to-audio).

Use this when

You need text-to-audio for music or sound design (describe genre, mood, length).
You want to edit part of a clip (inpainting)—replace or fix a segment without re-generating the whole thing.
You need audio-to-audio (transform an existing clip with a prompt—style, mood, or content change).
You are building sound design, background music, or SFX with Stable Diffusion-style control.

Modes in Pixio

Mode	Input	Best for
Text to Audio	Prompt (genre, mood, duration)	New music or sound design from scratch
Inpainting	Existing clip + mask + prompt	Edit or replace a segment
Audio to Audio	Existing clip + prompt	Transform style, mood, or content

Options

Option	Values	Notes
Duration	Depends on backend (e.g. up to 90s or more)	Check Pixio for limits
Prompt	Genre, mood, instruments, structure	Be specific for best results
Credits	Plan-based	Check model card in Pixio

When to use Stable Audio vs other models

Scenario	Best choice
Text-to-audio + inpainting + audio-to-audio	Stable Audio
Music only (no edit)	Pixio Music, Lyria 2, MiniMax Music, Songcraft
Speech / TTS	ElevenLabs TTS, MiniMax Speech
Sound effects only	Music Compose Sound Effects

Tips

Clear prompt: genre, mood, instruments, and length (e.g. "dark ambient, 60s, pads and subtle percussion").
Inpainting when you need to fix or replace a section of an existing clip.
Audio-to-audio for style transfer or mood change on an existing track.

Stable Audio 2.5

How to get the best out of Stable Audio 2.5