Compose songs from a prompt or a composition plan. Create instrumentals and full tracks with ElevenLabs Music (Compose).
Audio prompts work best when they define mood, pacing, structure, and finish. The more clearly you describe the role of the sound, the cleaner the result tends to be.
Best results start with genre, mood, structure, and arrangement.
ElevenLabs Music on Pixio composes songs from a text prompt or composition plan: instrumentals and full tracks with ElevenLabs quality. Use it when you need text-to-music for BGM, ads, or short-form content—and when you want ElevenLabs (same vendor as TTS) for music. For full songs with vocals (Suno-style), see Songcraft; for short BGM or SFX, see Music Compose Sound Effects.
ElevenLabs Music on Pixio composes songs from a text prompt or composition plan: instrumentals and full tracks with ElevenLabs quality. Use it when you need text-to-music for BGM, ads, or short-form content—and when you want ElevenLabs (same vendor as TTS) for music. For full songs with vocals (Suno-style), see Songcraft; for short BGM or SFX, see Music Compose Sound Effects.
| Mode | Input | Best for |
|---|---|---|
| Text to Music | Prompt or composition plan | Instrumentals and full tracks; genre, mood, structure |
| Option | Values | Notes |
|---|---|---|
| Duration | Depends on backend | Check Pixio for limits |
| Output format | MP3, etc. | Check model card in Pixio |
| Credits | Plan-based | Check model card in Pixio |
Credits and duration limits depend on plan; check the model card in Pixio.
[Genre] + [Mood] + [Structure/pacing] + [Instruments or finish]. Describe role, pacing, and finish—not only genre or mood. Example: "Upbeat corporate BGM, 60 seconds, piano and strings, optimistic, clean mix."
"Upbeat corporate BGM, 60 seconds. Piano and strings, optimistic, clean mix. No vocals."
"Cinematic trailer, dark and tense. 90 seconds. Orchestral, building to a climax. Epic, dramatic."
"Lo-fi hip hop, relaxed. 2 minutes. Chill beats, soft piano, vinyl crackle. Late night study vibe."
"Acoustic folk, warm and intimate. 1 minute. Guitar and light percussion. Campfire, storytelling mood."
| Scenario | Best choice |
|---|---|
| ElevenLabs music, compose from prompt or plan | ElevenLabs Music |
| Full songs with vocals (Suno-style) | Songcraft |
| Short BGM or SFX | Music Compose Sound Effects |
| Speech / TTS | ElevenLabs TTS |
Use production language, not just genre labels.
Tell the model how the energy should move over time.
For speech, define delivery style, tone, and pacing.
For music, define arrangement and emotional arc early.
A strong audio prompt describes role, pacing, tone, and finish so the output feels produced rather than generic.
Describe the genre, emotional arc, instrumentation, and structure instead of relying on broad tags alone.
Define how the piece should progress so the output feels intentional instead of flat or repetitive.
Use stronger prompts and cleaner references once the direction is already working.
ElevenLabs Music is strongest when the brief is clear about function: what the sound should do, how it should move, and what it should feel like.
Use structure language early so the output lands closer to production-ready on the first passes.
For voice work, specify delivery and character. For music, specify arrangement and emotional progression.
Decide whether the output is carrying narrative, mood, rhythm, or all three.
Describe the build, energy, and transitions so the result has movement instead of flattening out.
Once the direction is right, refine and separate instead of regenerating blindly.
Pair voice generation with cloning when continuity across campaigns or characters matters.
Use generated music or speech as the finishing layer once the visual cut is already working.