This guide summarizes the most effective techniques for prompting the Eleven Music model. It covers genre & creativity, instrument & vocal isolation, musical control, and structural timing & lyrics.
The model is designed to understand intent and generate complete, context-aware audio based on your goals. High-level prompts like “ad for a sneaker brand” or “peaceful meditation with voiceover” are often enough to guide the model toward tone, structure, and content that match your use case.
The model adheres closely to genre conventions and emotional tone. You can describe tone directly with mood words or indirectly through musical descriptors that convey the mood; it responds effectively to both:
Prompt length and detail do not always correlate with better quality outputs. For more creative and unexpected results, try using simple, evocative keywords to let the model interpret and compose freely.
You can separate generated music into stems in the download menu for a given track. To create stems with greater control, use targeted prompts and structure:
To improve stem quality and control:
The model accurately follows BPM and often captures the intended musical key. To gain more control over timing and harmony, include tempo cues like “130 BPM” and key signatures like “in A minor” in your prompt.
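The tempo and key cues above can be appended to a base prompt programmatically. The following is a minimal sketch, not part of any official SDK: `with_tempo_and_key` is a hypothetical helper, and the comma-separated phrasing is an assumption about what reads naturally in a prompt. It only builds the prompt text and does not call the model.

```python
from typing import Optional


def with_tempo_and_key(prompt: str, bpm: Optional[int] = None, key: Optional[str] = None) -> str:
    """Append tempo and key cues to a base prompt (hypothetical helper, not an official API)."""
    # Drop trailing punctuation so the cues join cleanly with commas.
    parts = [prompt.strip().rstrip(".,")]
    if bpm is not None:
        if bpm <= 0:
            raise ValueError("bpm must be a positive integer")
        parts.append(f"{bpm} BPM")
    if key:
        parts.append(f"in {key.strip()}")
    return ", ".join(parts)


tempo_prompt = with_tempo_and_key("Driving synthwave track", bpm=130, key="A minor")
print(tempo_prompt)
```

Leaving `bpm` or `key` unset simply omits that cue, which leaves the choice to the model.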
To influence vocal delivery and tone, use expressive descriptors such as “raw,” “live,” “glitching,” “breathy,” or “aggressive.”
The model can effectively render multiple vocalists. Use prompts like “two singers harmonizing in C” to direct the vocal arrangement.
In general, more detailed prompts lead to greater control and expressiveness in the output.
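Vocal delivery descriptors and arrangement cues can be layered onto a prompt in the same way. This sketch assumes a hypothetical helper, `with_vocals`, and an assumed phrasing ("<descriptors> vocals"); neither comes from the model's documentation, and the exact wording that works best may differ.

```python
from typing import Iterable, Optional


def with_vocals(prompt: str, descriptors: Iterable[str] = (), arrangement: Optional[str] = None) -> str:
    """Add vocal delivery descriptors and an arrangement cue (hypothetical helper)."""
    parts = [prompt.strip().rstrip(".,")]
    # Keep only non-empty descriptors such as "raw" or "breathy".
    cues = [d.strip() for d in descriptors if d.strip()]
    if cues:
        parts.append(", ".join(cues) + " vocals")
    if arrangement:
        parts.append(arrangement.strip())
    return ", ".join(parts)


vocal_prompt = with_vocals(
    "Indie folk ballad",
    descriptors=["raw", "breathy"],
    arrangement="two singers harmonizing in C",
)
print(vocal_prompt)
```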
You can specify the length of the song (e.g., “60 seconds”) or use auto mode to let the model determine the duration. If lyrics are not provided, the model will generate structured lyrics that match the chosen or auto-detected length.
By default, most prompts produce music with lyrics. To generate music without vocals, add “instrumental only” to your prompt. You can also write your own lyrics for more creative control. The model uses your lyrics in combination with the prompt length to determine vocal structure and placement.
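As a rough illustration of the length and vocal options above, the sketch below appends a duration cue and the “instrumental only” phrase to a prompt. `finalize_prompt` is a hypothetical helper, and putting the duration in the prompt text is an assumption; the length may instead be set through a separate control. Omitting `seconds` leaves the duration unspecified.

```python
from typing import Optional


def finalize_prompt(prompt: str, seconds: Optional[int] = None, instrumental: bool = False) -> str:
    """Append optional length and instrumental cues to a prompt (hypothetical helper)."""
    parts = [prompt.strip().rstrip(".,")]
    if seconds is not None:
        if seconds <= 0:
            raise ValueError("seconds must be a positive integer")
        parts.append(f"{seconds} seconds")
    if instrumental:
        # Phrase taken from the guide: suppresses vocals and lyrics.
        parts.append("instrumental only")
    return ", ".join(parts)


final_prompt = finalize_prompt("Peaceful meditation music", seconds=60, instrumental=True)
print(final_prompt)
```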
To manage when vocals begin or end, include clear timing cues like:
The model supports multilingual lyric generation. To change the language of a generated song in our UI, use follow-ups like “make it Japanese” or “translate to Spanish.”
The model allows you to move beyond song descriptors and into intent for maximum creativity.
For precise control over section structure, lyrics placement, and multi-vocalist arrangements, use composition plans instead of simple text prompts.