
MiniMax Music V2 — Turn Your Lyrics into a Finished Song
From Written Lyrics to a Fully Produced Track in Minutes
MiniMax Music V2 is a lyrics-first AI song generator: you bring the words, it brings the band. Describe the style, mood, and scenario in a short prompt — synthwave ballad, acoustic folk duet, stadium rock anthem — then paste your lyrics, line by line. Structure tags like [Intro], [Verse], [Chorus], [Bridge], and [Outro] give you direct control over the song's architecture, so the hook lands exactly where you wrote it. The model composes the melody, arranges the instruments, and performs the vocals in one pass, returning a finished 44.1kHz track ready to download. Because the lyrics are yours, the song is genuinely yours — ideal for original releases, gifts, jingles, and storytelling content.
Most AI music tools treat lyrics as an afterthought — MiniMax Music V2 makes them the blueprint. The model reads your lyric sheet the way a producer would: section tags define the structure, line breaks define the phrasing, and your style prompt defines the sonic palette. That separation matters in practice. The prompt (10–300 characters) is where you set genre, tempo feel, mood, and instrumentation; the lyric sheet (10–3,000 characters) is where you write the actual words to be sung. Keeping them separate means you can iterate on the production without touching the lyrics, or rewrite a verse without re-describing the whole song.
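The two length limits are easy to check before you submit. The sketch below is a minimal illustration, assuming the limits count characters the way Python's `len` does; `validate_inputs` is a hypothetical helper, not part of any MiniMax or fal.ai API.

```python
PROMPT_LIMITS = (10, 300)    # style prompt range, as stated above
LYRICS_LIMITS = (10, 3000)   # lyric sheet range, as stated above


def validate_inputs(prompt: str, lyrics: str) -> list[str]:
    """Return a list of problems; an empty list means both inputs fit."""
    problems = []
    for name, text, (low, high) in (
        ("prompt", prompt, PROMPT_LIMITS),
        ("lyrics", lyrics, LYRICS_LIMITS),
    ):
        # Assumption: the service counts characters the same way len() does.
        length = len(text)
        if not low <= length <= high:
            problems.append(
                f"{name} is {length} characters; expected {low}-{high}"
            )
    return problems
```

Because the two fields are checked independently, a rewritten verse only needs the lyrics check to pass again; the prompt is unaffected.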
The vocal engine is what sets V2 apart from earlier text-to-music systems. Vocals are performed with natural phrasing, breath placement, and dynamics that follow the emotional arc of your lyrics — a quiet first verse can build into a belted chorus if your structure suggests it. Output is rendered at 44.1kHz with 256kbps MP3 encoding by default, clean enough for streaming platforms, video soundtracks, and podcast intros.
A practical workflow: start with a tight prompt ('melancholic indie pop, female vocal, sparse piano then full band'), write one verse and one chorus with [Verse] and [Chorus] tags, and generate. Listen, adjust the adjectives in the prompt, and expand the lyric sheet section by section. Because generation takes minutes rather than studio days, you can A/B different choruses or moods cheaply. The model is served through a partner endpoint on fal.ai and supports commercial use, so tracks you create can ship in client work, ads, games, and monetized content.
How to Generate a Song with MiniMax Music V2
Describe the Style
Write a short prompt covering genre, mood, and scenario — for example 'uplifting synth-pop with a driving beat, female vocal, festival energy'.
Paste Your Lyrics
Add your lyrics one line at a time and structure them with [Intro], [Verse], [Chorus], [Bridge], and [Outro] tags so the song builds the way you wrote it.
Generate & Download
Click Generate and the finished track appears in minutes. Preview it in the player, then download the MP3. A copy is also kept in your dashboard gallery for 7 days.
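For readers who prefer to prepare inputs in code, the first two steps can be sketched as follows. This is an illustration only: the field names `prompt` and `lyrics` in the payload are assumptions, the helper names are hypothetical, and the lyric lines are placeholders. Check the endpoint's published schema before sending anything.

```python
import json

ALLOWED_TAGS = ("Intro", "Verse", "Chorus", "Bridge", "Outro")


def build_lyric_sheet(sections: list[tuple[str, list[str]]]) -> str:
    """Join (tag, lines) pairs into a tagged lyric sheet, one lyric per line."""
    out = []
    for tag, lines in sections:
        if tag not in ALLOWED_TAGS:
            raise ValueError(f"unsupported structure tag: {tag}")
        out.append(f"[{tag}]")
        out.extend(line.strip() for line in lines)
    return "\n".join(out)


def build_request(style_prompt: str, sections: list[tuple[str, list[str]]]) -> dict:
    # Hypothetical field names; the real endpoint may name them differently.
    return {"prompt": style_prompt, "lyrics": build_lyric_sheet(sections)}


request = build_request(
    "uplifting synth-pop with a driving beat, female vocal, festival energy",
    [
        ("Verse", ["City lights are calling out my name"]),
        ("Chorus", ["We run, we run until the morning comes"]),
    ],
)
print(json.dumps(request, indent=2))
```

Keeping the style prompt and the section list as separate arguments mirrors the two input fields, so either one can be changed without rebuilding the other.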
MiniMax Music V2 Technical Specifications
| Specification | Details |
|---|---|
| Provider | MiniMax |
| Platform | fal.ai (partner endpoint) |
| Style Prompt | 10–300 characters — style, mood, scenario |
| Lyrics | 10–3,000 characters, one lyric per line |
| Structure Tags | [Intro], [Verse], [Chorus], [Bridge], [Outro] |
| Vocals | Yes — performed from your lyrics |
| Audio Output | MP3, 44.1kHz, 256kbps (default) |
| Commercial Use | Supported |
| Processing | Asynchronous, typically 1–3 minutes |
Why Choose MiniMax Music V2
Your Lyrics, Sung for Real
The model performs the exact words you wrote with natural phrasing and dynamics — not approximate mumbling. Section tags keep verses, choruses, and bridges exactly where you placed them.
Studio-Quality Output
Tracks render at 44.1kHz with 256kbps MP3 encoding — clean enough for streaming, video soundtracks, and client deliverables without post-processing.
Built for Iteration
Style prompt and lyrics are separate inputs, so you can re-roll the production without touching the words, or rewrite one verse without re-describing the song.
MiniMax Music V2 vs Other AI Music Models
| Feature | MiniMax Music V2 | ElevenLabs Music | Suno v4 |
|---|---|---|---|
| Primary Input | Style prompt + your lyrics | Text prompt (optional composition plan) | Prompt or lyrics |
| Lyrics Control | Full — line-by-line with section tags | Optional via composition plan | Partial |
| Vocals | Yes | Yes (or forced instrumental) | Yes |
| Max Length | Full song | 10 minutes | ~4 minutes |
| Output Quality | 44.1kHz / 256kbps MP3 | Up to 44.1kHz / 192kbps MP3 | Streaming quality |
| Best For | Original songs from your lyrics | Soundtracks & instrumentals | Quick song sketches |
What Can You Create with MiniMax Music V2
Original Song Releases
Turn finished lyric sheets into release-ready demos and explore how different genres carry the same words before committing to a studio session.
Personalized Gifts
Write lyrics about a friend's story, wedding, or anniversary and deliver a real, singable song — the most memorable greeting card there is.
Content Soundtracks
Create custom theme songs for YouTube channels, podcasts, and TikTok series with lyrics that name-drop your brand or running jokes.
Jingles & Ads
Generate catchy commercial hooks where the product name sits exactly on the beat you want — iterate ten variants in an afternoon.
Songwriting Drafts
Hear your half-finished lyrics performed to test rhyme flow and chorus strength, then refine the writing based on what you hear.
Game & Story Music
Produce in-world songs for games, audiobooks, and animations — tavern ballads, faction anthems, or end-credit themes with story-specific lyrics.
Frequently Asked Questions About MiniMax Music V2
How do I control the song structure?
Use structure tags directly inside your lyrics: [Intro], [Verse], [Chorus], [Bridge], and [Outro]. Each tag starts a section, and the lines that follow belong to it. The model arranges the music so each section sounds like what the tag promises — choruses lift, bridges pivot, outros resolve.
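To see how tags group the lines that follow them, you can split a lyric sheet locally before submitting it. This is a rough sketch of the grouping rule described above, not the model's actual parser; `split_sections` and the "Untagged" label are hypothetical.

```python
import re

TAG_RE = re.compile(r"^\[(Intro|Verse|Chorus|Bridge|Outro)\]$")


def split_sections(lyric_sheet: str) -> list[tuple[str, list[str]]]:
    """Group lyric lines under the most recent structure tag."""
    sections: list[tuple[str, list[str]]] = []
    for raw in lyric_sheet.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank lines carry no lyric
        match = TAG_RE.match(line)
        if match:
            # A tag on its own line starts a new section.
            sections.append((match.group(1), []))
            continue
        if not sections:
            # This sketch's choice: collect lines that precede any tag.
            sections.append(("Untagged", []))
        sections[-1][1].append(line)
    return sections
```

A quick look at the result shows whether a chorus accidentally ended up inside a verse because a tag was mistyped or missing.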
What should go in the style prompt versus the lyrics?
The style prompt (10–300 characters) describes how the song should sound: genre, mood, tempo feel, instrumentation, and vocal character. The lyrics field (10–3,000 characters) contains the words to be sung, one lyric per line, along with the structure tags. Keeping production notes out of the lyrics gives cleaner vocal results.
How long does generation take?
Typically one to three minutes for a full song. The task runs asynchronously — you can watch progress on the page, and the finished track also lands in your dashboard gallery, so you don't need to keep the tab open.
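If you drive an asynchronous task like this from your own code, the usual pattern is to poll until it finishes. The sketch below shows that pattern in general terms. The `state` values, the `error` key, and `wait_for_track` itself are hypothetical, and `fetch_status` stands in for whatever status call your client provides.

```python
import time
from typing import Callable


def wait_for_track(
    fetch_status: Callable[[], dict],
    interval_s: float = 5.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Poll fetch_status until the task completes, fails, or times out."""
    deadline = time.monotonic() + timeout_s
    while True:
        status = fetch_status()
        # Hypothetical state names; a real API may use different ones.
        if status.get("state") == "completed":
            return status
        if status.get("state") == "failed":
            raise RuntimeError(status.get("error", "generation failed"))
        if time.monotonic() >= deadline:
            raise TimeoutError("track was not ready before the timeout")
        sleep(interval_s)
```

The default timeout of 300 seconds is this sketch's choice, picked to leave headroom over the typical one to three minutes.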
Can I use the generated songs commercially?
Yes. MiniMax Music V2 supports commercial use, so tracks you generate can be used in monetized videos, ads, client projects, games, and releases. Since the lyrics are your own writing, the creative core of the song is yours from the start.
What audio format do I get?
Songs are delivered as MP3 at 44.1kHz with 256kbps encoding by default — high enough quality for streaming platforms and video soundtracks. Download the file from the player or from your dashboard gallery; gallery media is kept for 7 days, so save anything you want to keep.
How many credits does a song cost?
Each generation costs a flat number of credits regardless of song length; the exact amount is shown on the Generate button before you submit. If a generation fails, the credits held for it are automatically refunded to your balance.


