A music video screenplay is not something most filmmakers talk about — but every music video that came together cleanly on set had some form of written document behind it. Whether you are directing your own band’s first video or working with an independent artist, writing a music video screenplay before you shoot will save you time, money, and a great deal of on-set confusion about what the video was supposed to be.
This guide covers the two main pre-production documents for music videos, how a music video script differs from a conventional film screenplay, and how to structure yours so that it actually works on the day.
Does a Music Video Need a Screenplay?
Not technically. Many music videos get made from a loose paragraph of notes, a mood board, and an afternoon of instinct. Some of those videos are excellent.
But “we’ll figure it out on the day” has a cost: wasted takes, unclear direction for cast, and a finished video that does not match the emotional arc of the song. A written document — even a short one — forces you to make decisions before they become expensive.
The Two Main Documents
Before you write anything, decide which document you actually need.
The treatment is a 2–3 page document written in flowing prose that pitches the visual concept, tone, and narrative of the video. It is written for the artist, the label, or the producer — people who need to approve the concept before money is spent. It reads like a short story blended with a visual manifesto. A treatment does not name specific shots. It creates atmosphere and communicates vision.
The video screenplay (or shooting script) is the working document used on production day. It breaks the video into actionable blocks — by verse, chorus, and bridge — with location, action, wardrobe notes, and lyric sync markers. This is what a music video screenplay actually is, and what the rest of this guide covers.
What Makes a Music Video Script Different
A music video screenplay shares some conventions with a film screenplay — scene headings, action descriptions, present tense — but its structure is fundamentally different.
It syncs to music, not story. Instead of three acts, you have verses, choruses, a bridge, and an outro. Your story, if there is one, has to fit inside those musical containers.
The “dialogue” is the lyrics. You notate lyric cues to mark where specific moments need to land on screen. A climactic image that hits two beats after the drop is only useful if everyone on set knows when that is.
Scenes are measured in time, not pages. A verse might run 45 seconds. Your script block for that verse covers exactly those 45 seconds of screen content.
Story is optional. Plenty of great music videos are pure visual concept — no narrative at all. Your screenplay needs to account for whatever approach you are taking.
The Structure of a Music Video Script
Here is the block structure that works for most music videos.
Header block:
- Artist name and song title
- Director name
- BPM and total runtime
- Key tempo landmarks (chorus hits at 0:42, drop at 1:30, etc.)
- Overall concept in one sentence
Content blocks — one per section of the song:
VERSE 1 (0:00 - 0:45)
Location: EXT. WAREHOUSE DISTRICT - DUSK
Action: Diana walks slowly through scattered streetlights.
She pauses at a payphone, does not pick it up.
Wardrobe: White coat, bare feet.
Lyric cue: At 0:28 - "I left it all behind" - CLOSE on her face.
Repeat this structure for CHORUS 1, VERSE 2, BRIDGE, CHORUS 2, and OUTRO.
How to Notate Lyric Cues
Lyric cue notation is one of the most useful things you can do in a music video script — and almost nobody does it explicitly. The format is simple:
At [timestamp] ([section]) — "[lyric line]" — [visual instruction]
Examples:
At 0:47 (CHORUS) — "I was never yours" — Diana steps forward into lightAt 1:32 (DROP) — instrumental — Cut to wide: entire band, silhouetteAt 2:14 (BRIDGE) — "burning" — Slow push into her eyes
These markers give your editor a map and give your camera operator an objective on the day. Everyone knows what they are capturing and why.
Three Types of Music Video
Most music videos combine elements from all three approaches. Knowing which is primary helps you write the right kind of script.
Concept-driven: Pure visual abstraction — no story, just a sustained aesthetic or idea. Your script is essentially a mood guide with location and wardrobe notes.
Narrative-driven: A mini-story that runs parallel to the song. This type benefits most from a screenplay structure, because the story has to be compressed into 3–4 minutes with no room for setup.
Performance-driven: The artist performs the song, often intercut with narrative or concept footage. Most mainstream music videos are predominantly performance-based. Your script notes performance locations, lighting cues, and any choreography, then documents what the intercutting story looks like.
Practical Tips for Writing Your First Music Video Script
Listen to the song at least ten times before writing anything. Every great music video decision starts with a deep understanding of the music. Know the BPM, know where the energy peaks and drops, know the emotional arc of the lyrics.
Map the emotional arc of the song first. If verse one is quiet and resigned and the chorus is explosive, your visuals need to reflect that arc — even if the story is abstract.
Plan your locations before writing scenes. Unlike a short film screenplay, location access can define your concept in a music video. If you have two hours in one location, write for one location.
Write the chorus moment last. The chorus is your visual climax. Write everything else first, then design the chorus beat to land hardest. If the rest of your script is doing its job, the chorus will feel earned.
Does It Use Standard Screenplay Format?
A hybrid. Use standard scene headings (INT./EXT. LOCATION — TIME) for location clarity, but drop the rigid dialogue margins since there is no spoken dialogue. A music video script is an internal working document, not a submission to a development executive, so strict spec formatting rules matter less.
That said, clear structure and consistent formatting make the document easier to use on set. Once you have your script, building a shot list is a natural next step — especially for a narrative or performance video where specific shots need to be planned in advance.
For guidance on standard screenplay formatting rules, read screenplay format: the complete guide.
Ready to write your music video script?
Screenplay Writer is a Google Docs add-on that automatically formats scene headings, action lines, and all standard screenplay elements as you type — useful for structuring music video scripts and short film screenplays alike. 21-day free trial.
Try Screenplay Writer Free →