Building a World from a Single Album Cover
Ray de Guzman
POV: You're building a world from your album cover.
I stared at the cover art for my latest song CLOCK and thought: what if this one image could become an entire universe?
Turns out it can. And the tools to do it are already here.
It Started with a Character Sheet
I've been deep in AI image generation lately, and one strategy I keep hearing about is building character sheets before you attempt anything complex like animation or video. The logic is simple: if the AI has a fully defined character to reference, it can keep that character consistent across multiple outputs. No more "same person but their face changed" problem.
So I took the album art for CLOCK and fed it to GPT Images 2.0. But I didn't just hand it the image. I also gave it:
- The full lyrics of the song
- The story behind the song -- what it's really about underneath the surface
- The encoded meaning behind every line
The result? Character sheets that feel like they belong in the world the song already built in my head.
🤖 My prompt strategy:The key insight is feeding the AI three layers of context, not just the image:
- The visual -- the album cover as the aesthetic anchor
- The lyrics -- so the AI understands the emotional arc and narrative
- The subtext -- the deeper meaning behind the song, which gives the AI enough context to make design choices that resonate with the story
This produces dramatically better results than just saying "make a character sheet from this image."
The Prompt
Here's the actual prompt I sent to ChatGPT (with the CLOCK album cover attached as an image):
Create a character sheet for this character. For context, this is an album cover for my latest song "CLOCK" which is a subversive electronic indie dance track that on the surface is like a modern day "Baby it's cold outside" meets Daft Punk, but below the surface, it encodes the meaning of not letting internal voices rush you. It's about getting your time back and knowing you have more time than you think.
# Encoded Layers
- Session Encoding Key (from RULER + GIFT consolidation)
- "I need to go / You need to stay" = IFS dialogue between wound voice and awakened self
- "I keep / You keep" alternation = same technique as Nowhere To Run, two voices in one singer
- The train = every artificial deadline, every "I should be further by now"
- "You keep whisperin' we're right on time" = surplus math, the ruler rehired to measure what's here
- "You keep smilin' like you already know" = the Goddess of Time, patient, always waiting
- "I keep checkin' on the time like it's a crime" = the guilt of wound math, measuring deficit
- "She's always on the clock" = the Goddess of Time, always present, always givin'
- "The hands don't stop / So why should we?" = abundance — time never runs out, so neither does life
- "You need nothin' but this moment" = dismantling the deferral loop, choosing today
- "I need to—" breaking mid-sentence = the wound voice losing power, going silent
- "Something's spinnin' and it don't stop" = circular time, no countdown, infinite
- The whole couple dynamic = you talking to yourself the way you'd talk to someone you love
- Surface story = modern "Baby It's Cold Outside" — playful couple, one convincing the other to stay
- Session source: RULER + GIFT Frameworks (The Measurement Reversal & The Time Abundance Discovery), 2026-04-02
| Line # | Lyric | Encoded Meaning |
| --- | --- | --- |
| 1 | I need to go, my train is any minute | The wound voice panicking about artificial deadlines. Vocal ambiguity: sounds like both "any minute" (vague panic) and "in a minute" (fake precision) — the urgency can't even commit to its own story, exposing the deadline as fabricated |
| 2 | You need to stay, this song is just beginnin' | The awakened self / Goddess of Time saying abundance is just starting |
| 3 | I need to leave, I've got an early mornin' | Tomorrow-deferral — sacrificing now for an imaginary future obligation |
| 4 | You need to dance with me until it's dawnin' | Choose the present fully — dawn will come on its own, no need to rush toward it |
| 5 | I keep sayin' that I can't stay long | The scarcity belief repeating — "I don't have enough time" |
| 6 | You keep hummin' me my favorite song | The Goddess of Time gently, patiently offering abundance without forcing |
| 7 | I keep reachin' for my jacket sleeve | The urge to flee into productivity / the next task / the deferral loop |
| 8 | You keep winkin' that I ain't leavin' | The Goddess knows you want to stay — she sees through the wound's performance |
| 9 | 'Round the CLOCK, 'round the CLOCK | Circular time — no linear countdown, no finish line |
| 10 | We're dancin' round the CLOCK | Celebrating time instead of running from it — "Rock Around the Clock" lineage |
| 11 | 'Round the CLOCK, 'round the CLOCK | Circular time — no linear countdown, no finish line |
| 12 | Something's spinnin' and it just don't stop | Time is infinite — the spinning is abundance, not a threat |
| 13 | Tell me why would you run | Challenging the background radiation anxiety — why are you always fleeing? |
| 14 | When the night's this young? | There's so much time left — scarcity was a lie |
| 15 | The hands don't stop | The clock's movement = proof of abundance, not a warning — reframed from threat to promise |
| 16 | So why should we? | If time never stops, why would you? The dare, the flirt, and the philosophy in one line |
| 17-18 | (Post-Chorus) 'Round the CLOCK... 'round the CLOCK... (x2) | The hook lingers — letting circular time wash over you, hypnotic repetition as the lesson soaks in |
| 19 | I need to call a Grab, it's way past twelve | Wound voice using arbitrary thresholds — midnight as fake deadline, like wound math's pass/fail |
| 20 | You need to pop that bubbly off that shelf | The awakened self says celebrate what's here — surplus math, count what you have |
| 21 | I need some shut eye, it's gettin' late | "It's getting late" = the universal scarcity phrase, time running out |
| 22 | You need to spin me one more time 'n stay | Choosing today over tomorrow — dismantling the deferral loop one spin at a time |
| 23 | I keep checkin' my wrist like it's a crime | The guilt of wound math — measuring time as if monitoring a deficit |
| 24 | You keep whisperin' we're ready to climb | Surplus math — Ruler Ray rehired, measuring upward from what you have |
| 25 | I keep tellin' you I really should go | The wound voice weakening — "really should" betrays doubt in its own urgency |
| 26 | You keep smilin' like you already know | The Goddess of Time — patient, all-knowing, waiting for you to see what she sees |
| 27 | 'Round the CLOCK, 'round the CLOCK | Circular time returns — now familiar, the mantra is settling into the body |
| 28 | We're dancin' round the CLOCK | Celebrating time instead of running from it — second time hits with more conviction |
| 29 | 'Round the CLOCK, 'round the CLOCK | Circular time — the repetition itself IS the proof that it never ends |
| 30 | Something's spinnin' and it just don't stop | Time is infinite — after Verse 2's specific anxieties, this line now feels like a direct answer |
| 31 | Tell me why would you run | The challenge deepens — after "you keep smilin' like you already know," running feels absurd |
| 32 | When the night's this young? | Scarcity was a lie — reinforced after the wound voice just tried "it's way past twelve" |
| 33 | The hands don't stop | The thesis repeated — clock hands as abundance, now undeniable after two full verses of evidence |
| 34 | So why should we? | The dare lands harder the second time — the wound voice is running out of arguments |
| 35-36 | (Post-Chorus repeat) 'Round the CLOCK... 'round the CLOCK... (x2) | Second vamp — deeper now, the hypnosis is working, the wound voice is losing ground |
| 37 | I need to— | The wound voice breaking mid-sentence — losing power, can't finish its own argument |
| 38 | You need to come a 'lil closer | The awakened self pulling you toward presence, not away |
| 39 | I need to— | Wound voice tries again, fails again — each attempt weaker |
| 40 | You need to spin me a 'lil slower | Surrendering to the moment — slowing down instead of rushing |
| 41 | I need to— | Third attempt — the wound is almost silent |
| 42 | You're not leavin' | The Goddess of Time states the truth — you were never going to leave, the wound was performing |
| 43 | I need to— | Final attempt — the wound voice is now just a reflex, no conviction left |
| 44 | You need nothin' but this moment | The complete dismantling of the deferral loop — nothing is needed beyond the present |
| 45 | 'Round the CLOCK, 'round the CLOCK | Circular time — now stated with full conviction, no more doubt |
| 46 | We're dancin' round the CLOCK | "We" — the wound and awakened self are now dancing together, integrated |
| 47 | 'Round the CLOCK, 'round the CLOCK | Circular time — triumphant repetition |
| 48 | Always spinning, she don't ever stop | The Goddess of Time revealed — "she" hidden in plain sight, time personified as infinite generous feminine force |
| 49 | Tell me why would I run | Shift from "you" to "I" — the singer has internalized the lesson, no longer needs convincing |
| 50 | When you're this much fun? | Time itself is fun, generous, playful — the complete reversal from threat to joy |
| 51 | The hands don't stop | The thesis — clock hands moving is proof of abundance, not a warning |
| 52 | So why should we? | The final integration — if time never stops giving, why would we stop living? |
The key was feeding it three things together: the visual (album cover image), the surface story (the playful couple dynamic), and the subtext (the encoded meaning behind every single line). The AI used all three to make design decisions that resonated with the song's deeper meaning, not just its surface aesthetic.
That encoded meaning table isn't something I invented for this prompt. Every song on the MIND SET album has one. Every track starts as a Mind Coding session -- a real psychological breakthrough that gets deconstructed into an encoding key and embedded line-by-line into the lyrics. The song sounds mainstream on the surface. The table is the operating manual underneath.
Go deeper:
- Song-Software: The Mind Coding Methodology -- the full process for how I make songs that double as personal-development software.
- MIND SET playlist -- listen to the album with the context in mind.
💡 Reusable version: Below is the same prompt with my CLOCK-specific details swapped out for placeholders. Drop in your own song, your own context, your own meaning -- the AI fills in the rest.
Make It Yours
Create a character sheet for this character. For context, this is an album cover for my latest song "[SONG TITLE]" which is a [GENRE / STYLE] track that is about [story/concept/feeling/idea].
# Song Context
[Paste your full lyrics here]
# Meaning Behind the Lyrics
[Write a line-by-line breakdown of what each lyric actually means, or a short paragraph explaining the deeper themes -- what the song is really doing underneath its surface. The more specific you are, the more the AI can make design choices that reflect the song's emotional core, not just its surface aesthetic.]
# Additional Details
[Anything specific you want the AI to include that might not come through from the lyrics alone. For example:
- Character specifics (the character's role, mythology, archetype, hidden powers, backstory)
- World specifics (the setting, time period, dimension, atmosphere)
- Concept specifics (a key visual metaphor, a symbolic object, a cultural reference, a recurring motif)
- Brand or aesthetic specifics (logo, font choices, color palette, color exclusions)
- Things you do NOT want ("no neon", "no nightclub setting", "no cyberpunk vibes")
This is where you steer the AI toward your specific vision. If you skip this, the AI fills the gaps with its own assumptions -- which is usually generic.]
The four layers -- visual, lyrics, subtext, and your specifics -- are what make this work. Skimp on any of them and you get a generic character sheet that could be from any song.
Results
The generation comes out as a one-pager which will be sufficient for your keyframe generation needs when it comes time to build the music video.

But I wanted to share this character sheet on instagram as a carousel, and through the power of GPT Images 2.0 you can now do this. Simply prompt it to "turn this a 3 slide (or however many slides you want) carousel for instagram".
At first I tried to generate a 10 slide carousel but the generations started to drift and lose consistency, so I tried again and limited it to 3 slides, which is what I ended up publishing.



What I Learned While Iterating
The first generation isn't always the right one. Here's what I discovered across 20+ iterations:
- One slide at a time. Asking for "slides 2 through 4" produced a single combined stack instead of three separate images. Generate each slide as its own request.
- Re-attach the character sheet for every character-heavy slide. Without the reference image freshly attached, the character's appearance drifts -- noticeably by slide 2. For pure typography or process slides, you don't need it.
- Strip the background noise. If you have unrelated screenshots, Notion windows, or app UI visible in the chat, the model will sometimes pull those visual elements into the output. Add: "Do not include browser windows, Notion UI, screenshots, or terminal windows."
- Brand details need explicit instructions. Logo font, body font, color palette, no-go colors -- all of it needs to be stated, not implied. Vague brand direction produces generic output.
- If drift gets bad enough, start a fresh chat. Context contamination from earlier iterations can stack up. A clean session with only the master character sheet and album cover as references often produces better results than fighting through 10 rounds of "no, not that one."
Why Stop at the Character?
Once I had the character sheets, the next thought was obvious: why not build the whole world?
Think about it. A single album cover already contains a visual direction, a mood, a color palette, a setting. If you designed that cover with intention -- if you had the concept in mind from the start -- then everything you need to expand into a full world is already encoded in that one image.
I started prototyping it inside the same carousel. Two slides in, the world was already showing up:
- Slide 2 became a mystical clock-dimension portal. The Goddess of Time stepping out of the album cover and into the universe the song lives in. Cosmic energy, glass and gold, no city, no nightclub. Just the birth of the world the song hints at.
- Slide 3 became the song's central metaphor as visual mythology. The Cinderella midnight reversal: shattered antique clock face, glass fragments, one abandoned slipper-like shard, clock hands frozen near twelve, and the portal opening instead of closing. The world's emotional thesis: you stayed past midnight and realized the magic was yours all along.
These aren't music video assets. They're concept art. But that's the point -- you can take a single album cover and, within an afternoon of prompting, see the world it implies. Environment, mythology, color script, mood. The album cover is the seed. The carousel is the first sketch of the world. The music video, if I ever make it, is just the next step.
From one album cover, you can prototype:
- Character sheets (front, side, back, expressions, poses)
- Environment concepts (the world the character lives in)
- Prop designs (objects that matter to the story)
- Color scripts (how the palette shifts across scenes)
- Mood boards for different moments in the song
The album cover becomes the seed. The AI becomes the world-builder.
The Story Behind the Story
Here's where it gets interesting.
All of my songs on the MIND SET album are intentionally subversive. They're built from Mind Coding sessions -- a process where I use AI to help me work through real psychological problems. Cognitive distortions, limiting beliefs, patterns I want to rewire. When I hit a genuine breakthrough, I develop a concept around it and embed it into a song.
But here's the rule: it has to sound like something you'd actually want to listen to.
If it sounds like psychology or self-help, nobody's going to play it. Even I wouldn't. So the challenge with every track is: how do I make this sound like regular, mainstream, catchy music while secretly embedding reframing, CBT, IFS, and other psychological strategies into the lyrics?
The result is that every time you listen, those neural pathways get a little stronger. The associations get a little more automatic. You're getting a therapy session disguised as a French house groove -- and you don't even know it.
The Visual Layer is Another Layer of Subversion
Building character sheets and expanding the album cover into a full visual world isn't just a cool AI experiment. It's another layer of the same strategy.
The psychology gets dressed up in something visually compelling. The character is appealing. The world is immersive. You want to consume it. And while you're consuming it, the deeper meaning is doing its work underneath.
That's the whole game: make the medicine taste so good that people keep coming back for more.
The Takeaway
If you're making AI music -- or any creative project with AI -- try this:
- Start with intention. Have the concept and story in mind when you create your album art. It makes everything downstream easier.
- Feed the AI context, not just images. Lyrics, backstory, encoded meanings. The more the AI understands what the project is really about, the better its outputs.
- Think in layers. One image becomes a character. A character becomes a world. A world becomes a story. Each layer reinforces the others.
- Use the visual to hide the message. If your music (or any creative work) has a deeper purpose, the visual world is another vehicle to carry it without being obvious about it.
I started with one album cover and ended up with the beginnings of an entire universe. All it took was the right prompts and a willingness to see how far one image could go.
CLOCK is available on all streaming platforms. It's part of the MIND SET album -- psychology-powered music designed to reprogram limiting beliefs through hidden therapeutic layers.
More at nomaditsu.com/mind-set