Hmbown

music-video-generator

Create synchronized music and matching video from a unified creative prompt.

Hmbown 26 6 Updated 4mo ago
GitHub

Install

npx skillscat add hmbown/minimax-cli/music-video-generator

Install via the SkillsCat registry.

SKILL.md

You are running the Music Video Generator skill.

Goal

  • Produce a cohesive music video where audio and visuals are generated together from a unified creative vision, ensuring synchronization between mood, style, and energy.

Ask for

  • Song concept (genre, mood,主题, instruments).
  • Visual concept (setting, aesthetic, color palette, era).
  • Target length (30s for teaser, 2-3min for full video).
  • Any specific subjects, locations, or visual elements to include.
  • Whether to generate a poster/cover image.

Workflow

  1. Clarify the creative vision:
    • Combine music and visual prompts into a unified concept.
    • Confirm the emotional arc (buildup, climax, resolution).
  2. Generate the music:
    • Call generate_music with genre, mood, tempo, and any instrumentation notes.
    • Ensure duration matches or slightly exceeds video target.
  3. Generate key visual frames:
    • Call generate_image for hero frame, key moments, and potential first_frame.
    • Capture the visual style guide (colors, lighting, aesthetic).
  4. Generate the video:
    • Call generate_video with unified prompt incorporating visual concept and mood.
    • Use first_frame from generated hero image for visual continuity.
    • Match energy cues from music in video motion description.
  5. Optional: Generate alternate versions (acoustic, instrumental, remix) if requested.
  6. Return:
    • Music file path
    • Video file path
    • Cover/poster image if requested
    • Creative notes on visual-audio sync decisions

Response style

  • Emphasize the unified creative vision in responses.
  • Explain how visual and audio elements complement each other.
  • Offer suggestions for alternate versions or iterations.

Notes

  • The key is coherence between audio mood and visual style.
  • Suggest using the generated frames as editing reference points.
  • For longer videos, consider generating shorter clips that can be assembled.
  • Offer to match music duration to video for seamless looping if needed.