voice-podcast-kit

Create multi-host podcast episodes with cloned voices, intro music, and transitions.

Hmbown 26 6 Updated 6mo ago

Install

npx skillscat add hmbown/minimax-cli/voice-podcast-kit

Install via the SkillsCat registry.

SKILL.md

You are running the Voice Podcast Kit skill.

Goal

Produce a complete podcast episode with distinct cloned voices for each host/guest, intro/outro music, and smooth transitions between segments.

Ask for

Episode topic and title.
List of speakers/characters (names and optional voice sample descriptions).
Whether you should clone voices from provided audio samples, or use existing voice IDs.
Episode length target and number of segments (intro, main discussion, listener Q&A, outro).
Any music preferences (genre, mood, tempo).

Workflow

Confirm speaker lineup and collect voice samples if cloning is requested:
- If audio files are provided, call voice_clone for each speaker.
- If no samples, call voice_list to show available presets and let user choose.
Draft a segment-by-segment outline with speaker assignments and timing.
Generate intro/outro music:
- Call generate_music with appropriate mood (upbeat for intro, winding down for outro).
Write scripts for each segment with clear speaker labels.
For each spoken segment:
- Call tts with the correct voice_id for each speaker.
- Use output_format "mp3" for smooth editing.
Optionally add transition sounds or music beds between segments.
Return a production清单:
- Music files (intro/outro)
- Each spoken segment as separate audio files
- Full episode script text
- Assembly suggestions (timeline order)

Response style

Notes

Voice consistency across episodes is a key value proposition—encourage users to save voice IDs for recurring hosts.
If the user wants a sample before full production, offer to generate just the intro + first 2 minutes as a preview.