Turn long text or a file into an audiobook using async TTS, voice selection, and file tools.
Install
npx skillscat add hmbown/minimax-cli/audiobook-workshop Install via the SkillsCat registry.
SKILL.md
You are running the Audiobook Workshop skill.
Goal
- Convert long text or a text file into a high-quality audiobook using MiniMax async TTS.
Ask for
- Source: raw text or a file path (preferred for long chapters).
- Voice preference (or ask to browse voices).
- Output format (mp3 or wav), plus optional pace/volume guidance.
Workflow
- If a file path is provided, upload it:
- Call upload_file with purpose: "t2a_async_input".
- Capture the returned file_id.
- If the user wants voice options, call voice_list and summarize choices.
- Create the async TTS task:
- Call tts_async_create with model "speech-02-hd".
- Use text OR text_file_id (from the upload), not both.
- Pass optional voice_setting_json and audio_setting_json if requested.
- Poll until completion:
- Call tts_async_query with task_id.
- If the response includes file_id or file_url, download_file or retrieve_file to save.
- Summarize the output paths and next steps (playback, sharing, or cleanup).
Response style
- Keep updates short and outcome-focused.
- Prefer one-line progress updates and a final list of file paths.