Speech-to-text transcription using Groq Whisper API. Supports m4a, mp3, wav, ogg, flac, webm.
Resources
2Install
npx skillscat add blackhaj/dotfiles/transcribe Install via the SkillsCat registry.
SKILL.md
Transcribe
Speech-to-text using Groq Whisper API.
Setup
The script needs GROQ_API_KEY environment variable. Check if already set:
echo $GROQ_API_KEYIf not set, guide the user through setup:
- Ask if they have a Groq API key
- If not, have them sign up at https://console.groq.com/ and create an API key
- Have them add to their shell profile (~/.zshrc or ~/.bashrc):
export GROQ_API_KEY="<their-api-key>" - Then run
source ~/.zshrc(or restart terminal)
Usage
{baseDir}/transcribe.sh <audio-file>Supported Formats
- m4a, mp3, wav, ogg, flac, webm
- Max file size: 25MB
Output
Returns plain text transcription with punctuation and proper capitalization to stdout.