openclaw / voice-transcribe
Install for your project team
Run this command in your project directory to install the skill for your entire team:
mkdir -p .claude/skills/voice-transcribe && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/3931" && unzip -o skill.zip -d .claude/skills/voice-transcribe && rm skill.zip
Project Skills
This skill will be saved in .claude/skills/voice-transcribe/ and checked into git. All team members will have access to it automatically.
Important: Please verify the skill by reviewing its instructions before using it.
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
0 views
0 installs
Skill Content
--- name: voice-transcribe description: Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/). --- # voice-transcribe transcribe audio files using openai's gpt-4o-mini-transcribe model. ## when to use when receiving voice memos (especially via whatsapp), just run: ```bash uv run /Users/darin/clawd/skills/voice-transcribe/transcribe <audio-file> ``` then respond based on the transcribed content. ## fixing transcription errors if darin says a word was transcribed wrong, add it to `vocab.txt` (for hints) or `replacements.txt` (for guaranteed fix). see sections below. ## supported formats - mp3, mp4, mpeg, mpga, m4a, wav, webm, ogg, opus ## examples ```bash # transcribe a voice memo transcribe /tmp/voice-memo.ogg # pipe to other tools transcribe /tmp/memo.ogg | pbcopy ``` ## setup 1. add your openai api key to `/Users/darin/clawd/skills/voice-transcribe/.env`: ``` OPENAI_API_KEY=sk-... ``` ## custom vocabulary add words to `vocab.txt` (one per line) to help the model recognize names/jargon: ``` Clawdis Clawdbot ``` ## text replacements if the model still gets something wrong, add a replacement to `replacements.txt`: ``` wrong spelling -> correct spelling ``` ## notes - assumes english (no language detection) - uses gpt-4o-mini-transcribe model specifically - caches by sha256 of audio file