openclaw / voice-agent

Install for your project team

Run one of these commands in your project directory to install the skill for your entire team. The first is for macOS/Linux shells; the second is for Windows PowerShell.

mkdir -p .claude/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d .claude/skills/voice-agent && rm skill.zip

New-Item -Path ".claude/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath ".claude/skills/voice-agent" -Force; Remove-Item "skill.zip"

Project Skills

This skill will be saved in .claude/skills/voice-agent/. Once that directory is committed to git, all team members will have access to it automatically.

Important: Please verify the skill by reviewing its instructions before using it.
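
One way to act on that advice is to list the archive's contents before extracting it. The sketch below uses Python's standard `zipfile` module and assumes the download has already been saved as `skill.zip`, as in the commands above. The function names `list_members` and `unsafe_members` are hypothetical helpers, not part of the skill.

```python
import zipfile
from pathlib import PurePosixPath


def list_members(zip_file):
    """Return every member name so the files can be reviewed by hand."""
    with zipfile.ZipFile(zip_file) as archive:
        return archive.namelist()


def unsafe_members(zip_file):
    """Return member names that would extract outside the target directory.

    zip_file may be a filesystem path or a binary file-like object.
    """
    flagged = []
    with zipfile.ZipFile(zip_file) as archive:
        for name in archive.namelist():
            path = PurePosixPath(name)
            # Absolute paths and ".." components could escape the skill folder.
            if path.is_absolute() or ".." in path.parts:
                flagged.append(name)
    return flagged
```

Calling `list_members("skill.zip")` shows what would be written, and an empty result from `unsafe_members("skill.zip")` suggests no entry tries to leave the destination folder. Neither check replaces reading the skill's instructions themselves.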

Install skill for Codex

Run one of these commands to install the skill depending on your needs:

Project Local ($CWD/.codex/skills)

mkdir -p .codex/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d .codex/skills/voice-agent && rm skill.zip

New-Item -Path ".codex/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath ".codex/skills/voice-agent" -Force; Remove-Item "skill.zip"

User Global (~/.codex/skills)

mkdir -p ~/.codex/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d ~/.codex/skills/voice-agent && rm skill.zip

New-Item -Path "$HOME/.codex/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath "$HOME/.codex/skills/voice-agent" -Force; Remove-Item "skill.zip"

Scope	Location	Suggested Use
REPO	`$CWD/.codex/skills`	Project directory. Teams can check in skills most relevant to a working folder here.
REPO	`$CWD/../.codex/skills`	A folder above CWD. Organizations can check in skills relevant to a shared area.
REPO	`$REPO_ROOT/.codex/skills`	Top-most root folder. Relevant to everyone using the repository.
USER	`$CODEX_HOME/skills`	Personal folder (`~/.codex/skills`). Curate skills that apply to any repository.
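
To make the four locations in the table concrete, here is a minimal sketch that expands them for a given working directory and repository root. The function name `codex_skill_dirs` is hypothetical, and the fallback of `CODEX_HOME` to `~/.codex` is an assumption based on the table's "Personal folder" note; the sketch only lists paths in table order and says nothing about how Codex itself searches them.

```python
import os
from pathlib import Path


def codex_skill_dirs(cwd, repo_root, env=None):
    """List the four skill locations from the table, in table order."""
    env = os.environ if env is None else env
    cwd = Path(cwd)
    # Assumption: CODEX_HOME falls back to ~/.codex when it is unset.
    codex_home = Path(env.get("CODEX_HOME") or Path.home() / ".codex")
    return [
        ("REPO", cwd / ".codex" / "skills"),           # $CWD/.codex/skills
        ("REPO", cwd.parent / ".codex" / "skills"),    # $CWD/../.codex/skills
        ("REPO", Path(repo_root) / ".codex" / "skills"),  # $REPO_ROOT/.codex/skills
        ("USER", codex_home / "skills"),               # $CODEX_HOME/skills
    ]
```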

Install skill for GitHub Copilot

Run one of these commands to install the skill depending on your needs:

Project (.github/skills)

mkdir -p .github/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d .github/skills/voice-agent && rm skill.zip

New-Item -Path ".github/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath ".github/skills/voice-agent" -Force; Remove-Item "skill.zip"

Personal (~/.copilot/skills)

mkdir -p ~/.copilot/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d ~/.copilot/skills/voice-agent && rm skill.zip

New-Item -Path "$HOME/.copilot/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath "$HOME/.copilot/skills/voice-agent" -Force; Remove-Item "skill.zip"

Scope	Location	Suggested Use
Project	`.github/skills/`	Repository-specific skills. Checked into git for the whole team.
Personal	`~/.copilot/skills/`	Personal skills available across all your projects.

Install skill for Google Antigravity

Run one of these commands to install the skill depending on your needs:

Workspace (.agent/skills)

mkdir -p .agent/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d .agent/skills/voice-agent && rm skill.zip

New-Item -Path ".agent/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath ".agent/skills/voice-agent" -Force; Remove-Item "skill.zip"

Global (~/.gemini/antigravity/skills)

mkdir -p ~/.gemini/antigravity/skills/voice-agent && curl -L -o skill.zip "https://fastmcp.me/Skills/Download/2837" && unzip -o skill.zip -d ~/.gemini/antigravity/skills/voice-agent && rm skill.zip

New-Item -Path "$HOME/.gemini/antigravity/skills/voice-agent" -ItemType Directory -Force; Invoke-WebRequest -Uri "https://fastmcp.me/Skills/Download/2837" -OutFile "skill.zip"; Expand-Archive -Path "skill.zip" -DestinationPath "$HOME/.gemini/antigravity/skills/voice-agent" -Force; Remove-Item "skill.zip"

Scope	Location	Suggested Use
Workspace	`.agent/skills/`	Workspace-specific skills for project workflows and conventions.
Global	`~/.gemini/antigravity/skills/`	Personal skills available across all workspaces.

Local Voice Input/Output for Agents using the AI Voice Agent API.

Communication

Source: https://github.com/openclaw/skills/tree/main/skills/ricardotrevisan/voice-agent

Skill Content

---
name: voice-agent
display-name: AI Voice Agent Backend
version: 1.1.0
description: Local Voice Input/Output for Agents using the AI Voice Agent API.
author: trevisanricardo
homepage: https://github.com/ricardotrevisan/ai-conversational-skill
user-invocable: true
disable-model-invocation: false
---

# Voice Agent

This skill allows you to speak and listen to the user using a local Voice Agent API.
It is client-only and does not start containers or services.
It uses **local Whisper** for Speech-to-Text transcription and **AWS Polly** for Text-to-Speech generation.

## Prerequisite
Requires a running backend API at `http://localhost:8000`.
Backend setup instructions are in this repository:
- `README.md`
- `walkthrough.md`
- `DOCKER_README.md`

## Behavior Guidelines
- **Audio First**: When the user communicates via audio files, your PRIMARY mode of response is an **audio file**.
- **Silent Delivery**: When sending an audio response, **DO NOT** send a text explanation like "I sent an audio". Just send the audio file.
- **Workflow**:
  1. User sends audio.
  2. Use `transcribe` to read it.
  3. You think of a response.
  4. Use `synthesize` to generate the audio file.
  5. You send the file.
  6. **STOP**. Do not add text commentary.
- **Failure Handling**: If `health` fails or connection errors occur, do not attempt service management from this skill. Ask the user to start or fix the backend using the repository docs.
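
The workflow above can be sketched as a small Python pipeline. This is an illustration under stated assumptions, not the skill's own code: `run_client`, `answer_audio`, and `compose_reply` are hypothetical names, the default `client` path stands in for `{baseDir}/scripts/client.py`, and the sketch assumes the client script prints the transcript to stdout and exits non-zero on failure.

```python
import subprocess
import sys


def run_client(args, client="scripts/client.py"):
    """Run the skill's client script and return its stdout as text.

    Assumption: the script prints its result to stdout and exits
    non-zero on failure (check=True then raises CalledProcessError).
    """
    result = subprocess.run(
        [sys.executable, client, *args],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


def answer_audio(audio_path, output_path, compose_reply, run=run_client):
    """Transcribe, compose a reply, synthesize it, and return only the file path."""
    transcript = run(["transcribe", audio_path])  # step 2
    reply_text = compose_reply(transcript)  # step 3
    run(["synthesize", reply_text, "--output", output_path])  # step 4
    # Steps 5-6: the caller sends this file and adds no text commentary.
    return output_path
```

Passing the runner as a parameter keeps the pipeline testable without a live backend, and returning only the output path mirrors the Silent Delivery rule.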

## Tools

### Transcribe File
To transcribe an audio file with **local Whisper STT**, run the client script with the `transcribe` command.

```bash
python3 {baseDir}/scripts/client.py transcribe "/path/to/audio/file.ogg"
```

### Synthesize to File
To generate audio from text with **AWS Polly TTS** and save it to a file, run the client script with the `synthesize` command.

```bash
python3 {baseDir}/scripts/client.py synthesize "Text to speak" --output "/path/to/output.mp3"
```
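
Reply text often contains quotes or shell metacharacters, so it can be safer to build the call as an argument list than to paste the text into a shell string. The following is a minimal sketch; `synthesize_command` and its `base_dir` parameter (standing in for the `{baseDir}` placeholder) are hypothetical, and only the `synthesize` subcommand and `--output` flag come from the command shown above.

```python
import shlex


def synthesize_command(text, output_path, base_dir="."):
    """Build the argument list for the documented `synthesize` call."""
    # base_dir stands in for the {baseDir} placeholder; it is an assumption here.
    return [
        "python3",
        f"{base_dir}/scripts/client.py",
        "synthesize",
        text,
        "--output",
        output_path,
    ]


cmd = synthesize_command('She said "hi" & left', "/tmp/reply.mp3")
# shlex.join renders a copy-pasteable, safely quoted POSIX command line.
print(shlex.join(cmd))
```

The list form can be passed directly to `subprocess.run`, where each element reaches the script as one argument regardless of its contents.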

### Health Check
To check if the voice agent API is running and healthy:

```bash
python3 {baseDir}/scripts/client.py health
```
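
A health check combined with the Failure Handling guideline might look like the sketch below: it reports the problem and defers to the user instead of trying to manage the service. `backend_status` is a hypothetical helper, the `client` path stands in for `{baseDir}/scripts/client.py`, and treating a non-zero exit status as "unhealthy" is an assumption about the client script.

```python
import subprocess


def backend_status(run=subprocess.run, client="scripts/client.py"):
    """Return (ok, message) without attempting to start or repair the backend."""
    try:
        result = run(
            ["python3", client, "health"],
            capture_output=True,
            text=True,
            timeout=10,
        )
    except (OSError, subprocess.TimeoutExpired) as exc:
        return False, f"Could not run the health check: {exc}"
    if result.returncode != 0:
        # Assumption: a non-zero exit status means the API is unreachable or unhealthy.
        return False, "Voice backend is not healthy; please start or fix it using the repository docs."
    return True, "Voice backend is healthy."
```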
