If you use an AI coding agent — Claude Code, Cursor, Windsurf, Copilot, OpenCode, or any of the 37+ agents on skills.sh — you can now give it the ability to generate speech, transcribe audio, and clone voices with a single command.
npx skills add leanvox/leanvox-skill
That's it. Your agent now knows how to use the LeanVox API.
What Is an Agent Skill?
Skills are reusable capability packages for AI agents. They give agents procedural knowledge — not just "what the API does" but how to use it correctly: which model tier to pick, when to use async, how to handle voice cloning, how to batch-generate dialogue. Think of it as a well-written API guide that lives inside the agent's context.
The skills.sh registry is the npm for agent skills. Install once, and any compatible agent can immediately use those capabilities without you writing glue code or pasting in documentation.
What the LeanVox Skill Includes
The skill ships with six helper scripts covering the full LeanVox API:
- tts.sh — text-to-speech (sync for short text, async for long-form)
- stt.sh — audio transcription with optional speaker diarization
- dialogue.sh — multi-speaker conversation generation from a script
- voiceover.sh — transcribe an audio file, edit the transcript, re-voice it
- voices.sh — browse and search 238+ curated voices by category and gender
- clone.sh — voice cloning from a reference audio file
It also includes references/api-reference.md and references/voice-catalog.md — full endpoint docs and the complete voice catalog, so agents can make informed decisions without hitting the docs site.
Pricing-Aware by Design
One thing we built in from the start: the skill teaches agents to pick the cheapest tier that works for the task.
| Tier | Cost | Best For |
|---|---|---|
| Standard | $0.005/1K chars | Fast narration, notifications, bulk generation |
| Pro | $0.01/1K chars | Expressive voices, podcasts, 238 curated voices |
| Max | $0.03/1K chars | Custom voice design from a text description |
The skill defaults agents to Standard unless they specifically need Pro voices or Max-tier instruction-based voice design. A $0.03 Max call where Standard would have worked is money wasted — the skill prevents that.
What Your Agent Can Do Now
With the skill installed, you can tell your agent things like:
- "Generate an MP3 of this blog post intro using a warm female narrator voice"
- "Transcribe this meeting recording and give me a summary with speaker labels"
- "Create a two-host dialogue between Alex and Jordan discussing this product launch"
- "Clone my voice from voice-sample.wav and read this script"
- "Find me a calm male narrator voice in the meditation category"
- "Generate audio for all 500 dialogue lines in my game script"
The agent handles the API calls, model selection, async job management, and file downloads — you just describe what you want.
Works With 37+ Agents
The skill uses the SKILL.md format, which is compatible with any agent that supports it:
- Claude Code
- Cursor
- Windsurf
- GitHub Copilot
- OpenCode
- Codex CLI
- And 31+ more on skills.sh
Install It
# Install the skill
npx skills add leanvox/leanvox-skill
# Set your API key
export LEANVOX_API_KEY="lv_live_..."
Get your API key (free $1.00 credit on signup) at leanvox.com/dashboard/keys.
The skill is open source: github.com/leanvox/leanvox-skill. PRs welcome.