If you use n8n for workflow automation, you can now add voice AI to any workflow. Our community node is live on npm.
n8n-nodes-leanvox
Install it from Settings → Community Nodes in your n8n instance. That's it.
What Can You Build?
The node covers the full LeanVox API — text-to-speech, speech-to-text, and multi-speaker dialogue. Here are some workflows that take minutes to set up:
Blog to Podcast
RSS feed triggers → extract article text → LeanVox Generate Speech → upload MP3 to S3 or your podcast host. Every new blog post automatically becomes an audio version.
Meeting Transcriber
Webhook receives recording → LeanVox Transcribe (with diarization + summary) → post summary to Slack. Know who said what without listening to the whole meeting.
Multilingual Voicemail
Form submission → LeanVox Generate Speech in 10 languages → email each version. One form, global reach.
Content Moderation Pipeline
Audio upload webhook → LeanVox Transcribe → scan transcript for flagged keywords → alert on Slack or email. Automate audio review at scale.
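The keyword-scan step in the Content Moderation Pipeline can be sketched in a few lines. This is only an illustration of the matching logic, not part of the LeanVox node: the function name, the sample transcript, and the term list are all hypothetical, and it assumes the transcript has already been pulled out of the Transcribe output as a plain string.

```python
import re


def find_flagged_terms(transcript: str, flagged_terms: list[str]) -> list[str]:
    """Return the flagged terms that occur in the transcript as whole words."""
    hits = []
    for term in flagged_terms:
        # Word boundaries stop a short term from matching inside a longer word;
        # re.escape treats any punctuation in the term literally.
        pattern = r"\b" + re.escape(term) + r"\b"
        if re.search(pattern, transcript, flags=re.IGNORECASE):
            hits.append(term)
    return hits


# Hypothetical transcript and term list, for illustration only.
transcript = "Please wire the refund to my account before the class starts."
flagged = find_flagged_terms(transcript, ["refund", "wire transfer", "ass"])
print(flagged)
```

A non-empty result is what would drive the Slack or email alert; an empty list means the workflow can end quietly.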
Available Operations
The node gives you 8 operations across 4 resources:
- Generate Speech — convert text to audio using Standard (fast), Pro (238 curated voices), or Max (instruction-based voice design)
- Generate Speech (Async) — same thing, but for long text. Kicks off a background job so your workflow doesn't time out
- Check Job — poll an async job until it's done
- Dialogue — multi-speaker conversations. Pass an array of lines with different voices
- Transcribe — audio → text, with optional speaker diarization and AI summary
- List Voices — get all available voice IDs
- List Curated Voices — browse our 238 curated voices with preview audio
- Check Balance — see your remaining credits
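To make the "array of lines" for the Dialogue operation concrete, here is a rough sketch of how such an array might be assembled from a script. The field names voice and text, and the second voice ID, are assumptions made for illustration; check the node's Dialogue parameters for the actual shape it expects.

```python
import json


def build_dialogue_lines(script):
    """Turn (voice_id, text) pairs into a list of line objects, skipping blank lines."""
    lines = []
    for voice_id, text in script:
        text = text.strip()
        if text:
            # "voice" and "text" are assumed field names, not confirmed by the node docs.
            lines.append({"voice": voice_id, "text": text})
    return lines


# podcast_conversational_female appears in this post; the male voice ID is hypothetical.
script = [
    ("podcast_conversational_female", "Welcome back to the show."),
    ("podcast_conversational_male", "Thanks, great to be here."),
    ("podcast_conversational_female", "   "),
]
dialogue_lines = build_dialogue_lines(script)
print(json.dumps(dialogue_lines, indent=2))
```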
Setup in 60 Seconds
- In n8n, go to Settings → Community Nodes → Install
- Enter n8n-nodes-leanvox
- Add a LeanVox API credential with your API key
- Drag the LeanVox node into any workflow
That's the whole setup. No Docker config, no environment variables, no SDK installation.
Example: Text to Speech in a Workflow
Here's the simplest possible workflow — take text input and generate audio:
- Add a Manual Trigger or Webhook node
- Add the LeanVox node
- Set Resource to Speech, Operation to Generate
- Pick a model: standard for speed, pro for voice quality, max for custom voice instructions
- Set a voice ID (e.g. podcast_conversational_female for Pro)
- Pass your text and you're done
The node returns JSON with an audio_url you can pass to any downstream node — upload to S3, send via email, post to Slack, whatever your workflow needs.
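If a downstream step needs the audio link on its own, pulling audio_url out of the node's JSON is straightforward. The sketch below assumes the output is a single JSON object with audio_url at the top level; the sample payload and URL are made up, and the real response may carry additional fields.

```python
import json
from urllib.parse import urlparse


def extract_audio_url(node_output: str) -> str:
    """Parse the node's JSON output and return a sanity-checked audio_url."""
    data = json.loads(node_output)
    url = data.get("audio_url") if isinstance(data, dict) else None
    if not isinstance(url, str):
        raise ValueError("output has no audio_url string")
    parsed = urlparse(url)
    # Reject anything that is not an absolute http(s) URL before handing it on.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"audio_url is not an http(s) URL: {url!r}")
    return url


# Hypothetical payload for illustration only.
sample_output = '{"audio_url": "https://example.com/audio/episode-1.mp3"}'
audio_url = extract_audio_url(sample_output)
print(audio_url)
```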
Async for Long Content
For longer text (articles, chapters, scripts), use Generate Speech (Async). It queues a background job and returns a job_id. Chain it with the Check Job operation in a loop to poll until complete. No timeout issues, even for book-length content.
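The poll-until-complete loop follows a common pattern, sketched here outside n8n. The status field and its values ("processing", "completed", "failed") are assumptions, as is the check_job callable, which stands in for whatever the Check Job operation returns. Inside n8n you would typically build the same loop from an IF node and a Wait node.

```python
import time
from typing import Callable


def poll_job(
    check_job: Callable[[str], dict],
    job_id: str,
    interval_s: float = 5.0,
    max_attempts: int = 60,
) -> dict:
    """Call check_job(job_id) until the job finishes, fails, or attempts run out."""
    for _ in range(max_attempts):
        result = check_job(job_id)
        status = result.get("status")  # assumed field name
        if status == "completed":
            return result
        if status == "failed":
            raise RuntimeError(f"job {job_id} failed: {result}")
        time.sleep(interval_s)
    raise TimeoutError(f"job {job_id} not finished after {max_attempts} checks")


# Fake Check Job responses so the sketch runs without network access.
responses = iter([
    {"status": "processing"},
    {"status": "processing"},
    {"status": "completed", "audio_url": "https://example.com/audio/book.mp3"},
])


def fake_check_job(job_id: str) -> dict:
    return next(responses)


job_result = poll_job(fake_check_job, "job_123", interval_s=0)
print(job_result["audio_url"])
```

Capping the number of attempts keeps a stuck job from holding the workflow open indefinitely.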
Pricing
Same credits, same pricing as the API. Standard at $0.005/1K chars, Pro at $0.01/1K chars, Max at $0.03/1K chars. Transcription at $0.002/min. See full pricing.
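For budgeting a workflow, the rates above translate into a quick estimate. This sketch assumes billing is strictly proportional to character count and minutes, with no minimums or rounding to whole blocks; the function names are illustrative, and the full pricing page is the authority on how usage is actually metered.

```python
from decimal import Decimal

# Rates quoted in this post, in USD.
RATES_PER_1K_CHARS = {
    "standard": Decimal("0.005"),
    "pro": Decimal("0.01"),
    "max": Decimal("0.03"),
}
TRANSCRIBE_PER_MIN = Decimal("0.002")


def tts_cost(char_count: int, model: str) -> Decimal:
    """Estimated text-to-speech cost, assuming linear per-character billing."""
    return RATES_PER_1K_CHARS[model] * Decimal(char_count) / Decimal(1000)


def transcription_cost(minutes: float) -> Decimal:
    """Estimated transcription cost, assuming linear per-minute billing."""
    return TRANSCRIBE_PER_MIN * Decimal(str(minutes))


# Example: a 12,000-character post on Pro and a 45-minute recording.
post_cost = tts_cost(12_000, "pro")
meeting_cost = transcription_cost(45)
print(f"Blog post audio: ${post_cost}")
print(f"Meeting transcript: ${meeting_cost}")
```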
Build something cool? We'd love to see it. Drop us a line or open an issue on GitHub.