ElevenLabs Text-to-Speech

Convert text and documents to high-quality audio with the ElevenLabs TTS API, in either single-narrator or dual-host podcast mode.

What is it?

This skill converts text and documents into high-quality audio using the ElevenLabs TTS API. It supports two modes: single-voice narration and a two-host conversational podcast.
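To make the underlying call concrete, here is a minimal sketch of how a single narration request could be assembled. It is based on the publicly documented ElevenLabs REST endpoint (POST to /v1/text-to-speech/{voice_id} with an xi-api-key header); how the skill itself calls the API is not shown on this page, so treat the function name build_tts_request and the overall structure as assumptions. The voice and model IDs are the ones that appear in this page's sample config.

```python
import json
import urllib.request

API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


def build_tts_request(text, api_key,
                      voice_id="JBFqnCBsd6RMkjVDRZzb",
                      model_id="eleven_multilingual_v2"):
    """Build (but do not send) a text-to-speech request.

    Hypothetical helper for illustration; not part of the skill.
    """
    body = json.dumps({"text": text, "model_id": model_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/{voice_id}",
        data=body,
        method="POST",
        headers={
            "xi-api-key": api_key,
            "Content-Type": "application/json",
        },
    )


def synthesize(text, api_key, out_path="narration.mp3"):
    """Send the request and write the returned audio bytes to out_path."""
    request = build_tts_request(text, api_key)
    with urllib.request.urlopen(request) as response:
        audio = response.read()
    with open(out_path, "wb") as handle:
        handle.write(audio)
    return out_path
```

Calling synthesize("Hello there", api_key) would perform a real, billable API request, so the request builder is kept separate to allow inspection without network access.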

How to use it?

Configuration lives at skills/elevenlabs/config.json:

{
  "api_key": "your-elevenlabs-api-key",
  "default_voice": "JBFqnCBsd6RMkjVDRZzb",
  "default_model": "eleven_multilingual_v2",
  "podcast_voice1": "JBFqnCBsd6RMkjVDRZzb",
  "podcast_voice2": "EXAVITQu4vr4xnSDxMaL"
}

Only api_key is required. Alternatively, set the ELEVENLABS_API_KEY environment variable instead of putting the key in the file.
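The configuration rules above can be sketched as a small loader. This is an illustration, not the skill's actual code: the function name load_config is hypothetical, the fallback values are simply copied from the sample config, and the precedence (file first, environment variable only when the file has no key) is an assumption, since the page does not say which source wins.

```python
import json
import os
from pathlib import Path

# Values copied from the sample config; whether the skill uses these
# same values as built-in defaults is an assumption.
DEFAULTS = {
    "default_voice": "JBFqnCBsd6RMkjVDRZzb",
    "default_model": "eleven_multilingual_v2",
    "podcast_voice1": "JBFqnCBsd6RMkjVDRZzb",
    "podcast_voice2": "EXAVITQu4vr4xnSDxMaL",
}


def load_config(path="skills/elevenlabs/config.json", env=None):
    """Merge defaults, the optional config file, and the env var."""
    env = os.environ if env is None else env
    config = dict(DEFAULTS)

    file_path = Path(path)
    if file_path.is_file():
        config.update(json.loads(file_path.read_text(encoding="utf-8")))

    # Assumed precedence: use the env var only if the file gave no key.
    if not config.get("api_key"):
        config["api_key"] = env.get("ELEVENLABS_API_KEY", "")

    if not config["api_key"]:
        raise ValueError(
            "No API key: set api_key in config.json or ELEVENLABS_API_KEY"
        )
    return config
```

Passing env explicitly keeps the function easy to test without touching the real environment.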

Dependencies: pip install PyPDF2 python-docx (only needed for PDF/DOCX files).
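Because those two packages are only needed for PDF and DOCX input, one plausible approach is to import them lazily, so plain-text input works without them installed. The sketch below assumes that design; extract_text is a hypothetical helper name, and the skill's real extraction logic may differ.

```python
from pathlib import Path


def extract_text(path):
    """Return the text content of a .pdf, .docx, or plain-text file."""
    file_path = Path(path)
    suffix = file_path.suffix.lower()

    if suffix == ".pdf":
        # Imported here so PyPDF2 is only required for PDF input.
        from PyPDF2 import PdfReader

        reader = PdfReader(str(file_path))
        # extract_text() can return None for pages with no text layer.
        return "\n".join((page.extract_text() or "") for page in reader.pages)

    if suffix == ".docx":
        # Imported here so python-docx is only required for DOCX input.
        from docx import Document

        document = Document(str(file_path))
        return "\n".join(paragraph.text for paragraph in document.paragraphs)

    # Anything else is treated as UTF-8 plain text (an assumption).
    return file_path.read_text(encoding="utf-8")
```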

ffmpeg must be installed and on your PATH for multi-chunk narration and for podcasts.
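The page does not say how ffmpeg is used, but a likely reason is joining separately generated audio chunks into one file. The following sketch shows one way to do that with ffmpeg's concat demuxer, after checking that ffmpeg is available. The helper names are hypothetical and the choice of the concat demuxer is an assumption, not a description of the skill's internals.

```python
import shutil
import subprocess
import tempfile
from pathlib import Path


def build_concat_command(chunk_paths, output_path, list_path):
    """Write the concat list file and return the ffmpeg argument list.

    Note: paths containing single quotes would need escaping; this
    sketch does not handle them.
    """
    lines = [f"file '{Path(p).resolve().as_posix()}'" for p in chunk_paths]
    Path(list_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return [
        "ffmpeg", "-y",
        "-f", "concat", "-safe", "0",
        "-i", str(list_path),
        "-c", "copy",          # stream copy: no re-encoding
        str(output_path),
    ]


def join_chunks(chunk_paths, output_path):
    """Concatenate audio chunks into output_path using ffmpeg."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg not found on PATH")
    with tempfile.TemporaryDirectory() as tmp:
        list_path = Path(tmp) / "chunks.txt"
        command = build_concat_command(chunk_paths, output_path, list_path)
        subprocess.run(command, check=True)
    return output_path
```

Stream copy assumes every chunk shares the same codec and parameters, which should hold when all chunks come from the same TTS model and output format.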


License: Apache-2.0
Version: 1.0.0
