Venice Text-to-Speech
Convert text to speech using the Venice.ai TTS API. Supports 50+ voices across 9 languages with multiple audio formats and adjustable speed.
Features
- 50+ voices across American English, British English, Chinese, French, Hindi, Italian, Japanese, Portuguese
- Multiple formats -- mp3, opus, aac, flac, wav, pcm
- Adjustable speed -- 0.25x to 4.0x
- Max 4096 characters per request
- Model: tts-kokoro
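The limits listed above can be checked locally before a request is sent. The following is a minimal sketch, not part of the skill's script: the function name `validate_request` and the constant names are hypothetical, and only the numeric limits and format names come from the feature list.

```python
# Hypothetical helper: the limits mirror the feature list, the names are illustrative.
SUPPORTED_FORMATS = {"mp3", "opus", "aac", "flac", "wav", "pcm"}
MAX_CHARS = 4096
MIN_SPEED, MAX_SPEED = 0.25, 4.0


def validate_request(text: str, audio_format: str = "mp3", speed: float = 1.0) -> None:
    """Raise ValueError if an argument falls outside the documented limits."""
    if not text:
        raise ValueError("text must not be empty")
    if len(text) > MAX_CHARS:
        raise ValueError(f"text is {len(text)} characters; the limit is {MAX_CHARS}")
    if audio_format not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported format: {audio_format!r}")
    if not MIN_SPEED <= speed <= MAX_SPEED:
        raise ValueError(f"speed must be between {MIN_SPEED} and {MAX_SPEED}")
```

Failing early this way avoids spending an API call on a request that would be rejected anyway.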
Prerequisites
pip install requests
export VENICE_API_KEY="your_venice_api_key"
Usage
Basic
python scripts/text_to_speech.py "Hello, welcome to Venice Voice."
With voice selection
python scripts/text_to_speech.py "Hello world" --voice am_adam
All options
python scripts/text_to_speech.py "Your text here" \
    --voice af_bella \
    --speed 1.2 \
    --format wav \
    --output greeting.wav
List all voices
python scripts/text_to_speech.py "" --list-voices
Options
| Option | Short | Default | Description |
|---|---|---|---|
| text | -- | (required) | Text to convert (max 4096 chars) |
| --voice | -v | af_sky | Voice ID |
| --format | -f | mp3 | Audio format |
| --speed | -s | 1.0 | Speed (0.25-4.0) |
| --output | -o | auto | Output file path |
| --list-voices | -- | -- | List all available voices |
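For readers who want to wrap or reimplement the command line, the options in the table map naturally onto `argparse`. This is a sketch under that assumption only: `build_parser` is a hypothetical name, and the real `scripts/text_to_speech.py` may declare its arguments differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Illustrative parser mirroring the options table; not the script's actual code."""
    parser = argparse.ArgumentParser(description="Convert text to speech (sketch).")
    parser.add_argument("text", help="Text to convert (max 4096 chars)")
    parser.add_argument("-v", "--voice", default="af_sky", help="Voice ID")
    parser.add_argument(
        "-f", "--format", default="mp3",
        choices=["mp3", "opus", "aac", "flac", "wav", "pcm"],
        help="Audio format",
    )
    parser.add_argument("-s", "--speed", type=float, default=1.0, help="Speed (0.25-4.0)")
    # None stands in for the "auto" default: the caller picks a file name.
    parser.add_argument("-o", "--output", default=None, help="Output file path")
    parser.add_argument("--list-voices", action="store_true", help="List all available voices")
    return parser


if __name__ == "__main__":
    print(build_parser().parse_args())
```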
Available Voices
| Prefix | Language | Voices |
|---|---|---|
| af_ | American Female | alloy, aoede, bella, heart, jadzia, jessica, kore, nicole, nova, river, sarah, sky |
| am_ | American Male | adam, echo, eric, fenrir, liam, michael, onyx, puck, santa |
| bf_ | British Female | alice, emma, lily |
| bm_ | British Male | daniel, fable, george, lewis |
| zf_ | Chinese Female | xiaobei, xiaoni, xiaoxiao, xiaoyi |
| zm_ | Chinese Male | yunjian, yunxi, yunxia, yunyang |
| ff_ | French Female | siwis |
| hf_/hm_ | Hindi | alpha, beta, omega, psi |
| if_/im_ | Italian | sara, nicola |
| jf_/jm_ | Japanese | alpha, gongitsune, nezumi, tebukuro, kumo |
| pf_/pm_ | Portuguese | dora, alex, santa |
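The prefixes in the table follow a two-letter pattern: the first letter is the language and the second is `f` or `m`. A small sketch of decoding a voice ID on that basis is shown here. The function `describe_voice` is hypothetical, and it only interprets the prefix; it does not check that the name after the underscore is a real voice.

```python
# Prefix letters taken from the voices table; the helper itself is illustrative.
LANGUAGES = {
    "a": "American English",
    "b": "British English",
    "z": "Chinese",
    "f": "French",
    "h": "Hindi",
    "i": "Italian",
    "j": "Japanese",
    "p": "Portuguese",
}
GENDERS = {"f": "female", "m": "male"}


def describe_voice(voice_id: str) -> tuple[str, str, str]:
    """Return (language, gender, name) for an ID such as 'am_adam'."""
    prefix, sep, name = voice_id.partition("_")
    if not sep or len(prefix) != 2 or not name:
        raise ValueError(f"unexpected voice ID format: {voice_id!r}")
    lang_code, gender_code = prefix
    if lang_code not in LANGUAGES or gender_code not in GENDERS:
        raise ValueError(f"unknown voice prefix: {prefix!r}")
    return LANGUAGES[lang_code], GENDERS[gender_code], name
```

Use `--list-voices` for the authoritative set of IDs.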
Python Import
from text_to_speech import text_to_speech

result = text_to_speech(
    text="Hello, this is a test.",
    voice="am_adam",
    format="mp3",
    speed=1.0
)
print(f"Audio saved to: {result['output']}")
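Because each request is capped at 4096 characters, longer text has to be split before it is passed to `text_to_speech`. One possible approach, sketched here with a hypothetical `split_text` helper that is not part of the skill, is to break on sentence boundaries and hard-wrap only when a single sentence exceeds the limit.

```python
import re

MAX_CHARS = 4096  # per-request limit stated in the feature list


def split_text(text: str, limit: int = MAX_CHARS) -> list[str]:
    """Split text into chunks of at most `limit` characters, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # Hard-wrap any single sentence that is itself longer than the limit.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= limit:
            current = candidate
        else:
            # The chunk is full: store it and start a new one with this sentence.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk can then be converted in its own call, and the resulting audio files joined afterwards with a tool of your choice.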
Environment Variables
| Variable | Required | Description |
|---|---|---|
| VENICE_API_KEY | Yes | Venice.ai API key |
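When importing the module from your own code, it can help to confirm the key is present before making any call. A minimal sketch follows. `get_api_key` is a hypothetical helper, and the script itself may already perform an equivalent check.

```python
import os
from collections.abc import Mapping


def get_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return VENICE_API_KEY from the environment, or raise a clear error."""
    key = env.get("VENICE_API_KEY", "").strip()
    if not key:
        raise RuntimeError("VENICE_API_KEY is not set; export it before running the script")
    return key
```

Passing the environment as a parameter keeps the helper easy to test without touching the real process environment.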