API Reference

Voice

Clone a voice and generate natural Arabic speech from text using the Voice API.

Overview

The Voice API lets you clone a voice from an audio sample and synthesize natural-sounding Arabic (or English) speech from any text. Generated audio can be sent to contacts as voice messages or used in any workflow that accepts an audio URL.

Voice synthesis supports all major Arabic dialects - Saudi, Egyptian, Levantine, Gulf, and more.

Each account's voice data is fully isolated - your cloned voice, settings, and generated audio are private to your account.

How it works

Configure your dialect and language preferences (PATCH /voice/settings)
Clone a voice by uploading a sample recording (POST /voice/clone)
Synthesize speech from any text (POST /voice/tts)

The system automatically warms up the voice before synthesis to minimize latency.

Voice settings

Get voice settings

GET /api/v1/voice

{
  "enabled": true,
  "voice_configured": true,
  "has_cloned_voice": true,
  "voice_id": "voice_abc123",
  "voice_name": "Layla - Customer Support",
  "language": "ar",
  "dialect": "ksa",
  "voice_preloaded_at": "2026-03-08T14:00:00.000000Z"
}

Field	Description
`voice_configured`	Whether the voice service is active on this platform
`enabled`	Whether voice auto-responds to your contacts
`has_cloned_voice`	Whether a voice is ready for synthesis

Check connectivity

GET /api/v1/voice/status

Tests whether the voice service is connected and ready.

{
  "configured": true,
  "connected": true
}

Update voice settings

PATCH /api/v1/voice/settings

Field	Type	Notes
`language`	string	`ar` or `en`
`dialect`	string	Dialect code from `GET /voice/dialects`
`enabled`	boolean	Enable/disable voice synthesis for your account

# Enable voice with Egyptian Arabic dialect
curl -X PATCH https://dashboard.skylightchat.com/api/v1/voice/settings \
  -H "X-Api-Key: ••••••••••••" \
  -H "Content-Type: application/json" \
  -d '{ "dialect": "egy", "language": "ar", "enabled": true }'

Supported dialects

GET /api/v1/voice/dialects

Returns all supported dialect codes:

Code	Dialect
`arb`	Modern Standard Arabic (MSA)
`ksa`	Saudi / Khaleeji
`egy`	Egyptian
`leb`	Lebanese
`syr`	Syrian
`jor`	Jordanian
`irq`	Iraqi
`uae`	Emirati
`bah`	Bahraini
`qat`	Qatari
`pls`	Palestinian
`msa`	Formal Modern Standard Arabic
`en`	English

Clone a voice

POST /api/v1/voice/clone

Upload an audio sample to create a cloned voice. The pipeline extracts a transcription from the recording, then registers a custom voice model scoped to your account.

This endpoint may take 30-90 seconds to complete. Use a generous request timeout.

This is a multipart/form-data request:

Field	Type	Required	Notes
`audio_file`	file	✓	mp3, wav, ogg, m4a, webm, mp4, aac, flac - max 32 MB
`name`	string	✓	Display name for the voice
`language`	string	No	`ar` or `en` (default: `ar`)
`dialect`	string	No	Dialect code (default: `arb`)
`gender`	string	No	`male` or `female` (default: `female`)
`style`	string	No	`conversational` or `narrator` (default: `conversational`)

Tips for best results:

Use a quiet recording with no background noise
Minimum 10 seconds of clear speech
WAV or MP3 at 44.1 kHz is ideal

curl -X POST https://dashboard.skylightchat.com/api/v1/voice/clone \
  -H "X-Api-Key: ••••••••••••" \
  -F "audio_file=@/path/to/sample.wav" \
  -F "name=Layla - Support Agent" \
  -F "language=ar" \
  -F "dialect=ksa" \
  -F "gender=female"

{
  "success": true,
  "data": {
    "voice_id": "voice_abc123",
    "voice_name": "Layla - Support Agent",
    "language": "ar",
    "dialect": "ksa"
  },
  "message": "Voice cloned successfully. Use POST /voice/preload to cache it for zero-latency TTS."
}

Returns 201 Created. Voice enabled is set to true automatically.

Warm up the voice

POST /api/v1/voice/preload

Pre-loads the cloned voice into the inference cache for near-zero latency on the first TTS request.

The system auto-warms the voice before synthesis if the last preload was more than 60 minutes ago. Call this endpoint before high-traffic periods to ensure immediate availability.

{
  "success": true,
  "data": {
    "voice_id": "voice_abc123",
    "preloaded_at": "2026-03-08T14:00:00.000000Z"
  }
}

Synthesize speech

POST /api/v1/voice/tts

Convert text to speech using your cloned voice. Returns a public URL to the generated audio file.

Prerequisites

enabled: true - set via PATCH /voice/settings
has_cloned_voice: true - clone a voice via POST /voice/clone

Field	Type	Required	Notes
`text`	string	✓	1-5000 characters. Arabic fully supported.

curl -X POST https://dashboard.skylightchat.com/api/v1/voice/tts \
  -H "X-Api-Key: ••••••••••••" \
  -H "Content-Type: application/json" \
  -d '{ "text": "مرحباً بك في شركتنا، كيف يمكنني مساعدتك؟" }'

{
  "success": true,
  "data": {
    "audio_url": "https://dashboard.skylightchat.com/public/client/voice_samples/tts_9_abc.m4a",
    "format": "m4a",
    "text_length": 42
  }
}

The audio URL is publicly accessible. Audio is returned as M4A (AAC) for maximum compatibility with messaging platforms.

Delete voice

DELETE /api/v1/voice

Permanently deletes your cloned voice and clears all voice metadata from your account.

This action is irreversible. You must clone a new voice to re-enable synthesis.

Returns 204 No Content.

Knowledge Base

Manage the AI knowledge base that powers automated responses across your channels.

Media

Upload and manage your image and video library via the API.