plugin voice - nself-org/cli GitHub Wiki

Voice Plugin

Speech-to-text, text-to-speech, and voicebot with multi-provider support. Pro plugin.

Requires: Basic license tier or higher. nself license set nself_pro_...

Install

nself license set nself_pro_xxxxx...
nself plugin install voice

What It Does

Provides a unified REST API for voice processing: transcribes audio to text (STT via Whisper and other providers) and converts text to speech (TTS). Includes a voicebot mode that chains STT → AI reasoning → TTS for voice-interactive applications. Records and stores voice sessions.

Configuration

Env Var	Default	Description
`VOICE_PORT`	`3714`	Voice service port
`VOICE_STT_PROVIDER`	`whisper`	STT provider: `whisper`, `deepgram`, `assembly`
`VOICE_TTS_PROVIDER`	`openai`	TTS provider: `openai`, `elevenlabs`, `azure`
`OPENAI_API_KEY`	—	Required for OpenAI STT/TTS
`DEEPGRAM_API_KEY`	—	Deepgram API key (optional)
`ELEVENLABS_API_KEY`	—	ElevenLabs API key (optional)
`VOICE_RECORDING_ENABLED`	`true`	Store audio recordings

Ports

Port	Purpose
3714	Voice service REST API

Database Tables

3 tables added to your Postgres database:

np_voice_sessions, voice session records
np_voice_transcripts, STT transcription results
np_voice_recordings, stored audio references

Nginx Routes

Route	Target
`/voice/`	Voice processing API