# AI Setup Guide
EXStreamTV uses AI to help create channels, troubleshoot issues, and analyze logs. This guide covers configuring AI providers. For the bounded agent, containment, and safety model, see the Platform Guide.
## Table of Contents
- Overview
- Bounded Agent & Safety
- Provider Options
- Cloud AI Setup
- Local AI Setup
- Hybrid Mode
- Configuration Reference
- Troubleshooting
## Overview
EXStreamTV supports three AI provider modes:
| Mode | Description | Best For |
|---|---|---|
| Cloud | Uses cloud AI services (Groq, SambaNova, OpenRouter) | Most users - fast setup, no local resources needed |
| Local | Runs AI models on your Mac via Ollama | Privacy-focused users, offline usage |
| Hybrid | Cloud primary with local fallback | Best reliability - works offline when cloud unavailable |
All cloud providers offer free tiers - no credit card required.
## Bounded Agent & Safety

### Bounded Agent
The bounded agent is a deterministic loop that runs up to 3 steps. It does not write to the database, call tools from tools, or restart channels directly—all restarts go through the guarded request_channel_restart path. This prevents runaway automation.
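The three-step bound can be pictured with a short sketch. This is illustrative Python, not EXStreamTV's actual implementation; the function names are assumptions.

```python
# Illustrative sketch only -- not EXStreamTV's real code. Shows a
# deterministic loop capped at 3 steps that never restarts a channel
# directly; it only files a guarded restart request.
MAX_STEPS = 3

def run_bounded_agent(plan_step, request_channel_restart):
    """plan_step() -> (action, done); restarts go through the guard."""
    actions = []
    for _ in range(MAX_STEPS):
        action, done = plan_step()
        if action == "restart":
            request_channel_restart()  # guarded path, never direct
        actions.append(action)
        if done:
            break
    return actions
```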
### Containment Mode
When the system is stressed (restart velocity ≥ 10/60s, pool pressure ≥ 90%, circuit breaker open, or memory leak detected), the agent enters containment mode. No tools run; the agent escalates to the operator. See Platform Guide §5.
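The containment triggers above amount to a simple predicate. A minimal sketch, with names and units assumed (thresholds come from the text):

```python
# Illustrative predicate for the documented containment triggers.
# Parameter names are assumptions; thresholds come from the docs.
def in_containment(restarts_last_60s, pool_pressure, breaker_open, memory_leak):
    return (restarts_last_60s >= 10      # restart velocity >= 10/60s
            or pool_pressure >= 0.90     # pool pressure >= 90%
            or breaker_open              # circuit breaker open
            or memory_leak)              # memory leak detected
```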
### Confidence Gating
Metadata tools require a minimum confidence (0.3) before running. If a metadata tool fails, confidence decays. After 3 consecutive failures, the agent stops. This limits ineffective retries.
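A sketch of how gating and decay might interact, using the documented thresholds (0.3 minimum confidence, stop after 3 failures). The decay factor itself is an assumed value for illustration:

```python
# Sketch of confidence gating with decay. The 0.3 floor and the
# 3-failure stop come from the docs; the 0.5 decay factor is assumed.
MIN_CONFIDENCE = 0.3
MAX_FAILURES = 3
DECAY = 0.5  # assumption for illustration

def should_run(confidence, consecutive_failures):
    return confidence >= MIN_CONFIDENCE and consecutive_failures < MAX_FAILURES

def after_failure(confidence, consecutive_failures):
    """Decay confidence and count the failure."""
    return confidence * DECAY, consecutive_failures + 1
```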
### Safely Disabling AI
| Config Key | Effect |
|---|---|
| `ai_agent.enabled: false` | Disables all AI features |
| `ai_agent.bounded_agent_enabled: false` | Disables bounded agent (default) |
| `ai_agent.metadata_self_resolution_enabled: false` | Disables metadata self-resolution (default) |
Set these in config.yaml when you want to run without AI automation. Channel creation and troubleshooting can still use AI when invoked manually, depending on provider setup.
## Provider Options

### Cloud Providers Comparison
| Provider | Free Tier | Speed | Models | Signup Time |
|---|---|---|---|---|
| Groq | 14,400 req/day, 30 req/min | Ultra-fast | Llama 3.3 70B, Mixtral | 30 seconds |
| SambaNova | 1M tokens/day | Fast | Llama 3.3 70B, DeepSeek | 1 minute |
| OpenRouter | $5 credit | Varies | 100+ models | 2 minutes |
### Local Models by RAM
| RAM | Recommended Model | Size | Capabilities |
|---|---|---|---|
| 4GB | phi4-mini:3.8b-q4 | 2.5GB | Basic channel creation, troubleshooting |
| 6-8GB | granite3.1:2b-instruct | 2GB | Full troubleshooting, tool calling |
| 8-16GB | qwen2.5:7b | 4.4GB | All features, excellent JSON output |
| 16GB+ | qwen2.5:14b | 9GB | Best quality, all 6 channel personas |
| 32GB+ | qwen2.5-coder:32b | 18GB | Power users, complex schedules |
## Cloud AI Setup

### Groq (Recommended)
Groq offers the fastest inference and a generous free tier. This is the recommended choice for most users.
#### Step 1: Get Your API Key

- Go to console.groq.com
- Sign in with Google or GitHub (takes 10 seconds)
- Click Create API Key
- Copy your API key (starts with `gsk_`)
#### Step 2: Configure EXStreamTV

**Option A: Environment Variable**

```bash
export GROQ_API_KEY="gsk_your_key_here"
```

Add to your shell profile (`~/.zshrc` or `~/.bashrc`) for persistence.
**Option B: Configuration File**

Edit `config.yaml`:

```yaml
ai_agent:
  enabled: true
  provider_type: "cloud"
  cloud:
    provider: "groq"
    api_key: "gsk_your_key_here"
    model: "llama-3.3-70b-versatile"
```
**Option C: macOS App Settings**
- Open EXStreamTV menu bar app
- Go to Settings > AI tab
- Select Cloud AI
- Choose Groq
- Paste your API key
- Click Validate & Save
#### Groq Free Tier Limits
- 30 requests per minute
- 14,400 requests per day
- 6,000 tokens per minute
- No credit card required
### SambaNova
SambaNova offers 1 million tokens per day free, making it an excellent backup provider.
#### Step 1: Get Your API Key
- Go to cloud.sambanova.ai
- Create an account
- Navigate to API Keys section
- Generate a new key
#### Step 2: Configure

```yaml
ai_agent:
  provider_type: "cloud"
  cloud:
    provider: "sambanova"
    api_key: "your_sambanova_key"
    model: "Meta-Llama-3.3-70B-Instruct"
```

Or set an environment variable:

```bash
export SAMBANOVA_API_KEY="your_key_here"
```
### OpenRouter
OpenRouter provides access to 100+ models with $5 free credit on signup.
#### Step 1: Get Your API Key
- Go to openrouter.ai/keys
- Sign in with Google or GitHub
- Create an API key
#### Step 2: Configure

```yaml
ai_agent:
  provider_type: "cloud"
  cloud:
    provider: "openrouter"
    api_key: "sk-or-your_key_here"
    model: "meta-llama/llama-3.3-70b-instruct"
```
#### Free Models on OpenRouter

- `meta-llama/llama-3.2-3b-instruct:free`
- `mistralai/mistral-7b-instruct:free`
- `google/gemini-2.0-flash-exp:free`
## Local AI Setup

Local AI runs entirely on your Mac using Ollama. No internet required after setup.

### Installing Ollama
**Option 1: Homebrew (Recommended)**

```bash
brew install ollama
```

**Option 2: Direct Download**

- Go to ollama.com/download
- Download and install the macOS app

**Option 3: Script Installation**

```bash
curl -fsSL https://ollama.com/install.sh | sh
```
### Starting Ollama

Ollama runs as a background service:

```bash
# Start Ollama service
ollama serve
```
Or launch the Ollama app, which starts the service automatically.
### Choosing a Model

Check your Mac's RAM to determine the best model:

```bash
# Check RAM
sysctl -n hw.memsize | awk '{print int($1/1024/1024/1024) " GB"}'
```
**Model Recommendations:**

| Your RAM | Model Command | Use Case |
|---|---|---|
| 4GB | `ollama pull phi4-mini:3.8b-q4` | Basic AI features |
| 8GB | `ollama pull qwen2.5:7b` | Full functionality |
| 16GB | `ollama pull qwen2.5:14b` | Best quality |
| 32GB+ | `ollama pull qwen2.5-coder:32b` | Power users |
### Downloading Models

```bash
# Download recommended model for 8GB RAM
ollama pull qwen2.5:7b

# Verify installation
ollama list
```
### Configure for Local AI

```yaml
ai_agent:
  enabled: true
  provider_type: "local"
  local:
    host: "http://localhost:11434"
    model: "qwen2.5:7b"  # or "auto" for automatic selection
```
Setting `model: "auto"` automatically selects the best model based on your RAM.
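As a rough illustration of what "auto" selection could do, here is a sketch that maps the RAM tiers from the table above to models. The function is hypothetical, not EXStreamTV's actual selector:

```python
# Hypothetical "auto" selector mirroring the RAM table above.
def auto_select_model(ram_gb):
    if ram_gb >= 32:
        return "qwen2.5-coder:32b"
    if ram_gb >= 16:
        return "qwen2.5:14b"
    if ram_gb >= 8:
        return "qwen2.5:7b"
    if ram_gb >= 6:
        return "granite3.1:2b-instruct"
    return "phi4-mini:3.8b-q4"
```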
## Hybrid Mode

Hybrid mode uses cloud AI when available and falls back to local when offline.

### Configuration
```yaml
ai_agent:
  provider_type: "hybrid"
  cloud:
    provider: "groq"
    api_key: "${GROQ_API_KEY}"
    model: "llama-3.3-70b-versatile"
    # Optional fallback providers
    fallback:
      - provider: "sambanova"
        api_key: "${SAMBANOVA_API_KEY}"
        model: "Meta-Llama-3.3-70B-Instruct"
  local:
    host: "http://localhost:11434"
    model: "auto"
```
### How Hybrid Mode Works

1. Attempts the cloud provider first (Groq)
2. If the primary fails, tries fallback cloud providers in order
3. If all cloud providers fail, uses local Ollama
4. Returns an error only if every provider fails
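The fallback order above can be sketched as a simple loop. This is illustrative only; real provider clients are richer objects than the plain callables used here:

```python
# Illustrative fallback chain: primary cloud, then fallback clouds,
# then local Ollama. Providers here are plain callables.
def hybrid_complete(prompt, cloud_providers, local_provider):
    errors = []
    for provider in cloud_providers:      # primary first, then fallbacks
        try:
            return provider(prompt)
        except Exception as exc:
            errors.append(exc)
    try:
        return local_provider(prompt)     # offline/local fallback
    except Exception as exc:
        errors.append(exc)
    raise RuntimeError(f"all providers failed: {errors}")
```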
## Configuration Reference

### Complete config.yaml Example
```yaml
ai_agent:
  enabled: true

  # Provider selection: cloud, local, or hybrid
  provider_type: "cloud"

  # Cloud provider configuration
  cloud:
    provider: "groq"  # groq, sambanova, or openrouter
    api_key: "${GROQ_API_KEY}"
    model: "llama-3.3-70b-versatile"

    # Fallback providers (optional)
    fallback:
      - provider: "sambanova"
        api_key: "${SAMBANOVA_API_KEY}"
        model: "Meta-Llama-3.3-70B-Instruct"

  # Local provider configuration
  local:
    host: "http://localhost:11434"
    model: "auto"  # or a specific model like "qwen2.5:7b"

  # Model settings
  settings:
    temperature: 0.3  # Lower = more deterministic
    max_tokens: 4096
    timeout: 30  # seconds
```
### Environment Variables

| Variable | Description |
|---|---|
| `GROQ_API_KEY` | Groq Cloud API key |
| `SAMBANOVA_API_KEY` | SambaNova API key |
| `OPENROUTER_API_KEY` | OpenRouter API key |
| `OLLAMA_URL` | Custom Ollama URL (default: `http://localhost:11434`) |
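The config examples use `${VAR}` placeholders. Whether EXStreamTV expands these internally or relies on your shell is not specified here; as a sketch, generic placeholder expansion looks like this:

```python
import os
import re

# Generic ${VAR} expansion over a config string; unset variables
# become empty strings here (a policy choice, not EXStreamTV's).
def expand_env(value):
    return re.sub(r"\$\{(\w+)\}",
                  lambda m: os.environ.get(m.group(1), ""),
                  value)
```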
## Troubleshooting

### Cloud AI Issues

#### "Invalid API key" Error
- Verify the key is correct (no extra spaces)
- Check the key hasn't expired
- Ensure you're using the right provider's key
#### "Rate limit exceeded"
- Wait a few minutes and try again
- Consider adding fallback providers
- Switch to a different cloud provider
#### "Connection refused"
- Check your internet connection
- Verify the provider's service status
- Try a different provider
### Local AI Issues

#### "Ollama not found"
```bash
# Check if Ollama is installed
which ollama

# Install if missing
brew install ollama
```
#### "Model not found"

```bash
# List installed models
ollama list

# Pull the required model
ollama pull qwen2.5:7b
```
#### "Connection refused to localhost:11434"

```bash
# Start Ollama service
ollama serve

# Or check if already running
pgrep -f ollama
```
#### Slow performance with local AI
- Choose a smaller model appropriate for your RAM
- Close other memory-intensive applications
- Consider using cloud AI for better performance
### General Issues

#### AI features not working

- Check that `ai_agent.enabled: true` is set in `config.yaml`
- Verify at least one provider is configured
- Check the logs: Settings > Logs
#### "AI configuration incomplete"
- Ensure you've entered an API key (cloud) or installed a model (local)
- Run the onboarding wizard again from Settings
## AI Self-Healing System (NEW in v2.6.0)

EXStreamTV v2.6.0 introduces an AI-powered self-healing system that autonomously detects and resolves streaming issues.

### Overview
```mermaid
flowchart TB
    subgraph "Detection"
        Logs[Application Logs]
        FFmpeg[FFmpeg Stderr]
    end
    subgraph "Analysis"
        Collector[UnifiedLogCollector]
        Monitor[FFmpegAIMonitor]
        Detector[PatternDetector]
    end
    subgraph "Resolution"
        Resolver[AutoResolver]
        ErrorScreen[ErrorScreenGenerator]
    end
    Logs --> Collector
    FFmpeg --> Monitor
    Collector --> Detector
    Monitor --> Detector
    Detector --> Resolver
    Resolver --> ErrorScreen
```
### Configuration
Add to your config.yaml:
```yaml
ai_auto_heal:
  enabled: true

  # Log collection
  log_buffer_minutes: 30
  realtime_streaming: true

  # FFmpeg monitoring
  ffmpeg_monitor_enabled: true
  ffmpeg_health_threshold: 0.8  # Speed threshold for healthy

  # Pattern detection
  pattern_detection_enabled: true
  prediction_confidence_threshold: 0.75

  # Auto resolution
  auto_resolve_enabled: true
  max_auto_fixes_per_hour: 50
  require_approval_above_risk: "MEDIUM"  # SAFE, LOW, MEDIUM, HIGH

  # Zero-downtime features
  use_error_screen_fallback: true
  hot_swap_enabled: true

  # Learning
  learning_enabled: true
```
### Components

#### 1. Unified Log Collector
Aggregates logs from all sources in real-time:
- Application logs
- FFmpeg stderr
- Plex/Jellyfin events
- System logs
```python
# Access the log collector
from exstreamtv.ai_agent.unified_log_collector import get_log_collector

collector = get_log_collector()

# Get recent errors
errors = collector.get_recent_errors(minutes=60)

# Get a context window for a channel
context = collector.get_context_window(channel_id=1, minutes=5)
```
#### 2. FFmpeg AI Monitor
Monitors FFmpeg processes for issues:
| Metric | Description | Healthy Range |
|---|---|---|
| FPS | Frames per second | 25-30 |
| Speed | Encoding speed | 0.95-1.1x |
| Bitrate | Output bitrate | Target ±20% |
| Dropped Frames | Frame drops | < 10/minute |
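The ranges above translate into a straightforward health predicate. A minimal sketch, with field names assumed:

```python
# Illustrative health predicate using the ranges in the table above.
def ffmpeg_healthy(fps, speed, bitrate, target_bitrate, dropped_per_min):
    return (25 <= fps <= 30                                          # FPS
            and 0.95 <= speed <= 1.1                                 # encoding speed
            and abs(bitrate - target_bitrate) <= 0.2 * target_bitrate  # within +/-20%
            and dropped_per_min < 10)                                # frame drops
```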
#### 3. Pattern Detector
Detects known problematic patterns:
| Pattern | Indicators | Risk Level |
|---|---|---|
| DB Pool Exhaustion | "pool exhausted", "connection timeout" | HIGH |
| FFmpeg Degradation | "speed < 1.0x", "dropping frames" | MEDIUM |
| URL Expiration | "403 forbidden", "token expired" | MEDIUM |
| Network Instability | "connection reset", "timeout" | HIGH |
| Memory Pressure | "out of memory" | CRITICAL |
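Pattern detection of this kind is typically a regex scan over log lines. A sketch using a few of the indicators above; the structure and names are illustrative, not the real `PatternDetector`:

```python
import re

# Illustrative detector: scan a log line against a few of the
# documented indicator strings. Names and structure are assumptions.
PATTERNS = {
    "db_pool_exhaustion": (r"pool exhausted|connection timeout", "HIGH"),
    "url_expiration": (r"403 forbidden|token expired", "MEDIUM"),
    "memory_pressure": (r"out of memory", "CRITICAL"),
}

def detect(log_line):
    hits = []
    for name, (pattern, risk) in PATTERNS.items():
        if re.search(pattern, log_line, re.IGNORECASE):
            hits.append((name, risk))
    return hits
```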
#### 4. Auto Resolver
Automatically resolves detected issues:
| Issue Type | Strategy | Downtime |
|---|---|---|
| FFmpeg Hang | Restart | ~2 sec |
| URL Expired | Refresh | 0 sec |
| DB Pool Exhausted | Expand | 0 sec |
| Source Unavailable | Fallback | 0 sec |
### Error Screen Fallback
During issue resolution, clients receive an error screen instead of a broken stream:
```yaml
# Visual modes: text, static, test_pattern, slate
# Audio modes: silent, sine_wave, white_noise, beep
ai_auto_heal:
  use_error_screen_fallback: true
```
### Monitoring AI Health

Check the AI system status:

```bash
# Via API
curl http://localhost:8411/api/ai/health

# Via WebUI
# Navigate to Settings > AI > Health Status
```
### Risk Levels
Configure which fixes require approval:
| Risk Level | Example Fixes | Default Behavior |
|---|---|---|
| SAFE | Refresh URL, Switch fallback | Auto-apply |
| LOW | Expand pool, Restart channel | Auto-apply |
| MEDIUM | Reduce load, Kill process | Requires approval |
| HIGH | Full restart, Database restore | Requires approval |
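The gate itself is an ordering comparison. A sketch that assumes `require_approval_above_risk` means "this risk level and above need approval," which matches the defaults in the table (MEDIUM and HIGH require approval):

```python
# Sketch of risk gating; assumes the threshold is inclusive, which
# matches the default behavior shown in the table above.
RISK_ORDER = ["SAFE", "LOW", "MEDIUM", "HIGH"]

def needs_approval(fix_risk, threshold="MEDIUM"):
    return RISK_ORDER.index(fix_risk) >= RISK_ORDER.index(threshold)
```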
## Database Backup (NEW in v2.6.0)
EXStreamTV now includes automatic database backup:
```yaml
database_backup:
  enabled: true
  backup_directory: "backups"
  interval_hours: 24  # Backup every 24 hours
  keep_count: 7       # Keep 7 most recent
  keep_days: 30       # Delete older than 30 days
  compress: true      # Gzip compression
```
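The retention settings combine a count limit and an age limit. A sketch of one plausible pruning rule, assuming the age cutoff also applies to backups within the most recent seven (the docs don't specify how the two limits interact):

```python
import time

# Sketch of retention pruning: keep the `keep_count` newest backups
# and delete anything older than `keep_days`. Takes (name, mtime)
# pairs and returns the set of names to delete.
def backups_to_delete(backups, keep_count=7, keep_days=30, now=None):
    now = time.time() if now is None else now
    newest_first = sorted(backups, key=lambda b: b[1], reverse=True)
    doomed = {name for name, _ in newest_first[keep_count:]}
    cutoff = now - keep_days * 86400
    doomed |= {name for name, mtime in backups if mtime < cutoff}
    return doomed
```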
### Manual Backup

```bash
# Via API
curl -X POST http://localhost:8411/api/database/backup

# Via WebUI
# Navigate to Settings > Database > Create Backup
```
## Next Steps
- Quick Start Guide - Create your first channel
- Onboarding Guide - Complete setup wizard
- macOS App Guide - Use the menu bar app
- Tunarr/dizqueTV Integration - v2.6.0 technical details

*Last Revised: 2026-03-01*