8. Usage Cookbook

Purpose: Practical examples and patterns for using mcp-ssh-orchestrator in different environments and scenarios.

Overview

This cookbook provides real-world examples and common patterns for using mcp-ssh-orchestrator effectively. Each example includes configuration, commands, and expected results.

Environment Configurations

Development Environment

Use Case: Safe, permissive access for development and testing.

servers.yml:

hosts:
  - alias: "dev-web-1"
    host: "192.168.1.10"
    port: 22
    credentials: "dev_admin"
    tags: ["development", "web", "linux"]
    description: "Development web server"

  - alias: "dev-db-1"
    host: "192.168.1.20"
    port: 22
    credentials: "dev_admin"
    tags: ["development", "database", "linux"]
    description: "Development database"

credentials.yml:

entries:
  - name: "dev_admin"
    username: "developer"
    key_path: "dev_key"
    key_passphrase_secret: ""  # No passphrase for dev
    password_secret: ""

policy.yml:

limits:
  max_seconds: 120
  max_output_bytes: 2097152  # 2MB
  require_known_host: true  # Always enforced for security (CWE-295)

network:
  allow_cidrs:
    - "192.168.0.0/16"
    - "10.0.0.0/8"
  require_known_host: true  # Always enforced for security (CWE-295)

rules:
  # Allow all commands in development
  - action: "allow"
    aliases: ["dev-*"]
    tags: ["development"]
    commands: ["*"]

Usage Examples:

# Test connectivity
ssh_ping
ssh_list_hosts

# Development operations
ssh_run --alias "dev-web-1" --command "docker ps"
ssh_run --alias "dev-web-1" --command "systemctl restart nginx"
ssh_run --alias "dev-db-1" --command "sudo -u postgres psql -c 'SELECT version()'"

# Bulk operations
ssh_run_on_tag --tag "development" --command "uptime"

Staging Environment

Use Case: Moderate security with some operational flexibility.

servers.yml:

hosts:
  - alias: "staging-web-1"
    host: "10.0.1.10"
    port: 22
    credentials: "staging_admin"
    tags: ["staging", "web", "linux"]
    description: "Staging web server"

  - alias: "staging-db-1"
    host: "10.0.1.20"
    port: 22
    credentials: "staging_admin"
    tags: ["staging", "database", "linux"]
    description: "Staging database"

policy.yml:

limits:
  max_seconds: 90
  max_output_bytes: 1048576  # 1MB
  require_known_host: true  # Always enforced for security (CWE-295)

network:
  allow_cidrs:
    - "10.0.0.0/8"
      require_known_host: true  # Always enforced for security (CWE-295)

rules:
  # Read-only commands for all hosts
  - action: "allow"
    aliases: ["*"]
    tags: []
    commands:
      - "uname*"
      - "uptime*"
      - "df -h*"
      - "ps aux*"
      - "systemctl status *"

  # Staging-specific commands
  - action: "allow"
    aliases: ["staging-*"]
    tags: ["staging"]
    commands:
      - "systemctl restart *"
      - "systemctl stop *"
      - "systemctl start *"
      - "docker ps*"
      - "docker logs *"
      - "kubectl get *"
      - "kubectl describe *"

  # Network diagnostics for staging
  - action: "allow"
    aliases: ["staging-*"]
    tags: ["staging"]
    commands:
      - "ping*"
      - "traceroute*"
      - "ss -tulpn*"
      - "netstat*"

Usage Examples:

# Staging operations
ssh_run --alias "staging-web-1" --command "systemctl restart nginx"
ssh_run --alias "staging-web-1" --command "docker ps"
ssh_run --alias "staging-db-1" --command "systemctl status postgresql"

# Testing deployments
ssh_run_on_tag --tag "staging" --command "systemctl restart nginx"
ssh_run_on_tag --tag "staging" --command "docker pull nginx:latest"

Production Environment

Use Case: Strict security with minimal allowed operations.

servers.yml:

hosts:
  - alias: "prod-web-1"
    host: "10.0.0.10"
    port: 22
    credentials: "prod_admin"
    tags: ["production", "web", "linux", "critical"]
    description: "Primary production web server"

  - alias: "prod-web-2"
    host: "10.0.0.11"
    port: 22
    credentials: "prod_admin"
    tags: ["production", "web", "linux", "critical"]
    description: "Secondary production web server"

  - alias: "prod-db-1"
    host: "10.0.0.20"
    port: 22
    credentials: "prod_admin"
    tags: ["production", "database", "linux", "critical"]
    description: "Primary production database"

policy.yml:

limits:
  max_seconds: 30
  max_output_bytes: 131072  # 128KB
  host_key_auto_add: false
  require_known_host: true
  deny_substrings:
    - "rm -rf /"
    - "shutdown*"
    - "reboot*"
    - "systemctl restart*"
    - "systemctl stop*"
    - "systemctl start*"
    - "apt *"
    - "yum *"
    - "docker run*"
    - "kubectl *"

network:
  allow_cidrs:
    - "10.0.0.0/8"
  block_cidrs:
    - "0.0.0.0/0"  # Block all public internet
  require_known_host: true

rules:
  # Minimal read-only commands for production
  - action: "allow"
    aliases: ["prod-*"]
    tags: ["production"]
    commands:
      - "uptime*"
      - "df -h*"
      - "systemctl status *"
      - "journalctl --no-pager -n 20 *"

  # Explicit deny for production
  - action: "deny"
    aliases: ["prod-*"]
    tags: ["production"]
    commands:
      - "systemctl restart*"
      - "systemctl stop*"
      - "systemctl start*"
      - "apt *"
      - "yum *"
      - "docker *"
      - "kubectl *"

overrides:
  aliases:
    prod-db-1:
      max_seconds: 15
      max_output_bytes: 65536
    prod-web-1:
      max_seconds: 20
      max_output_bytes: 131072

Usage Examples:

# Production monitoring only
ssh_run --alias "prod-web-1" --command "uptime"
ssh_run --alias "prod-web-1" --command "df -h"
ssh_run --alias "prod-web-1" --command "systemctl status nginx"

# Bulk monitoring
ssh_run_on_tag --tag "production" --command "uptime"
ssh_run_on_tag --tag "web" --command "systemctl status nginx"

# Policy testing (always test first!)
ssh_plan --alias "prod-web-1" --command "systemctl restart nginx"  # Should be denied

Common Usage Patterns

System Monitoring

Health Checks:

# Basic health check
ssh_ping

# System information
ssh_run --alias "web1" --command "uptime"
ssh_run --alias "web1" --command "df -h"
ssh_run --alias "web1" --command "free -h"

# Service status
ssh_run --alias "web1" --command "systemctl status nginx"
ssh_run --alias "web1" --command "systemctl is-active nginx"
ssh_run --alias "web1" --command "systemctl is-enabled nginx"

Bulk Monitoring:

# Check all production hosts
ssh_run_on_tag --tag "production" --command "uptime"
ssh_run_on_tag --tag "production" --command "df -h"

# Check specific service types
ssh_run_on_tag --tag "web" --command "systemctl status nginx"
ssh_run_on_tag --tag "database" --command "systemctl status postgresql"

Service Management

Service Operations:

# Check service status
ssh_run --alias "web1" --command "systemctl status nginx"
ssh_run --alias "web1" --command "systemctl is-active nginx"

# Restart services (if allowed by policy)
ssh_run --alias "web1" --command "systemctl restart nginx"
ssh_run --alias "web1" --command "systemctl reload nginx"

# Enable/disable services
ssh_run --alias "web1" --command "systemctl enable nginx"
ssh_run --alias "web1" --command "systemctl disable nginx"

Bulk Service Operations:

# Restart all web servers
ssh_run_on_tag --tag "web" --command "systemctl restart nginx"

# Check all database services
ssh_run_on_tag --tag "database" --command "systemctl status postgresql"

Log Analysis

Log Inspection:

# Recent logs
ssh_run --alias "web1" --command "journalctl --no-pager -n 20 nginx"
ssh_run --alias "web1" --command "tail -n 10 /var/log/nginx/access.log"

# Error logs
ssh_run --alias "web1" --command "journalctl --no-pager -p err nginx"
ssh_run --alias "web1" --command "grep ERROR /var/log/nginx/error.log | tail -5"

Bulk Log Analysis:

# Check logs across all web servers
ssh_run_on_tag --tag "web" --command "journalctl --no-pager -n 10 nginx"
ssh_run_on_tag --tag "web" --command "tail -n 5 /var/log/nginx/access.log"

Process Management

Process Information:

# Running processes
ssh_run --alias "web1" --command "ps aux | grep nginx"
ssh_run --alias "web1" --command "ps aux | head -10"

# Process details
ssh_run --alias "web1" --command "top -n 1"
ssh_run --alias "web1" --command "htop -n 1"

Resource Usage:

# Memory usage
ssh_run --alias "web1" --command "free -h"
ssh_run --alias "web1" --command "cat /proc/meminfo | head -5"

# CPU usage
ssh_run --alias "web1" --command "top -n 1 | head -5"
ssh_run --alias "web1" --command "cat /proc/loadavg"

Network Diagnostics

Network Information:

# Network interfaces
ssh_run --alias "web1" --command "ip addr show"
ssh_run --alias "web1" --command "ifconfig"

# Network connections
ssh_run --alias "web1" --command "ss -tulpn"
ssh_run --alias "web1" --command "netstat -tulpn"

# Routing information
ssh_run --alias "web1" --command "ip route show"
ssh_run --alias "web1" --command "route -n"

Connectivity Testing:

# Ping tests
ssh_run --alias "web1" --command "ping -c 3 8.8.8.8"
ssh_run --alias "web1" --command "ping -c 3 google.com"

# Port connectivity
ssh_run --alias "web1" --command "telnet localhost 80"
ssh_run --alias "web1" --command "nc -zv localhost 80"

Advanced Patterns

Policy Testing Workflow

Always test before executing:

# 1. Test policy first
ssh_plan --alias "web1" --command "systemctl restart nginx"

# 2. If allowed, execute
ssh_run --alias "web1" --command "systemctl restart nginx"

# 3. Verify result
ssh_run --alias "web1" --command "systemctl status nginx"

Policy Tuning: Privileged Maintenance Window

Goal: Allow DEBIAN_FRONTEND=noninteractive sudo apt-get upgrade -y on a small set of hosts without loosening global policy.

Edit policy.yml

rules:
  - action: "allow"
    aliases:
      - "docker-prod-manager1"
      - "docker-prod-manager2"
      - "docker-prod-manager3"
    commands:
      - "sudo apt-get update*"
      - "DEBIAN_FRONTEND=noninteractive sudo apt-get upgrade -y*"

overrides:
  aliases:
    docker-prod-manager1:
      max_seconds: 300
      task_result_ttl: 1800

Remove sudo from the global deny_substrings list or override it for these aliases.
Copy the override block for each host that needs the longer timeout/output window.

Reload & dry-run

ssh_reload_config
ssh_plan --alias docker-prod-manager1 \
  --command "DEBIAN_FRONTEND=noninteractive sudo apt-get upgrade -y"

Execute asynchronously

ssh_run_async --alias docker-prod-manager1 \
  --command "DEBIAN_FRONTEND=noninteractive sudo apt-get upgrade -y"
ssh_get_task_status --task-id "<id>"
ssh_get_task_result --task-id "<id>"

Roll back overrides if temporary once the window closes.

Bulk Operations with Error Handling

Safe bulk operations:

# Check all hosts first
ssh_list_hosts

# Test policy for bulk operation
ssh_plan --alias "prod-web-1" --command "systemctl status nginx"

# Execute on subset if needed
ssh_run_on_tag --tag "web" --command "systemctl status nginx"

Configuration Management

Reload configuration:

# After updating policy.yml
ssh_reload_config

# Verify new configuration
ssh_describe_host --alias "web1"

Command Cancellation

Long-running commands:

# Start long-running command
ssh_run --alias "web1" --command "tail -f /var/log/nginx/access.log"

# Cancel if needed (use task_id from response)
ssh_cancel --task_id "web1:a1b2c3d4:1234567890"

Environment-Specific Examples

Web Server Management

Nginx Operations:

# Check configuration
ssh_run --alias "web1" --command "nginx -t"
ssh_run --alias "web1" --command "nginx -T"

# Service management
ssh_run --alias "web1" --command "systemctl status nginx"
ssh_run --alias "web1" --command "systemctl restart nginx"

# Log analysis
ssh_run --alias "web1" --command "tail -n 20 /var/log/nginx/access.log"
ssh_run --alias "web1" --command "tail -n 20 /var/log/nginx/error.log"

Apache Operations:

# Check configuration
ssh_run --alias "web1" --command "apache2ctl configtest"
ssh_run --alias "web1" --command "apache2ctl -S"

# Service management
ssh_run --alias "web1" --command "systemctl status apache2"
ssh_run --alias "web1" --command "systemctl restart apache2"

# Log analysis
ssh_run --alias "web1" --command "tail -n 20 /var/log/apache2/access.log"
ssh_run --alias "web1" --command "tail -n 20 /var/log/apache2/error.log"

Database Management

PostgreSQL Operations:

# Service status
ssh_run --alias "db1" --command "systemctl status postgresql"
ssh_run --alias "db1" --command "systemctl is-active postgresql"

# Database queries
ssh_run --alias "db1" --command "sudo -u postgres psql -c 'SELECT version()'"
ssh_run --alias "db1" --command "sudo -u postgres psql -c 'SELECT current_database()'"

# Connection info
ssh_run --alias "db1" --command "sudo -u postgres psql -c 'SELECT * FROM pg_stat_activity;'"

MySQL Operations:

# Service status
ssh_run --alias "db1" --command "systemctl status mysql"
ssh_run --alias "db1" --command "systemctl is-active mysql"

# Database queries
ssh_run --alias "db1" --command "mysql -e 'SELECT VERSION()'"
ssh_run --alias "db1" --command "mysql -e 'SHOW DATABASES'"

# Connection info
ssh_run --alias "db1" --command "mysql -e 'SHOW PROCESSLIST'"

Container Management

Docker Operations:

# Container status
ssh_run --alias "web1" --command "docker ps"
ssh_run --alias "web1" --command "docker ps -a"

# Container logs
ssh_run --alias "web1" --command "docker logs nginx"
ssh_run --alias "web1" --command "docker logs --tail 20 nginx"

# Container stats
ssh_run --alias "web1" --command "docker stats --no-stream"

Kubernetes Operations:

# Pod status
ssh_run --alias "k8s-node-1" --command "kubectl get pods"
ssh_run --alias "k8s-node-1" --command "kubectl get pods -o wide"

# Service status
ssh_run --alias "k8s-node-1" --command "kubectl get services"
ssh_run --alias "k8s-node-1" --command "kubectl get deployments"

# Node information
ssh_run --alias "k8s-node-1" --command "kubectl get nodes"
ssh_run --alias "k8s-node-1" --command "kubectl describe node k8s-node-1"

Troubleshooting Patterns

Connection Issues

Test connectivity:

# Basic connectivity
ssh_ping
ssh_list_hosts

# Host details
ssh_describe_host --alias "web1"

# Test policy
ssh_plan --alias "web1" --command "uptime"

Policy Issues

Debug policy decisions:

# Test specific commands
ssh_plan --alias "web1" --command "uptime"
ssh_plan --alias "web1" --command "systemctl restart nginx"

# Check policy configuration
ssh_describe_host --alias "web1"

Performance Issues

Monitor resource usage:

# System resources
ssh_run --alias "web1" --command "uptime"
ssh_run --alias "web1" --command "df -h"
ssh_run --alias "web1" --command "free -h"

# Process information
ssh_run --alias "web1" --command "ps aux | head -10"
ssh_run --alias "web1" --command "top -n 1"

Inspector + Manual Validation Checklist

Use this workflow whenever you need to validate a release, exercise new resources, or reproduce bugs:

Build the image
```
scripts/docker-build.sh
```
Run MCP Inspector against the container
```
scripts/docker-smoketest.sh
```
The helper script mirrors the bundled examples into a temporary directory, mounts them into Docker, and launches npx @modelcontextprotocol/inspector docker run ... so you can drive the stdio server interactively.
Resource tour
- Browse ssh://hosts, ssh://host/{alias}, ssh://host/{alias}/tags, and ssh://host/{alias}/capabilities
- Confirm has_credentials_ref shows credential presence without revealing names or secrets
Tool smoke tests
- ssh_plan → ssh_run for an allowed path
- ssh_plan → ssh_run for a denied path (policy + network)
- ssh_run_on_tag for a populated tag and a tag with zero matches
- Async lifecycle: ssh_run_async, ssh_get_task_status, ssh_get_task_output, ssh_get_task_result, ssh_cancel/ssh_cancel_async_task
- ssh_reload_config after editing the mounted config dir
- Confirm that policy/network denials include the hint field (and that ssh_plan returns why + hint when blocked) so LLM clients learn to re-run ssh_plan, consult the orchestrator prompts, or escalate with a policy-update discussion instead of looping on a forbidden command
Context logging + observability
- Watch Inspector console for new ctx.debug / ctx.info events (task creation/completion, cancellation, reload)
- Tail docker logs -f <container> to capture policy_decision, audit, progress, and trace entries for success, failure, denied, and cancelled flows
Manual checklist recap
- Resource browsing complete
- Allowed + denied commands verified
- Tag fan-out path validated
- Async lifecycle exercised end-to-end
- Cancellation + reload flows tested
- Logs reviewed for all paths, including security denials

Capture the commands you ran (or Inspector screenshots) in PR descriptions to show the release has been validated end-to-end.

Next Steps

Tools Reference - Complete tool documentation
Configuration - Configuration system details
Troubleshooting - Common issues and solutions
Deployment - Production deployment examples

08 Usage Cookbook - samerfarida/mcp-ssh-orchestrator GitHub Wiki

8. Usage Cookbook

Overview

Environment Configurations

Development Environment

Staging Environment

Production Environment

Common Usage Patterns

System Monitoring

Service Management

Log Analysis

Process Management

Network Diagnostics

Advanced Patterns

Policy Testing Workflow

Policy Tuning: Privileged Maintenance Window

Bulk Operations with Error Handling

Configuration Management

Command Cancellation

Environment-Specific Examples

Web Server Management

Database Management

Container Management

Troubleshooting Patterns

Connection Issues

Policy Issues

Performance Issues

Inspector + Manual Validation Checklist

Next Steps

⚠️ GitHub.com Fallback ⚠️

08 Usage Cookbook - samerfarida/mcp-ssh-orchestrator GitHub Wiki

8. Usage Cookbook

Overview

Environment Configurations

Development Environment

Staging Environment

Production Environment

Common Usage Patterns

System Monitoring

Service Management

Log Analysis

Process Management

Network Diagnostics

Advanced Patterns

Policy Testing Workflow

Policy Tuning: Privileged Maintenance Window

Bulk Operations with Error Handling

Configuration Management

Command Cancellation

Environment-Specific Examples

Web Server Management

Database Management

Container Management

Troubleshooting Patterns

Connection Issues

Policy Issues

Performance Issues

Inspector + Manual Validation Checklist

Next Steps

⚠️ **GitHub.com Fallback** ⚠️

⚠️ GitHub.com Fallback ⚠️