Home - jxoesneon/gemini-audio-mcp GitHub Wiki

Welcome to the Gemini Audio MCP Wiki! 🎵

Gemini Audio MCP is a high-performance server designed to bridge the gap between AI assistants and professional-grade audio generation. Whether you're a game developer looking for procedural soundscapes or a power user building the ultimate productivity tool, this wiki will help you get the most out of the system.

🚀 Navigation

🎙️ Prompt Engineering Guide

Learn how to craft the perfect prompts for soundscapes, voices, and music. Understand the nuances of the Gemini 2.0 Live API and Lyria 3 models.

🛠️ Troubleshooting & FFmpeg Setup

Detailed installation guides for all operating systems and solutions to common "FFmpeg not found" errors.

🎛️ Audio Formats Reference

A deep dive into the 10+ supported formats (MP3, OGG, FLAC, OPUS, etc.) and how to optimize for quality vs. file size.

🐳 Docker & Deployment

Advanced configuration for running Gemini Audio MCP in containers or on cloud infrastructure.


🏗️ Technical Foundation

If you are looking for the internal system design, please refer to the Architecture Document in the main repository.

🤝 Community & Support

  • Issues: Report bugs or request features on the GitHub Issues page.
  • Glama: Check our status and scores on Glama.ai.