Release Notes - cho407/CaptionMate GitHub Wiki

Release Notes

Version 1.0.0 - Initial Release 🎉

Release Date: January 2025

✨ New Features

Core Functionality

  • 🎙️ Automatic Speech Recognition
    • High-quality transcription using OpenAI's Whisper model
    • Support for multiple languages
    • On-device processing for privacy

Subtitle Export Formats

  • 📝 SRT (SubRip): Universal subtitle format
  • 🌐 WebVTT: Web-based subtitle format
  • 📊 JSON: Structured data format with timestamps
  • 🎬 Final Cut Pro XML: Professional video editing integration

User Interface

  • 🎨 Modern macOS Design

    • Native SwiftUI interface
    • Light and Dark mode support
    • Intuitive drag-and-drop file import
  • 🎵 Audio Player

    • Built-in audio preview
    • Waveform visualization
    • Playback speed control (0.25x - 2.0x)
    • Volume normalization

Model Management

  • 📦 Flexible Model Selection
    • Multiple Whisper model sizes (Tiny, Base, Small, Medium, Large)
    • Download and manage models locally
    • Automatic model optimization for Apple Silicon

Advanced Settings

  • ⚙️ Transcription Quality Control

    • Adjustable temperature settings
    • Compression check tokens
    • Fallback count configuration
    • Sample length customization
  • 🔧 Performance Optimization

    • Chunking strategy selection
    • Concurrent worker count
    • Compute unit selection (CPU/GPU/Neural Engine)

Localization

  • 🌐 Multi-Language Interface
    • English
    • Korean (한국어)
    • Easy language switching from menu bar

Privacy & Security

  • 🔒 Privacy-First Design
    • All processing done on-device
    • No data collection or tracking
    • No internet connection required (except for model downloads)
    • App Sandbox and Hardened Runtime enabled

🛠️ Technical Improvements

  • Architecture: Clean MVVM architecture for maintainability
  • Performance: Optimized for Apple Silicon (M1/M2/M3)
  • Stability: Comprehensive error handling and recovery
  • Testing: Unit and UI tests included

📋 System Requirements

  • macOS: 15.0 (Sequoia) or later
  • Processor: Apple Silicon (M1 or later) or Intel with Neural Engine
  • Memory: 8GB RAM minimum (16GB recommended for large models)
  • Storage: 500MB - 3GB (depending on model size)

Thank you for using CaptionMate!

For bug reports or feature requests, please contact: [email protected]