nemotron-speech-streaming-es-0.6b-ft-cuda-gpu

Overview

This model is a fine-tuned derivative of nemotron-speech-streaming-en-0.6b, adapted for Spanish speech recognition and optimized for local inference on GPUs.

Model Description

  • Developed by: Microsoft
  • Model type: ONNX
  • License: MIT
  • Model Description: Derived from nemotron-speech-streaming-en-0.6b and fine-tuned for Spanish speech recognition, then converted and optimized for efficient local inference on GPUs.
  • Disclaimer: This model is a fine-tuned and optimized derivative of the base model, adapted for Spanish speech recognition. Due to this adaptation, its capabilities differ substantially from the original English model and are specialized for Spanish-language use cases. The model may not perform as expected on other languages or tasks. Users are responsible for evaluating the model in their specific application context.
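Because the model type is ONNX and the target is a GPU, one common way to run such a model locally is ONNX Runtime with its CUDA execution provider. The page does not say which runtime is intended, so the following is only a minimal sketch under that assumption. `choose_providers` is a hypothetical helper; the provider names `CUDAExecutionProvider` and `CPUExecutionProvider` are the ones ONNX Runtime uses, and the fallback order is an illustrative choice.

```python
from typing import Iterable, List

# Assumption: prefer CUDA and fall back to CPU. This order is an
# illustrative choice, not something stated on the model page.
PREFERRED_PROVIDERS = ("CUDAExecutionProvider", "CPUExecutionProvider")


def choose_providers(available: Iterable[str]) -> List[str]:
    """Return the preferred providers that are actually available, in order."""
    available_set = set(available)
    return [name for name in PREFERRED_PROVIDERS if name in available_set]


if __name__ == "__main__":
    try:
        import onnxruntime as ort  # optional dependency, assumed runtime
    except ImportError:
        print("onnxruntime is not installed; nothing to query")
    else:
        # get_available_providers() lists the providers in this build.
        print(choose_providers(ort.get_available_providers()))
```

The resulting list could then be passed as the `providers` argument of `onnxruntime.InferenceSession`. The model file names and input tensors are not documented here, so they are left out.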

Base Model Information

For details on the base model, see its Hugging Face page: https://huggingface.co/nvidia/nemotron-speech-streaming-en-0.6b

Version: 1

Tags

  • capabilities
  • supportsReasoning
  • reasoningStart
  • reasoningEnd
  • contextLength : 0
  • minFLVersion : 1.1.0
  • disable-maap : true

View in Studio: https://ml.azure.com/registries/azureml/models/nemotron-speech-streaming-es-0.6b-ft-cuda-gpu/version/1
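
The Studio link above encodes the registry, model name, and version in its path. As a rough sketch, those parts can be extracted and rewritten as an `azureml://` asset ID for tooling that expects one. `RegistryModel`, `parse_studio_url`, and `to_asset_id` are hypothetical names introduced for illustration, and the asset ID layout is assumed to follow the usual `azureml://registries/<registry>/models/<name>/versions/<version>` form.

```python
import re
from typing import NamedTuple


class RegistryModel(NamedTuple):
    registry: str
    name: str
    version: str


# Pattern matching the Studio URL shape shown on this page.
_STUDIO_URL = re.compile(
    r"^https://ml\.azure\.com/registries/(?P<registry>[^/]+)"
    r"/models/(?P<name>[^/]+)/version/(?P<version>[^/?#]+)"
)


def parse_studio_url(url: str) -> RegistryModel:
    """Split a registry model Studio URL into registry, name, and version."""
    match = _STUDIO_URL.match(url)
    if match is None:
        raise ValueError(f"not a registry model URL: {url!r}")
    return RegistryModel(**match.groupdict())


def to_asset_id(model: RegistryModel) -> str:
    """Build an azureml:// asset ID (assumed layout, see lead-in)."""
    return (
        f"azureml://registries/{model.registry}"
        f"/models/{model.name}/versions/{model.version}"
    )


model = parse_studio_url(
    "https://ml.azure.com/registries/azureml/models/"
    "nemotron-speech-streaming-es-0.6b-ft-cuda-gpu/version/1"
)
print(model)
print(to_asset_id(model))
```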
