models Phi 3.5 mini instruct - Azure/azureml-assets GitHub Wiki

Phi-3.5-mini-instruct

Overview

Phi-3.5-mini is a lightweight, state-of-the-art open model built upon datasets used for Phi-3 - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data. The model belongs to the Phi-3 model family and supports 128K token context length. The model underwent a rigorous enhancement process, incorporating both supervised fine-tuning, proximal policy optimization, and direct preference optimization to ensure precise instruction adherence and robust safety measures.

Resources

🏡 Phi-3 Portal
📰 Phi-3 Microsoft Blog
📖 Phi-3 Technical Report
👩‍🍳 Phi-3 Cookbook

Model Architecture

Phi-3.5-mini has 3.8B parameters and is a dense decoder-only Transformer model using the same tokenizer as Phi-3 Mini. It is a text-only model best suited for prompts using chat format.

Training Data

Phi-3.5-mini is a static model trained on an offline dataset with 3.4T tokens and a cutoff date October 2023 for publicly available data. Future versions of the tuned models may be released as we improve models.

Version: 6

Tags

maas-inference : true disable-batch : true _aml_system_vanity_registry : azureml-phi maas-finetuning-deploy-regions : ['eastus2', 'eastus', 'northcentralus', 'westus3', 'westus', 'southcentralus'] inference_supported_envs : ['vllm'] model_specific_defaults : {'apply_deepspeed': 'true', 'deepspeed_stage': 2, 'apply_lora': 'true', 'apply_ort': 'false', 'precision': 16, 'max_seq_length': 16384, 'ignore_mismatched_sizes': 'false', 'num_train_epochs': 1, 'per_device_train_batch_size': 1, 'per_device_eval_batch_size': 1, 'gradient_accumulation_steps': 1, 'learning_rate': 2e-05, 'learning_rate_min': 2e-06, 'learning_rate_max': 0.0002, 'lr_scheduler_type': 'cosine', 'logging_strategy': 'steps', 'logging_steps': 10, 'save_total_limit': 1}

View in Studio: https://ml.azure.com/registries/azureml/models/Phi-3.5-mini-instruct/version/6

Properties

SharedComputeCapacityEnabled: True

languages: en

inference-min-sku-spec: 24|1|220|64

inference-recommended-sku: Standard_NC24ads_A100_v4, Standard_NC48ads_A100_v4, Standard_NC96ads_A100_v4, Standard_ND96amsr_A100_v4

finetuning-tasks: chat-completion

finetune-min-sku-spec: 24|1|220|64

finetune-recommended-sku: Standard_NC24ads_A100_v4, Standard_NC48ads_A100_v4, Standard_NC96ads_A100_v4, Standard_ND96amsr_A100_v4

⚠️ **GitHub.com Fallback** ⚠️