components aoai_finetuning - Azure/azureml-assets GitHub Wiki

AOAI Finetuning Job

aoai_finetuning

Overview

Uploads training data to an Azure OpenAI (AOAI) resource, fine-tunes the selected model, and then deletes the uploaded data.

Version: 0.0.12

View in Studio: https://ml.azure.com/registries/azureml/components/aoai_finetuning/version/0.0.12

Inputs

| Name | Description | Type | Default | Optional | Enum |
| --- | --- | --- | --- | --- | --- |
| endpoint_name | The endpoint name or AOAI resource name. | string | | False | |
| endpoint_resource_group | Resource group for the AOAI resource. | string | | True | |
| endpoint_subscription | Subscription for the AOAI resource. | string | | True | |
| training_file_path | jsonl source file/folder for training dataset. | uri_file | | True | |
| validation_file_path | source file/folder for validation dataset. | uri_file | | True | |
| training_import_path | json file containing training data uri or key for stored uri in keyvault. | uri_file | | True | |
| validation_import_path | json file containing validation data uri or key for stored uri in keyvault. | uri_file | | True | |
| model | GPT model engine | string | gpt-35-turbo-0613 | False | |
| task_type | Dataset type - chat or completion | string | | False | ['chat', 'completion'] |
| n_epochs | Number of training epochs. If not provided, it will be determined dynamically based on the input data. | integer | | True | |
| batch_size | Global batch size. If not provided, it will be determined dynamically based on the input data. | integer | | True | |
| learning_rate_multiplier | The learning rate multiplier to use for training. If not provided, it will be determined dynamically based on the input data. | number | | True | |
| suffix | A string of up to 18 characters that will be added to your fine-tuned model name | string | | True | |
| lora_dim | The size of LoRA dimensions in self attention layer. If not provided, it will be determined dynamically. | integer | | True | |
| n_ctx | Context length of the model. If not provided, context window will be determined dynamically. | integer | | True | |
| weight_decay_multiplier | Weight Decay Multiplier for training. Not applicable for embedding finetuning | number | | True | |
| completion_override | | boolean | | True | |
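
The constraints in the table above (required inputs, the `task_type` enum, the 18-character `suffix` limit, and the `model` default) can be checked before a job is submitted. The following is a minimal sketch using only the Python standard library. The helper `build_inputs` and the constant names are hypothetical and are not part of the component or of any Azure SDK; they only restate what the table says.

```python
from typing import Any, Dict

# Values transcribed from the Inputs table. The helper below is a
# hypothetical illustration, not part of the aoai_finetuning component.
REQUIRED_INPUTS = ("endpoint_name", "model", "task_type")
TASK_TYPES = ("chat", "completion")
DEFAULTS = {"model": "gpt-35-turbo-0613"}
MAX_SUFFIX_LENGTH = 18
INTEGER_INPUTS = ("n_epochs", "batch_size", "lora_dim", "n_ctx")
NUMBER_INPUTS = ("learning_rate_multiplier", "weight_decay_multiplier")


def build_inputs(**overrides: Any) -> Dict[str, Any]:
    """Merge overrides onto the documented defaults and validate the result."""
    # Inputs passed as None are treated as "not provided".
    provided = {k: v for k, v in overrides.items() if v is not None}
    inputs = {**DEFAULTS, **provided}

    missing = [name for name in REQUIRED_INPUTS if name not in inputs]
    if missing:
        raise ValueError(f"missing required inputs: {', '.join(missing)}")

    if inputs["task_type"] not in TASK_TYPES:
        raise ValueError(f"task_type must be one of {TASK_TYPES}")

    suffix = inputs.get("suffix")
    if suffix is not None and len(suffix) > MAX_SUFFIX_LENGTH:
        raise ValueError(f"suffix is limited to {MAX_SUFFIX_LENGTH} characters")

    for name in INTEGER_INPUTS:
        if name in inputs:
            value = inputs[name]
            # bool is a subclass of int in Python, so reject it explicitly.
            if isinstance(value, bool) or not isinstance(value, int):
                raise TypeError(f"{name} must be an integer")

    for name in NUMBER_INPUTS:
        if name in inputs:
            value = inputs[name]
            if isinstance(value, bool) or not isinstance(value, (int, float)):
                raise TypeError(f"{name} must be a number")

    return inputs
```

For example, `build_inputs(endpoint_name="my-aoai", task_type="chat", n_epochs=3)` returns a dictionary in which `model` falls back to `gpt-35-turbo-0613`; the endpoint name here is a placeholder. Optional inputs that are left out are simply absent from the result, which matches the table's note that the component determines them dynamically.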

Outputs

| Name | Description | Type |
| --- | --- | --- |
| aoai_finetuning_output | Contains finetuned model id in output file in JSON/custom class format | uri_file |

Environment

azureml://registries/azureml-1p-preview/environments/aoai-data-upload-finetune/versions/5
