components oss_text_generation_data_import - Azure/azureml-assets GitHub Wiki

OSS Text Generation Data Import

oss_text_generation_data_import

Overview

FTaaS component to copy user training data to output

Version: 0.0.25

View in Studio: https://ml.azure.com/registries/azureml/components/oss_text_generation_data_import/version/0.0.25

Inputs

Name Description Type Default Optional Enum
task_name Finetune task name. string ChatCompletion ['ChatCompletion', 'TextGeneration']
train_file_path Path to the registered training data asset. The supported data formats are jsonl, json, csv, tsv and parquet. uri_file
validation_file_path Path to the registered validation data asset. The supported data formats are jsonl, json, csv, tsv and parquet. uri_file True

Validation parameters

Name Description Type Default Optional Enum
system_properties Validation parameters propagated from pipeline. string True
user_column_names Comma separated list of column names to be used for training. string True

Outputs

Name Description Type
output_dataset Output dataset with train and validation data uri_folder

Environment

azureml://registries/azureml/environments/acft-hf-nlp-data-import/versions/4

⚠️ **GitHub.com Fallback** ⚠️