components multimodal_classification_datapreprocessing - Azure/azureml-assets GitHub Wiki

Multimodal Classification Data Preprocessing

multimodal_classification_datapreprocessing

Overview

Component to preprocess data for multimodal classification task

Version: 0.0.3

View in Studio: https://ml.azure.com/registries/azureml/components/multimodal_classification_datapreprocessing/version/0.0.3

Inputs

Multimodal Classification task arguments

Name Description Type Default Optional Enum
problem_type Specify whether its single-label or multi-label multimodal classification task. string multimodal-classification False ['multimodal-classification-singlelabel', 'multimodal-classification-multilabel']
label_column label column name string False
image_column Image column name string False
drop_columns Set of columns to ignore. Provide string that has comma separated column names. string True
numerical_columns_overrides columns to treat as numerical. Overrides automatic column purpose detection. string True
categorical_columns_overrides columns to treat as categorical. Overrides automatic column purpose detection. string True
text_columns_overrides columns to treat as text. Overrides automatic column purpose detection. string True

Inputs

Name Description Type Default Optional Enum
training_data Enter the train mltable path mltable False
validation_data Enter the validation mltable path mltable False
model_selector_output output folder of model selector containing model metadata like config, checkpoints, tokenizer config uri_folder False

Outputs

Name Description Type
output_dir folder to store preprocessed outputs of input data uri_folder

Environment

azureml://registries/azureml/environments/acft-multimodal-gpu/versions/20

⚠️ **GitHub.com Fallback** ⚠️