components multimodal_classification_datapreprocessing - Azure/azureml-assets GitHub Wiki
Component to preprocess data for a multimodal classification task.
Version: 0.0.3
View in Studio: https://ml.azure.com/registries/azureml/components/multimodal_classification_datapreprocessing/version/0.0.3
Multimodal Classification task arguments
Name | Description | Type | Default | Optional | Enum |
---|---|---|---|---|---|
problem_type | Specifies whether the task is single-label or multi-label multimodal classification. | string | multimodal-classification | False | ['multimodal-classification-singlelabel', 'multimodal-classification-multilabel'] |
label_column | Label column name | string | | False | |
image_column | Image column name | string | | False | |
drop_columns | Set of columns to ignore. Provide a string of comma-separated column names. | string | | True | |
numerical_columns_overrides | Columns to treat as numerical. Overrides automatic column purpose detection. | string | | True | |
categorical_columns_overrides | Columns to treat as categorical. Overrides automatic column purpose detection. | string | | True | |
text_columns_overrides | Columns to treat as text. Overrides automatic column purpose detection. | string | | True | |
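The arguments above can be sanity-checked before a job is submitted. The following is a minimal sketch, not the component's own code: `split_columns` and `check_arguments` are hypothetical helper names, and it assumes the three `*_columns_overrides` arguments use the same comma-separated format documented for `drop_columns`. Note that the default listed for `problem_type` (`multimodal-classification`) is not one of the enum values, so the sketch validates against the enum only.

```python
from typing import Dict, List, Optional

# Allowed values copied from the Enum column of the arguments table.
ALLOWED_PROBLEM_TYPES = (
    "multimodal-classification-singlelabel",
    "multimodal-classification-multilabel",
)

# Assumption: overrides are comma-separated strings, like drop_columns.
OVERRIDE_KEYS = (
    "numerical_columns_overrides",
    "categorical_columns_overrides",
    "text_columns_overrides",
)


def split_columns(value: Optional[str]) -> List[str]:
    """Split a comma-separated column string, trimming spaces and dropping blanks."""
    if not value:
        return []
    return [name.strip() for name in value.split(",") if name.strip()]


def check_arguments(
    problem_type: str,
    label_column: str,
    image_column: str,
    drop_columns: Optional[str] = None,
    numerical_columns_overrides: Optional[str] = None,
    categorical_columns_overrides: Optional[str] = None,
    text_columns_overrides: Optional[str] = None,
) -> Dict[str, List[str]]:
    """Hypothetical pre-submission check; returns the parsed column lists."""
    if problem_type not in ALLOWED_PROBLEM_TYPES:
        raise ValueError(f"problem_type must be one of {ALLOWED_PROBLEM_TYPES}")
    if not label_column or not image_column:
        raise ValueError("label_column and image_column are required")

    parsed = {
        "drop_columns": split_columns(drop_columns),
        "numerical_columns_overrides": split_columns(numerical_columns_overrides),
        "categorical_columns_overrides": split_columns(categorical_columns_overrides),
        "text_columns_overrides": split_columns(text_columns_overrides),
    }

    # Assumption: a column should appear under at most one override.
    seen: Dict[str, str] = {}
    for key in OVERRIDE_KEYS:
        for column in parsed[key]:
            if column in seen:
                raise ValueError(
                    f"column {column!r} is listed in both {seen[column]} and {key}"
                )
            seen[column] = key
    return parsed
```

For example, `check_arguments("multimodal-classification-singlelabel", "label", "image", drop_columns="id, url")` returns a dictionary whose `drop_columns` entry is `["id", "url"]`.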
Inputs
Name | Description | Type | Default | Optional | Enum |
---|---|---|---|---|---|
training_data | Enter the train mltable path | mltable | | False | |
validation_data | Enter the validation mltable path | mltable | | False | |
model_selector_output | Output folder of the model selector, containing model metadata such as config, checkpoints, and tokenizer config | uri_folder | | False | |
Outputs
Name | Description | Type |
---|---|---|
output_dir | Folder to store the preprocessed outputs of the input data | uri_folder |
Environment
azureml://registries/azureml/environments/acft-multimodal-gpu/versions/20
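The environment reference above appears to follow the pattern `azureml://registries/<registry>/<asset type>/<name>/versions/<version>`. As a rough sketch of how such a reference could be split into its parts (for logging or for pinning versions in a script), the snippet below uses a regular expression inferred from that single example. `RegistryAsset` and `parse_registry_asset` are hypothetical names, not part of any Azure SDK.

```python
import re
from typing import NamedTuple


class RegistryAsset(NamedTuple):
    registry: str
    asset_type: str
    name: str
    version: str


# Pattern inferred from the environment reference on this page (an assumption,
# not an official grammar for azureml:// URIs).
_ASSET_URI = re.compile(
    r"^azureml://registries/(?P<registry>[^/]+)"
    r"/(?P<asset_type>[^/]+)"
    r"/(?P<name>[^/]+)"
    r"/versions/(?P<version>[^/]+)$"
)


def parse_registry_asset(uri: str) -> RegistryAsset:
    """Split a registry asset reference into registry, asset type, name, version."""
    match = _ASSET_URI.match(uri)
    if match is None:
        raise ValueError(f"not a registry asset reference: {uri!r}")
    return RegistryAsset(**match.groupdict())


asset = parse_registry_asset(
    "azureml://registries/azureml/environments/acft-multimodal-gpu/versions/20"
)
print(asset.name, asset.version)
```

The version is kept as a string because component versions such as `0.0.3` are not plain integers.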