components automl_tabular_data_partitioning - Azure/azureml-assets GitHub Wiki

AutoML - Tabular Data Partitioning

automl_tabular_data_partitioning

Overview

Partitions datasets with Spark for the AutoML many models and hierarchical time series solution accelerators.

Version: 0.0.7

View in Studio: https://ml.azure.com/registries/azureml/components/automl_tabular_data_partitioning/version/0.0.7

Inputs

| Name | Description | Type | Default | Optional | Enum |
| --- | --- | --- | --- | --- | --- |
| raw_data | Raw input data. | uri_folder | | | |
| partition_column_names | Partition column names. | string | | | |
| input_type | The input data file type. | string | | | ['csv', 'parquet'] |

Outputs

| Name | Description | Type |
| --- | --- | --- |
| partitioned_data | Spark partitioned data. | uri_folder |
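
To make the relationship between `raw_data`, `partition_column_names`, and `partitioned_data` more concrete, here is a minimal sketch in plain Python of what partitioning a CSV file by column values can look like. It is not the component's implementation, which runs on Spark. The function name `partition_csv`, the `part-0.csv` file name, and the `column=value` folder layout are assumptions made for illustration; that layout mirrors what Spark's `DataFrameWriter.partitionBy` writes, but this page does not confirm the component's actual output layout.

```python
import csv
from collections import defaultdict
from pathlib import Path


def partition_csv(input_file, output_dir, partition_column_names):
    """Split one CSV file into folders, one per distinct partition key.

    Hypothetical helper for illustration only; the real component uses Spark.
    """
    # Group rows by the tuple of values found in the partition columns.
    groups = defaultdict(list)
    with open(input_file, newline="") as f:
        reader = csv.DictReader(f)
        fieldnames = reader.fieldnames
        for row in reader:
            key = tuple(row[c] for c in partition_column_names)
            groups[key].append(row)

    # The partition values are encoded in the folder path (assumed layout),
    # so the partition columns are left out of the written files.
    data_columns = [c for c in fieldnames if c not in partition_column_names]

    written = []
    for key, rows in groups.items():
        part_dir = Path(output_dir).joinpath(
            *(f"{c}={v}" for c, v in zip(partition_column_names, key))
        )
        part_dir.mkdir(parents=True, exist_ok=True)
        out_file = part_dir / "part-0.csv"
        with open(out_file, "w", newline="") as f:
            writer = csv.DictWriter(f, fieldnames=data_columns, extrasaction="ignore")
            writer.writeheader()
            writer.writerows(rows)
        written.append(out_file)
    return sorted(written)
```

Calling `partition_csv("sales.csv", "out", ["store", "brand"])` on a file with `store` and `brand` columns would write one folder per store and brand combination, for example `out/store=A/brand=x/part-0.csv`. Each folder then holds the rows for a single series group, which is the unit that a many models or hierarchical time series workflow trains on.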