environments ai ml automl dnn - Azure/azureml-assets GitHub Wiki

ai-ml-automl-dnn

Overview

An environment used by Azure ML AutoML for training models.

Version: 46

Tags

OS : Ubuntu20.04 Training Preview OpenMpi : 4.1.0 Python : 3.9

View in Studio: https://ml.azure.com/registries/azureml/environments/ai-ml-automl-dnn/version/46

Docker image: mcr.microsoft.com/azureml/curated/ai-ml-automl-dnn:46

Docker build context

Dockerfile

FROM mcr.microsoft.com/azureml/openmpi5.0-ubuntu24.04:20260614.v1

ENV AZUREML_CONDA_ENVIRONMENT_PATH=/azureml-envs/azureml-automl-dnn
# Prepend path to AzureML conda environment
ENV PATH=$AZUREML_CONDA_ENVIRONMENT_PATH/bin:$PATH

COPY --from=mcr.microsoft.com/azureml/mlflow-ubuntu20.04-py38-cpu-inference:20250506.v1 /var/mlflow_resources/ /var/mlflow_resources/

ENV MLFLOW_MODEL_FOLDER="mlflow-model"

ENV ENABLE_METADATA=true

# Upgrade OS packages to pick up current Ubuntu security errata.
RUN apt-get update && \
    DEBIAN_FRONTEND=noninteractive apt-get -y upgrade --fix-missing && \
    apt-get clean && rm -rf /var/lib/apt/lists/*

# Security: idna>=3.16 fixes GHSA-65pc-fj4g-8rjx / CVE-2026-45409 in
# /opt/miniconda. Parent research (2026-05-26): base requests 2.34.1 requires
# idna<4,>=2.5; latest public requests 2.32.5, observed internal requests 2.34.2,
# and yarl 1.22.0 still keep loose idna ranges, so parent upgrades cannot force
# idna>=3.15. Direct override required for the base miniconda copy.
RUN /opt/miniconda/bin/pip install --no-cache-dir --upgrade 'idna>=3.16' && \
    /opt/miniconda/bin/python -c "import idna; version=tuple(map(int, idna.__version__.split('.'))); assert version >= (3, 16), idna.__version__"

# begin conda create
# Create conda environment (minimal — packages installed via pip to avoid solver OOM)
RUN conda create -p $AZUREML_CONDA_ENVIRONMENT_PATH \
    python=3.10 \
    -c conda-forge && \
    conda clean -a -y

# Install cudatoolkit via conda (not available on pip; single-package solve is trivial)
RUN conda install -p $AZUREML_CONDA_ENVIRONMENT_PATH \
    cudatoolkit=10.0.130 \
    -c nvidia -c conda-forge && \
    conda clean -a -y

# Install scientific packages via pip (avoids conda solver OOM)
RUN conda run -p $AZUREML_CONDA_ENVIRONMENT_PATH pip install --no-cache-dir \
    'numpy>=1.23.5,<1.24' \
    'py-cpuinfo==5.0.0' \
    'joblib==1.2.0' \
    'cloudpickle==1.6.0' \
    'scikit-learn==1.5.1' \
    'pandas>=1.5.3,<1.6' \
    'holidays==0.29' \
    'setuptools-git' \
    'setuptools==82.0.1' \
    'wheel>=0.46.2' \
    'torch==2.8.0' \
    'scipy==1.10.1' \
    'psutil>5.0.0,<6.0.0' \
    'pip>=26.1'
# end conda create

# begin pip install
# Install pip dependencies
# GitPython>=3.1.50 overrides the transitive copy pulled in by mlflow-skinny ->
# databricks-sdk -> gitpython (also pulled by azureml-* telemetry helpers).
# Required for GHSA-7545-fcxq-7j24, GHSA-mv93-w799-cj2w, GHSA-v87r-6q3f-2j67;
# parent packages still allow the older versions, so this direct floor is needed.
RUN pip install \
                # begin pypi dependencies
                azureml-core==1.61.0.post4 \
                azureml-mlflow==1.62.0.post3 \
                azureml-pipeline-core==1.62.0 \
                azureml-telemetry==1.62.0 \
                azureml-interpret==1.62.0 \
                azureml-responsibleai==1.62.0 \
                azureml-automl-core==1.62.0.post3 \
                azureml-automl-runtime==1.62.0.post1 \
                azureml-train-automl-client==1.62.0 \
                azureml-train-automl-runtime==1.62.0 \
                azureml-dataset-runtime==1.62.0 \
                azureml-defaults==1.62.0 \
                'inference-schema' \
                'mlflow-skinny>=2.16.0' \
                'cmdstanpy==1.0.4' \
                'prophet==1.1.4' \
                'xgboost==1.5.2' \
                'mltable>=1.0.0' \
                'pytorch-transformers==1.0.0' \
                'GitPython>=3.1.50' \
                'spacy==3.7.4' \
                'pillow>=12.1.1' \
                'https://aka.ms/automl-resources/packages/en_core_web_sm-3.7.1.tar.gz'
                # end pypi dependencies
# end pip install

# Fix vulnerabilities - security overrides for transitive dependencies.
# Parent research refreshed 2026-05-26; direct floors are kept only where the
# current parent packages still allow vulnerable lower versions.
#
# distributed>=2026.1.0    CVE-2026-23528  XSS-to-RCE via Dask dashboard proxy
#                          Chain: azureml-train-automl-runtime 1.62.0 ->
#                          dask[complete]<=2023.2.0 -> distributed. Latest dask
#                          only aligns distributed to the dask release; it does
#                          not force this security floor.
#
# protobuf>=5.29.6         CVE-2025-4565   DoS via recursive protobuf messages (pure-Python)
#                          Chain (L1): mlflow-skinny -> protobuf. Latest
#                          mlflow-skinny 3.12.0 still allows protobuf>=3.12.0.
#                          Chain (L2): azureml-automl-runtime -> onnxruntime -> protobuf
#
# cryptography>=46.0.5     CVE-2026-26007  EC subgroup validation bypass (ECDH key leak)
#                          Chain (L1): mltable -> cryptography
#                          Chain (L2): azureml-core -> paramiko -> cryptography.
#                          Latest mltable 1.6.3 and paramiko 3.5.1 keep loose
#                          cryptography floors.
#
# bokeh>=3.8.2             GHSA-793v-589g-574v  conda env installs 2.4.3, pip can't auto-upgrade
#                          Chain (L1): azureml-train-automl-runtime 1.62.0 ->
#                          bokeh<3.0.0 and dask[complete]<=2023.2.0.
#
# onnx>=1.21.0             GHSA-cmw6-hcpp-c6jp, GHSA-538c-55jv-c5g9, GHSA-q56x-g2fj-4rj6,
#                          GHSA-p433-9wv8-28xj, GHSA-3r9x-f23j-gc73, GHSA-hqmj-h5c6-369m
#                          azureml-automl-runtime pins onnx<=1.17.0; latest release (1.62.0)
#                          still carries that upper bound, so a parent-package upgrade cannot
#                          resolve the CVEs.  Override required.
#                          Chain (L1): azureml-automl-runtime -> onnx
#
# urllib3>=2.7.0           GHSA-mf9v-mfxr-j63j, GHSA-qccp-gfcp-xxvc
#                          Chain (L1): requests -> urllib3  (pulled by ~every azureml-* pkg,
#                          mlflow-skinny, mltable, azure-* SDKs).  Latest `requests` 2.32.5
#                          still declares `urllib3<3`, never `>=2.7`, so upgrading any
#                          intermediate parent cannot raise the floor.  Direct override required.
#
# ujson>=5.12.1            GHSA-c38f-wx89-p2xg  (decode buffer overflow)
#                          Current parent metadata does not declare a tight ujson
#                          floor, so the floor can only be raised via a direct
#                          override here.
RUN pip install --upgrade 'distributed>=2026.1.0' 'protobuf>=5.29.6' 'cryptography>=46.0.5' \
    'bokeh>=3.8.2' \
    'onnx>=1.21.0' \
    'urllib3>=2.7.0' \
    'ujson>=5.12.1'
RUN rm -rf /opt/miniconda/pkgs/

ENV LD_LIBRARY_PATH=$AZUREML_CONDA_ENVIRONMENT_PATH/lib:$LD_LIBRARY_PATH
⚠️ **GitHub.com Fallback** ⚠️