environments acft hf nlp data import - Azure/azureml-assets GitHub Wiki
Environment used by Hugging Face NLP Finetune components
Version: 27
Preview MaaS DataImport
View in Studio: https://ml.azure.com/registries/azureml/environments/acft-hf-nlp-data-import/version/27
Docker image: mcr.microsoft.com/azureml/curated/acft-hf-nlp-data-import:27
# openmpi image
FROM mcr.microsoft.com/azureml/openmpi5.0-ubuntu24.04:20260409.v4
USER root
# sudo is expected by Singularity inside the image
# Security: upgrade all OS packages and install security-patched system libraries
RUN apt-get update && ACCEPT_EULA=Y apt-get -y upgrade && apt-get install -y sudo libxml2 sqlite3 libc-bin libc-dev locales libc6 dpkg-dev dpkg libdpkg-perl libssl-dev libssl3 openssl
COPY requirements.txt .
RUN pip install -r requirements.txt --no-cache-dir
# pip==26.0.1, setuptools==82.0.1, wheel==0.46.3, cryptography==46.0.5, urllib3==2.6.3, h2==4.3.0
# are already at fixed versions in the openmpi base image (20260315.v1).
# The override below targets only packages that are not fixed in the base image or that requirements.txt pulls in at a vulnerable version.
RUN pip install --no-cache-dir scikit-learn==1.5.1 aiohttp==3.13.3 && rm -rf /root/.cache/pip
# The below file is required for baking the code into the environment
COPY data_import_run.py /azureml/data_import/run.py
# dummy number to change when needing to force rebuild without changing the definition: 2
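
The version pins named in the Dockerfile comments and in the override line can be checked from inside a built image. The following is a minimal sketch, not part of the published environment: the `EXPECTED_PINS` table and the `find_mismatches` helper are hypothetical names introduced here, and the pins are copied from the comments above on the assumption that requirements.txt does not change them afterwards.

```python
from importlib import metadata

# Pins copied from the Dockerfile comments and the override line above.
# Assumption: these are the versions expected in the final image.
EXPECTED_PINS = {
    "pip": "26.0.1",
    "setuptools": "82.0.1",
    "wheel": "0.46.3",
    "cryptography": "46.0.5",
    "urllib3": "2.6.3",
    "h2": "4.3.0",
    "scikit-learn": "1.5.1",
    "aiohttp": "3.13.3",
}


def find_mismatches(expected, get_version=metadata.version):
    """Return {package: (expected, installed)} for every pin that does not match.

    `installed` is None when the package is not installed at all.
    `get_version` is injectable so the check can be exercised without
    the real packages being present.
    """
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = get_version(name)
        except metadata.PackageNotFoundError:
            installed = None
        if installed != wanted:
            mismatches[name] = (wanted, installed)
    return mismatches


if __name__ == "__main__":
    # Print one line per package whose installed version differs from its pin.
    for name, (wanted, installed) in find_mismatches(EXPECTED_PINS).items():
        print(f"{name}: expected {wanted}, found {installed}")
```

Run inside the container, the script prints nothing when every pin matches. Any printed line points to a package that requirements.txt or a later install step has changed.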