environments acft hf nlp data import - Azure/azureml-assets GitHub Wiki

acft-hf-nlp-data-import

Overview

Environment used by Hugging Face NLP Finetune components

Version: 20

Tags

Preview MaaS DataImport

View in Studio: https://ml.azure.com/registries/azureml/environments/acft-hf-nlp-data-import/version/20

Docker image: mcr.microsoft.com/azureml/curated/acft-hf-nlp-data-import:20

Docker build context

Dockerfile

# openmpi image
FROM mcr.microsoft.com/azureml/openmpi5.0-ubuntu24.04:20251002.v1

USER root

# sudo is expected by Singularity inside the image
RUN apt-get update && ACCEPT_EULA=Y apt-get -y upgrade && apt-get install -y sudo libxml2 sqlite3 libc-bin libc-bin libc-dev locales libc6 dpkg-dev dpkg libdpkg-perl libssl-dev libssl3 openssl 

RUN pip install --upgrade pip

COPY requirements.txt .

RUN pip install -r requirements.txt --no-cache-dir

# temp fix for vulnerabilties, remove after pypi release
RUN pip install scikit-learn==1.5.1 h2==4.3.0

# The below file is required for baking the code into the environment 
COPY data_import_run.py /azureml/data_import/run.py

# dummy number to change when needing to force rebuild without changing the definition: 2

⚠️ **GitHub.com Fallback** ⚠️