environments acft group relative policy optimization - Azure/azureml-assets GitHub Wiki

acft-group-relative-policy-optimization

Overview

Environment used by the Group Relative Policy Optimization (GRPO) component.

Version: 8

Tags

Preview

View in Studio: https://ml.azure.com/registries/azureml/environments/acft-group-relative-policy-optimization/version/8

Docker image: mcr.microsoft.com/azureml/curated/acft-group-relative-policy-optimization:8
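To reference this environment from code rather than from Studio, the registry, name, and version in the Studio link can be rearranged into an `azureml://` asset ID. The sketch below assumes the commonly used registry asset ID layout `azureml://registries/<registry>/environments/<name>/versions/<version>`; the helper name `studio_url_to_asset_id` is hypothetical and not part of any SDK.

```python
from urllib.parse import urlparse


def studio_url_to_asset_id(url: str) -> str:
    """Convert a Studio registry-environment URL into an azureml:// asset ID.

    Hypothetical helper: it only rearranges the path segments of the URL
    and does not contact Azure.
    """
    parts = urlparse(url).path.strip("/").split("/")
    # Expected path: registries/<registry>/environments/<name>/version/<version>
    if (
        len(parts) != 6
        or parts[0] != "registries"
        or parts[2] != "environments"
        or parts[4] != "version"
    ):
        raise ValueError(f"Unrecognized Studio environment URL: {url}")
    registry, name, version = parts[1], parts[3], parts[5]
    return f"azureml://registries/{registry}/environments/{name}/versions/{version}"


STUDIO_URL = (
    "https://ml.azure.com/registries/azureml/environments/"
    "acft-group-relative-policy-optimization/version/8"
)
ASSET_ID = studio_url_to_asset_id(STUDIO_URL)
```

The resulting string is the kind of value typically passed as the `environment` argument when defining a job or component with the Azure ML Python SDK; check the SDK documentation for the exact form your version expects.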

Docker build context

Dockerfile

#PTCA image
FROM mcr.microsoft.com/aifx/acpt/stable-ubuntu2204-cu126-py310-torch271:biweekly.202508.2

USER root

RUN pip install --upgrade pip

COPY requirements.txt .
RUN pip install -r requirements.txt --no-cache-dir

RUN pip install azureml-acft-common-components==0.0.79
RUN pip install numpy==2.2.5
RUN pip install azureml-evaluate-mlflow==0.0.79
RUN pip install mlflow==3.1.0
RUN pip install transformers==4.53.0

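# Upgrade requests and urllib3 in the base conda Python (3.13) to address known vulnerabilities.
# `|| true` keeps the build from failing if this interpreter is missing or the upgrade fails.
RUN /opt/conda/bin/python3.13 -m pip install --upgrade requests urllib3 || true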

# Clean the pip cache
RUN rm -rf ~/.cache/pip
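
The Dockerfile pins five packages after installing `requirements.txt`, and a later install step can in principle change a version resolved by an earlier one. A minimal sketch for checking the pins from inside a running container follows. The `PINS` dictionary is copied from the `RUN pip install` lines above; the function name `find_mismatches` and its `get_version` parameter are hypothetical and exist only for illustration.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions pinned by the Dockerfile's explicit `pip install` lines.
PINS = {
    "azureml-acft-common-components": "0.0.79",
    "numpy": "2.2.5",
    "azureml-evaluate-mlflow": "0.0.79",
    "mlflow": "3.1.0",
    "transformers": "4.53.0",
}


def find_mismatches(pins, get_version=version):
    """Return {package: (expected, installed)} for every pin that is not met.

    `installed` is None when the package is not installed at all.
    `get_version` defaults to importlib.metadata.version and can be
    swapped out for testing.
    """
    mismatches = {}
    for name, expected in pins.items():
        try:
            installed = get_version(name)
        except PackageNotFoundError:
            installed = None
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches
```

Calling `find_mismatches(PINS)` with the interpreter that runs the component returns an empty dictionary when every pin is satisfied. Note that this inspects only the Python environment it runs in, not the separate `/opt/conda/bin/python3.13` interpreter.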